WordWand is a system-wide AI assistant for macOS that works in any app through a single keyboard shortcut. No copy-pasting, no tab switching — just select text, press a hotkey, and transform it instantly.



Vibe Transcribe is described as 'Provides fully offline audio and video transcription with local AI models, batch processing, real-time preview, multi-format export, translation, and privacy on Windows, Linux, and macOS' and is a popular audio transcription tool in the AI tools & services category. There are more than 100 alternatives to Vibe Transcribe for a variety of platforms, including Mac, Web-based, Windows, iPhone, and Linux apps. The best Vibe Transcribe alternative is Handy STT, which is both free and open source. Other great apps like Vibe Transcribe are FUTO Voice Input, Voxtral, OpenWhispr, and TypeWhisper.



NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


An AI transcription solution that converts audio or video to text in over 100 languages. It includes speaker identification, precise timestamps, and instant translation to 145+ languages, and supports imports from 1,000+ platforms, cloud editing, multi-format export, and link-based sharing.




Hold-to-talk speech-to-text for macOS. 100% local, powered by WhisperKit and local LLM cleanup. Hold Control to record, release to transcribe and paste.

A local speech-to-text app for Windows 10/11 that uses Whisper AI to transcribe audio and video files offline. It protects privacy, supports 90+ languages and multiple formats, and offers GPU acceleration, drag-and-drop, and export to SRT, VTT, TXT, or LRC formats.



Vowen is a fast, voice-first productivity app that lets you dictate anywhere, trigger context-aware AI, record meetings, and control your computer with voice. Works on macOS and Windows.




Verbatim turns your voice into polished text fast. It uses Mistral Voxtral Mini for speech-to-text — a transcription model that won our benchmark against GPT-4o mini, Gemini 2.5 Flash, AssemblyAI Universal, and Deepgram Nova on accuracy, and runs around 3x faster than ElevenLabs...




Whisper AI is a speech-to-text and voice-to-text Chrome extension for fast AI voice typing. Convert audio to text, capture transcripts and voice notes, and transcribe with smart cleanup and modes.



