Record meetings, lectures, and podcasts. Transcribe in 10+ languages with on-device Apple models. Get ChatGPT-powered summaries via Apple Intelligence — no subscriptions.




Audiotype - Audio & Video Transcription is described as 'Audiotype is a transcription software that convert audio and video file into editable text transcript and subtitles. More than +10 000 users use Audiotype to transcribe their media files (video, podcast, recordings, MP4, MP3, interviews) into exportable transcripts or subtitles' and is an audio transcription tool in the audio & music category. There are more than 100 alternatives to Audiotype - Audio & Video Transcription, including websites and apps for a variety of platforms such as Mac, iPhone, Windows and iPad. The best Audiotype - Audio & Video Transcription alternative is Handy STT, which is both free and Open Source. Other great sites and apps similar to Audiotype - Audio & Video Transcription are Vibe Transcribe, FUTO Voice Input, Voxtral and TypeWhisper.




AI-driven online converter transcribes uploaded audio files into accurate text, supporting multiple languages and dialects. Works fully in-browser, requires no registration, offers fast speech recognition, and supports formats suited for interviews or meetings.

Local speech-to-text app for Windows 10/11 using Whisper AI, transcribes audio and video files offline, protects privacy, supports 90+ languages and multiple formats, offers GPU acceleration, drag-and-drop, and exports to SRT, VTT, TXT, or LRC formats.



NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.


Voice-first AI for meeting notes, voice notes, and dictation. 5× faster than typing. Just speak, and it's done.


WordWand is a system-wide AI assistant for macOS that works in any app through a single keyboard shortcut. No copy-pasting, no tab switching — just select text, press a hotkey, and transform it instantly.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


Transcribes video speech into subtitles using advanced AI models with multilingual translation, live subtitle editing and preview, robust quality checks, support for offline processing, customizable exports as SRT, ASS, or burnt-in, suitable for creators and professionals.


Wave is a lightweight, native macOS dictation app focused on fast voice-to-text workflows with minimal UI overhead. Press a shortcut, speak, and your words are instantly pasted at the cursor. Supports on-device transcription via Whisper and cloud transcription via Groq, plus an...



Hold-to-talk speech-to-text for macOS. 100% local, powered by WhisperKit and local LLM cleanup. Hold Control to record, release to transcribe and paste.