Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

BetterDictation is described as 'Is your personal scribe. You speak, and it will quickly and flawless transcribe into any app' and is a audio transcription tool in the audio & music category. There are more than 50 alternatives to BetterDictation for a variety of platforms, including Web-based, Mac, Windows, iPhone and SaaS apps. The best BetterDictation alternative is FUTO Voice Input, which is both free and Open Source. Other great apps like BetterDictation are Whisper, Moonshine AI, MacWhisper and Aqua Voice.
Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

Letterly is a mobile app that converts any speech to clear and well-structured text. It's more than just a transcription. With the help of AI, you can transform your voice into structured notes, catchy social media posts, readable meeting summaries, formal emails and much more




CMU Sphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.
Windows Speech Recognition makes using a keyboard and mouse optional. You can control your PC with your voice and dictate text instead.
Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Write with your voice in any app on macOS. Faster and more accurate than ChatGPT, Google and OpenAI Whisper. Start talking. Stop typing.

High-quality on-device transcription. Easily convert speech to text from meetings, lectures, and more.



Turn your voice memos into organized text! Just talk & let the AI create lists, blog post and more for you!.

Read and listen to beautifully punctuated transcripts of any YouTube video, in sync with audio (and full video in landscape)




Bring structure to your meetings and save time using speech to text notes. Turn the team's conversation into meaningful notes. Easy to use workspace provides everything you need for productive meetings: AI speech to text transcription, real-time editor, agenda timeboxing and...

Aiko is similar in that it uses the Whisper model to transcribe speech into text, but it only does so using recordings, not real time dictation into any text field. With Aiko you have to record all of what you want to say and then copy paste it into the place you need the text, whereas BetterDictation transcribes what you are saying directly into the selected text field in any other app.