Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.

Gladia is described as 'Production-ready Speech-to-Text API built for teams shipping real-world voice products—delivering high accuracy, multilingual coverage, real-time + async transcription, and a growing set of add-ons (diarization, translation, summarization, sentiment, formatting, and more)' and is an audio transcription tool in the audio & music category. There are more than 50 alternatives to Gladia, including websites and apps for a variety of platforms such as Mac, Windows, iPhone, and self-hosted setups. The best Gladia alternative is Vibe Transcribe, which is both free and open source. Other great sites and apps similar to Gladia are Voxtral, Whisper, TranscribeX, and Moonshine AI.

SpeechPulse is dictation software for Windows 10/11 and Apple Silicon Macs. It can type into any text input, including text editors, web browsers, and office applications. SpeechPulse works fully offline and doesn't require an internet connection.




Convert speech to text efficiently with this easy-to-navigate tool. It offers real-time transcription with secure storage on iCloud and supports 20 languages, from English to Vietnamese.




Power your apps with world-class speech-to-text and domain-specific language models (DSLMs). Effortlessly accurate. Blazing fast. Enterprise-ready scale. Unbeatable pricing. Everything developers need to build with confidence and ship faster.

The Tomedes Free AI Transcription Tool transforms audio and video files into clear, accurate text in seconds. Supporting formats like MP3, MP4, WAV, and more, it offers seamless transcriptions in nearly 100 languages.

Private, on-device audio transcription for macOS. Your audio never leaves your Mac — no cloud uploads, no subscriptions, no data collection. Real-time ASR with Qwen3-ASR, MLX Whisper & Whisper, plus system-wide dictation, all 100% local.








Automatically convert all of your voice recordings into clean, organized, neat text files. Unlimited and free.



This is an iOS application written in Objective-C that helps people work through a piece of audio in order to write it out.




Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.