Read and listen to beautifully punctuated transcripts of any YouTube video, in sync with audio (and full video in landscape)
Cost / License
- Freemium
- Proprietary
Platforms
- iPhone
- iPad
- Google Chrome
- Online




Whisper is described as 'End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification' and is an audio transcription tool in the audio & music category. There are more than 100 alternatives to Whisper for a variety of platforms, including Mac, Web-based, Windows, iPhone and iPad apps. The best Whisper alternative is Handy STT, which is both free and Open Source. Other great apps like Whisper are Vibe Transcribe, Voxtral, FUTO Voice Input and TypeWhisper.




Speakr is a personal, self-hosted web application designed for transcribing audio recordings (like meetings), generating concise summaries and titles, and interacting with the content through a chat interface. Keep all your meeting notes and insights securely on your own server.



Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.
The AudioNotes app lets you record, transcribe, and enhance audio from anywhere using AI, whether you're capturing thoughts, ideas, interviews, meetings, or lectures.




Transcripo is an intuitive tool crafted for converting audio and video files into precise text transcriptions. It supports multiple languages, most video/audio file formats and offers features such as AI-generated summaries and subtitle exports for videos.


Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.

Provides automatic and human-powered transcription for audio and video files in over 119 languages, enables accurate subtitles, translates transcripts, integrates with major communication tools, and supports team collaboration and review for accessible content.



Transcriboar is a lightweight Android transcription app that uses the device’s built-in SpeechRecognizer to convert speech to text in real time.




Transcripts are your new secret weapon. Access the full potential of your audio and video content by converting it to searchable, editable interactive transcripts with Trint.
Glasscribe is a lightweight macOS menu bar app that transcribes speech in real time — entirely on your device. Built on Apple's native Speech framework (macOS 26 Tahoe), it captures both system audio and microphone input across 22+ languages with real-time on-device...
Whisper Dictator is a free desktop dictation application that runs entirely on your computer. Powered by OpenAI's Whisper AI model, it converts speech to text with high accuracy in over 90 languages — without sending any data to the cloud.

Live Subtitles is an AI live caption and dual subtitles app that transcribes any audio and translates it into 30+ languages in real time — live captions and bilingual subtitles for Zoom, Teams, YouTube, Netflix, Twitch, Discord and any app on Windows, macOS and iPhone.



