A privacy-focused online video editor that processes videos entirely in your browser using FFmpeg and WebAssembly. No uploads, no signups required.
Cost / License
- Free
- Open Source (MIT)
Platforms
- Online

Voxtral is described as 'State-of-the-art speech models with transcription, translation, and audio understanding, available via API or self-hosted, optimized for cost and efficiency' and is a audio transcription tool in the ai tools & services category. There are more than 100 alternatives to Voxtral for a variety of platforms, including Mac, Web-based, Windows, iPhone and Linux apps. The best Voxtral alternative is Handy STT, which is both free and Open Source. Other great apps like Voxtral are Vibe Transcribe, FUTO Voice Input, TypeWhisper and Google AI Edge Eloquent.
A privacy-focused online video editor that processes videos entirely in your browser using FFmpeg and WebAssembly. No uploads, no signups required.

Wave is a lightweight, native macOS dictation app focused on fast voice-to-text workflows with minimal UI overhead. Press a shortcut, speak, and your words are instantly pasted at the cursor. Supports on-device transcription via Whisper and cloud transcription via Groq, plus an...



NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.


The local voice AI app. Generate lifelike voiceovers, clone voices, create audiobooks, build multi-speaker conversations, and run private local voice workflows without uploading your scripts.




Writevoice lets you write at the speed of thought. Click record, speak naturally, and get clean, accurate text ready for docs, tickets, or your CRM. It’s fast, precise, and privacy-first: we never store your recordings or transcripts.


Transform your manuscript into a professional audiobook with Narratory's cutting-edge AI technology. This platform offers AI-powered audiobook narration that delivers natural-sounding voices, making your audiobook creation process seamless and efficient.


Hold-to-talk speech-to-text for macOS. 100% local, powered by WhisperKit and local LLM cleanup. Hold Control to record, release to transcribe and paste.
Voice dictation for Mac and Windows. Press a hotkey, speak, paste into any text field. Audio stays on your device — no cloud, no account, unlimited free tier.



Make an AI copy of your voice that keeps your tone and accent. Our voice cloning tech lets you create natural-sounding speech for videos and podcasts by reading a short text.

VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


Record meetings, lectures, and podcasts. Transcribe in 10+ languages with on-device Apple models. Get ChatGPT-powered summaries via Apple Intelligence — no subscriptions.




Local speech-to-text app for Windows 10/11 using Whisper AI, transcribes audio and video files offline, protects privacy, supports 90+ languages and multiple formats, offers GPU acceleration, drag-and-drop, and exports to SRT, VTT, TXT, or LRC formats.


