VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




All Voice Lab is described as 'An AI-powered platform revolutionizing voice creation with cutting-edge technology. We provide advanced audio solutions for creators and businesses worldwide' and is a Text to Speech service in the ai tools & services category. There are more than 25 alternatives to All Voice Lab for a variety of platforms, including Web-based, Android, Windows, Linux and Mac apps. The best All Voice Lab alternative is VoiceCraft, which is both free and Open Source. Other great apps like All Voice Lab are ElevenReader, ElevenLabs, X to Voice and Kokoro.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Convert text, articles, PDFs, or ePubs to streaming audio in over 32 languages with ultra-realistic AI voices, voice customization, playback speed control, synced text highlights, and support for diverse content sources, designed for accessible, mobile listening.




ElevenLabs uses AI to deliver natural, expressive speech for diverse applications such as podcasts and videos. It features a user-friendly interface, customizable intonation, and offers seamless API integration. Privacy, scalability, and multilingual capabilities enhance its adaptability.




Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.



Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


SherpaTTS is an Android Text-to-Speech engine based on Next-gen Kaldi using Piper or Coqui voices.


Transform text into speech with natural synthesis, offering smooth and fine-tuned audio export. Create high-quality voiceovers, download outputs for diverse applications, and experience excellent synthesis. Supports various languages and operates on multiple platforms.




Audiomatic is a web app that seamlessly translates videos into other languages. Our state-of-the-art pipeline delivers contextually-accurate dubbed translations that preserve the tone, style, and emotion of the original speakers.


