TurboTTS is a text-to-speech service, described as 'Free text-to-speech online tool. Up to 70 languages, more than 300 types of real-life voices to choose from, simple and easy to use'. There are more than 50 alternatives to TurboTTS, including websites and apps for a variety of platforms such as Windows, Mac, Android and SaaS apps. The best TurboTTS alternative is VoiceCraft, which is both free and open source. Other great sites and apps similar to TurboTTS are ElevenLabs, Read Aloud Extension, Balabolka and X to Voice.
VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data, including audiobooks, internet videos, and podcasts.




ElevenLabs uses AI to deliver natural, expressive speech for applications such as podcasts and videos. It offers a user-friendly interface, customizable intonation, and API integration, along with privacy, scalability, and multilingual capabilities.




Read Aloud is a browser extension for desktop and mobile browsers that reads the content of any web page aloud with one click. It can read in different languages, and speed, pitch and other settings are adjustable.


Balabolka is a Text-To-Speech (TTS) program. All computer voices installed on your system are available to Balabolka. The on-screen text can be saved as a WAV, MP3, MP4, OGG or WMA file. The program can read the clipboard content, view the text from DOC, EPUB, FB2, HTML, ODT...



X to Voice is an open-source tool that analyzes your X/Twitter profile data to generate a custom voice with the ElevenLabs Voice Design API, and integrates with Hedra's video API for an audio-visual result.




Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


SherpaTTS is an Android Text-to-Speech engine based on Next-gen Kaldi using Piper or Coqui voices.


Transform text into speech with natural synthesis, offering smooth and fine-tuned audio export. Create high-quality voiceovers, download outputs for diverse applications, and experience excellent synthesis. Supports various languages and operates on multiple platforms.




We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.