VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




The best free alternative to HeyGen is VoiceCraft, which is also Open Source. If that doesn't suit you, our users have ranked more than 50 alternatives to HeyGen and many of them is free so hopefully you can find a suitable replacement. Other interesting free alternatives to HeyGen are X to Voice, NaturalReader, Mirage Studio and Audiomatic.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



Meet the world's first AI model designed to generate UGC content. Mirage by Captions generates original actors with natural expressions and body language—completely free from licensing restrictions.




Audiomatic is a web app that seamlessly translates videos into other languages. Our state-of-the-art pipeline delivers contextually-accurate dubbed translations that preserve the tone, style, and emotion of the original speakers.



Amazon Polly uses deep learning technologies to synthesize natural-sounding human speech, so you can convert articles to speech. With dozens of lifelike voices across a broad set of languages, use Amazon Polly to build speech-activated applications.



Free open source AI voice cloning and text to speech synthesis. Clone a voice in 5 seconds to generate arbitrary speech in real-time.
Choose from 60+ human-like, emotional voices in various accents, languages, and characters to turn any text into a commercial-grade audio. Or Clone your own voice.





Convert text into realistic speech or short-form video with synthetic AI voices, over 700 options in 65+ languages, automatic subtitles, and fast web-based tools ideal for social, educational, e-learning, and marketing content while improving accessibility.




Resemble AI is a synthetic voice AI company that supercharges your cloned voice with a text-to-speech generator paired with real-time APIs to build immersive experiences. Resemble AI has support for 44 kHz voices and includes low latency API's.





