VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




The best AI Voice Generator alternative to HeyGen is VoiceCraft, which is both free and Open Source. If that doesn't suit you, our users have ranked more than 50 alternatives to HeyGen and eight of them are AI Voice Generators so hopefully you can find a suitable replacement. Other interesting AI Voice Generator alternatives to HeyGen are Voice Engine, X to Voice, Vidnoz AI and LOVO.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Vidnoz AI enables fast text-to-video creation with over 70 lifelike avatars and 100+ realistic voices. It offers pre-designed templates, subtitles, and effects—no editing skills needed. The user-friendly interface and customizable options support learning, social media, and more.




Choose from 60+ human-like, emotional voices in various accents, languages, and characters to turn any text into a commercial-grade audio. Or Clone your own voice.


Convert text into realistic speech or short-form video with synthetic AI voices, over 700 options in 65+ languages, automatic subtitles, and fast web-based tools ideal for social, educational, e-learning, and marketing content while improving accessibility.




Audeus is a text-to-speech app that reads your documents aloud using natural, lifelike voices. Instantly double or triple your reading speed, improve focus, and increase comprehension with synchronized text highlighting. Get started today.



