VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




LOVO is described as 'Choose from 60+ human-like, emotional voices in various accents, languages, and characters to turn any text into a commercial-grade audio. Or Clone your own voice' and is a Text to Speech service in the ai tools & services category. There are more than 50 alternatives to LOVO for a variety of platforms, including Web-based, SaaS, Windows, Mac and iPhone apps. The best LOVO alternative is VoiceCraft, which is both free and Open Source. Other great apps like LOVO are Balabolka, Speech Note, X to Voice and Kokoro.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Balabolka is a Text-To-Speech (TTS) program. All computer voices installed on your system are available to Balabolka. The on-screen text can be saved as a WAV, MP3, MP4, OGG or WMA file. The program can read the clipboard content, view the text from DOC, EPUB, FB2, HTML, ODT...







Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.

Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


Create AI podcasts by uploading websites, PDFs, or documents, selecting customizable hosts and scripts, generating episodes with outline planning, editing outputs, and publishing audio content—streamlining production for creators without manual recording.




Karaoke and transform any songs in your AI voice. No singing skill required, your AI voice can handle any song even in other languages!.




Effortlessly create videos with digital actors by inputting a script. Use AI for translating and dubbing videos, preserving voice and syncing lips seamlessly to new languages.




Wondercraft AI is a tool that allows users to easily create studio-quality podcasts using generative AI technology. It eliminates the need for extensive recording and scripting by allowing users to record just a 60-second sample of their voice, which the AI uses to clone their...



