EmotiVoice is a free, open-source, multi-voice, prompt-controlled TTS engine from NetEase Youdao. It supports English and Chinese speech synthesis, more than 2,000 voices, emotional prompt control, and local deployment, and it is aimed at researchers, developers, and creators building voice application prototypes.
EmotiVoice, prompt controlled TTS, emotional text to speech, open source TTS, Chinese TTS, English TTS, multi voice TTS, voice cloning research
Published 1/21/2026
Editorial Review
About EmotiVoice
EmotiVoice: prompt-controlled emotional text to speech
EmotiVoice is an open-source text-to-speech engine from NetEase Youdao for generating expressive English and Chinese speech. Its public materials position it as a multi-voice, prompt-controlled TTS system with more than 2,000 voices and controllable delivery styles.
Key capabilities
Prompt control: guide emotion, speaking style, and delivery instead of only converting text to neutral speech.
Large voice pool: experiment with many voice identities for demos and prototypes.
English and Chinese: useful for bilingual narration, education, and localization tests.
Open-source deployment: run and customize the stack for research or internal prototypes.
Developer use: integrate TTS into bots, reading assistants, games, and content workflows.
Use cases
EmotiVoice fits research demos, audiobook samples, character dialogue, language-learning materials, product prototypes, and conversational agents that need more expressive delivery than a neutral TTS voice. For commercial voice work, review the project's license terms and obtain rights for any voice data or generated persona you use.
GitHub project preview used as the screenshot reference for the open-source EmotiVoice repository.
IndexTTS is Bilibili’s open-source industrial-grade controllable and efficient zero-shot text-to-speech system. It is best for speech researchers and developers who need controllable TTS experiments, not for casual users looking for a polished web voice app.
Index TTS, text to speech, zero-shot TTS
Azure Text to Speech
Microsoft's cloud text-to-speech service, described as one of the most realistic voice tools currently available
text-to-speech
Hailuo AI TTS
Hailuo AI TTS, also tied to MiniMax Audio, is a text-to-speech and voice-generation product for multilingual AI voices, voice cloning, and audio content workflows.
Hailuo AI TTS, MiniMax Audio, text to speech
Coqui TTS
A deep learning toolkit for Text-to-Speech, battle-tested in research and production