- Standard models (Multilingual v2, Multilingual v1, English v1) are optimized for quality and accuracy, ideal for content creation. These models offer the best quality and stability but have higher latency.
- Turbo models (Turbo v2.5, Turbo v2) are designed for low-latency applications like real-time conversational AI. They deliver great performance with faster processing speeds, though with a slight trade-off in accuracy and stability.

Model Selection
Standard Models
Eleven Multilingual v2 Our most advanced speech synthesis model, Multilingual v2, offers high stability, diverse language support, and exceptional accuracy in 29 languages. While slower than the Turbo models, it delivers more lifelike speech, making it ideal for content creation such as voiceovers, audiobooks, and post-production.Supported languages
Supported languages
- English (UK)
- English (USA)
- English (Australia)
- English (Canada)
- Japanese
- Chinese
- German
- Hindi
- French (France)
- French (Canada)
- Korean
- Portuguese (Brazil)
- Portuguese (Portugal)
- Italian
- Spanish (Spain)
- Spanish (Mexico)
- Indonesian
- Dutch
- Turkish
- Filipino
- Polish
- Swedish
- Bulgarian
- Romanian
- Arabic (Saudi Arabia)
- Arabic (UAE)
- Czech
- Greek
- Finnish
- Croatian
- Malay
- Slovak
- Danish
- Tamil
- Ukrainian
- Russian
- Best quality
- Unparalleled accuracy
- More stable
- Higher latency
Supported languages
Supported languages
- English (USA)
- English (UK)
- English (Australia)
- English (Canada)
- German
- Polish
- Spanish (Spain)
- Spanish (Mexico)
- Italian
- French (France)
- French (Canada)
- Portuguese (Portugal)
- Portuguese (Brazil)
- Hindi
Turbo Models
Eleven Turbo v2.5 Turbo v2.5 generates speech in 32 languages with low latency, optimized for real-time conversational AI use cases. This model is 300% faster than Multilingual v2 and now supports new languages such as Vietnamese, Hungarian, and Norwegian. It is best for developers requiring rapid, natural speech across multiple languages, but it lacks the stylistic range of Multilingual v2. Latency is as low as 300ms, making it ideal for real-time interactions.- Great quality
- High accuracy with Professional Voice Clones
- Slightly less stable
- Optimized for low latency
Supported languages
Supported languages
- English (USA)
- English (UK)
- English (Australia)
- English (Canada)
- Japanese
- Chinese
- German
- Hindi
- French (France)
- French (Canada)
- Korean
- Portuguese (Brazil)
- Portuguese (Portugal)
- Italian
- Spanish (Spain)
- Spanish (Mexico)
- Indonesian
- Dutch
- Turkish
- Filipino
- Polish
- Swedish
- Bulgarian
- Romanian
- Arabic (Saudi Arabia)
- Arabic (UAE)
- Czech
- Greek
- Finnish
- Croatian
- Malay
- Slovak
- Danish
- Tamil
- Ukrainian
- Russian
- Hungarian
- Norwegian
- Vietnamese
- Great quality
- High accuracy with Professional Voice Clones
- Slightly less stable
- Optimized for low latency
Supported languages
Supported languages
- English (USA)
- English (UK)
- English (Australia)
- English (Canada)