ElevenLabs
Industry-leading text-to-speech with ultra-realistic voice cloning, multilingual support, and emotion control.
Undisclosed
Parameters
N/A
Context Window
Proprietary
License
Jan 1, 2024
Released
๐ฐ Pricing
Input
Subscription
per 1M tokens
Output
N/A
per 1M tokens
โก Strengths
๐ฏ Use Cases
API Available
This model is accessible via API for integration into your applications.
๐ ๏ธ Tools Built on ElevenLabs
โญ Related Models
Whisper v4
OpenAI
Latest speech recognition model with improved accuracy across 100+ languages, real-time streaming, and speaker diarization.
Whisper Large v3
OpenAI
Battle-tested speech recognition. Widely deployed, well-supported, excellent accuracy.
OpenAI: GPT Audio
OpenAI
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.