Latest AI news, expert analysis, bold opinions, and key trends — delivered to your inbox.
On May 29, 2025, ElevenLabs officially rolled out Conversational AI 2.0, a massive upgrade to its voice agent platform—just four months after the original launch. Designed for enterprise-grade applications, the update focuses on more natural, intelligent, and secure AI-powered conversations across customer service, sales, healthcare, and more.
Here’s what’s new and why it matters:
The new turn-taking model detects real-time cues—like "um," "ah," or brief silences—to determine when to speak or wait. This eliminates awkward interruptions and makes AI responses feel fluid and human-like. Imagine a customer support agent that knows when you're thinking or searching for your password—that’s the goal.
Users can now switch between voice and text in a single interaction. This is perfect for moments when saying “E as in Echo” isn’t practical—like reading out an email or credit card number. It's all supported across ElevenLabs' HTML widget, SDKs, and WebSocket API.
Voice agents can now pull real-time data from external knowledge bases—instantly and securely. Whether it’s product specs or medical guidelines, agents stay informed and accurate. Latency remains low, and privacy protections are strong.
With batch calling, users can initiate hundreds of outbound voice calls at once. Useful for appointment reminders, surveys, or sales outreach. Each call can be personalized with dynamic variables and monitored in real-time.
AI agents can change voices or personas mid-conversation—ideal for storytelling, gaming, or dramatized customer interactions. The platform also supports automatic language detection, enabling seamless multilingual support.
With HIPAA compliance, EU data residency, and advanced security, ElevenLabs is fully equipped for sensitive sectors like healthcare. They also offer enterprise-level SLAs, dedicated support, unlimited seats, and bulk discounts.
Developers can use Claude, GPT, Gemini, or even plug in custom LLMs via server integration. Whether you're building a customer service agent or a gamified tutor, you can choose the best brain for the job.
Conversational AI 2.0 is optimized for speed, with average latency under 200ms even during 100 simultaneous conversations. It can handle thousands of calls per day—ideal for large teams or high-traffic deployments.
Here’s a snapshot of their latest plans:
Plan | Price | Minutes | Concurrency | Notes |
---|---|---|---|---|
Free | $0/month | 15 | 4 | Requires attribution |
Starter | $5/month | 50 | 6 | $0.12 per extra min |
Creator | $11/month | 250 | 6 | Discounted from $22 |
Pro | $99/month | 1,100 | 10 | $0.11 per extra min |
Scale | $330/month | 3,600 | 20 | $0.10 per extra min |
Business | $1,320/mo | 13,750 | 30 | $0.08 per extra min w/ volume discounts |
Customer Service: 24/7 multilingual support and troubleshooting
Virtual Assistants: Schedules, reminders, info lookups
Retail Support: Personalized recommendations and order tracking
Personalized Learning: One-on-one tutoring and reading assistance
Interactive Storytelling: Multi-voice dramatized experiences
Full SDK and API support (Python, JavaScript, React, Swift)
WebSocket API for real-time customization
Silence optimization: inference cost drops to 5% during pauses >10s
Premade templates let devs launch agents in minutes
Custom analysis dashboards to track performance (e.g., conversions, resolution rates)
Users love the ultra-realistic voices, seamless flow, and fast setup. One developer said they built a working customer support flow in just 20 minutes. Others say the agents feel “100% human.”
Still, challenges remain. Some users report voice quality dips when handling numbers or dates in non-English contexts—e.g., "200,000 apples" read as "20 thousand thousand" or errors in Italian phrasing. ElevenLabs hasn’t addressed this yet but may improve support in future updates.
Conversational AI 2.0 puts ElevenLabs in direct competition with OpenAI’s Realtime API, Vapi, and Retell. ElevenLabs wins on voice variety, customizability, and latency—but some users still prefer OpenAI’s model for character voice adaptability and sound effects, despite its stricter guardrails.
Despite some online noise suggesting ElevenLabs was “falling behind,” the launch of Conversational AI 2.0 shows otherwise. With a rumored $3B+ valuation round in progress and clear momentum in product development, ElevenLabs is doubling down on the future of voice AI.