Cartesia Sonic-3.5 (Cartesia)
|
EVI 3 (Hume AI)
|
|||||
About
Sonic 3.5 is Cartesia’s fastest, most natural text-to-speech model, built for expressive, real-time voice generation with sub-90ms latency and native support for 42 languages, each intended to deliver native-quality speech. It is designed to follow transcripts faithfully, to voice confirmation codes and heteronyms correctly without preprocessing, and to stay expressive enough to carry a real conversation. Sonic 3.5 focuses on clean audio across every language and voice, with no artifacts to edit out, which makes it practical for production voice experiences where quality, speed, and consistency matter. Its conversational delivery offers strong pacing and real emotional range, tuned for support and agent transcripts. Alphanumerics such as order numbers, phone numbers, IDs, and emails are spoken naturally in every language, while context-aware English pronunciation helps heteronyms like read, bass, and bow land correctly based on the surrounding text.
|
About
Hume AI's EVI 3 is a third-generation speech-language model that streams in user speech and forms natural, expressive speech and language responses. At conversational latency, it produces the same quality of speech as Hume's text-to-speech model, Octave, while responding with the same intelligence as the most advanced LLMs of similar latency. It also communicates with reasoning models and web search systems as it speaks, “thinking fast and slow” to match the intelligence of any frontier AI system. EVI 3 can instantly generate new voices and personalities instead of being limited to a handful of speakers. For instance, users can speak to any of the more than 100,000 custom voices already created on Hume's text-to-speech platform, each with an inferred personality. No matter the voice, it responds with a wide range of emotions or styles, implicitly or on command.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Voice AI developers who need low-latency, multilingual text-to-speech for realistic support agents, conversational apps, and production voice experiences
|
Audience
Developers and businesses in search of a solution to integrate emotionally intelligent, real-time voice AI into their applications
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company Information: Cartesia
Founded: 2023
United States
docs.cartesia.ai/build-with-cartesia/tts-models/latest
|
Company Information: Hume AI
Founded: 2021
United States
www.hume.ai/blog/introducing-evi-3
|
|||||
Integrations
Hume AI
|
