TTS models: ElevenLabs (high quality, expensive), Google Wavenet, Amazon Polly, Microsoft Azure, Reverie (Indian languages), Meta MMS, plus newer open-source like F5-TTS and Kokoro.
Key dimensions: voice quality (naturalness), language support, latency, and cost. Naturalness has improved dramatically — modern TTS is often indistinguishable from human within short clips. Latency under 200ms enables real-time agents.
Indian-language TTS is still uneven. Hindi has good support. Tamil/Telugu/Bengali decent. Marathi/Gujarati/Punjabi/Odia variable. Voice cloning from short samples (ElevenLabs) opens up brand-specific voices — useful for premium customer experience.