
πŸŽ™οΈ Give Your Assistant a Voice

Transform text into natural, human-like speech with our integrated TTS providers. Each provider offers unique advantages for different use cases.

Quick Provider Comparison

⚡ ElevenLabs

Premium Quality & Customization
70+ languages, voice cloning, advanced controls
Best for: High-quality customer interactions

🚀 Deepgram Aura

Ultra-Low Latency
3x faster than competitors, phone-optimized
Best for: Real-time conversations

🎯 Cartesia Sonic 3

Multilingual Excellence
42 languages, voice cloning, low latency
Best for: Multilingual agents, global deployments

☁️ Azure Speech

Enterprise Scale
500+ neural voices, 100+ languages
Best for: Enterprise, Azure ecosystem

🎭 Inworld.ai

AI-Powered Emotions
Multilingual, emotional markup, voice cloning
Best for: Expressive, contextual responses

πŸŽ™οΈ Resemble AI

Custom Voice CreationWebSocket streaming, personalized voicesBest for: Brand-specific voice identity

Feature Matrix

| Provider | Latency | Languages | Voice Cloning | Streaming | Best For |
|---|---|---|---|---|---|
| ElevenLabs | ~250 ms | 70+ | ✅ Advanced | WebSocket | Premium quality |
| Deepgram | ~75 ms | English | ❌ | WebSocket | Speed & phone calls |
| Cartesia | ~150 ms | 42 | ✅ Yes | WebSocket | Multilingual |
| Azure | ~200 ms | 100+ | ❌ | HTTP | Enterprise |
| Inworld | ~200 ms | 11 | ✅ Zero-shot | HTTP/WS | Emotional expression |
| Resemble | ~300 ms | English | ✅ Custom | WebSocket | Brand voices |

New to TTS? Start with ElevenLabs for the best balance of quality and features, Deepgram if speed is your priority, or Cartesia for multilingual support.
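
To make the trade-offs in the matrix concrete, here is a minimal Python sketch that encodes the table as data and shortlists providers by requirement. It is illustrative only: `Provider`, `PROVIDERS`, and `shortlist` are hypothetical names, not part of Burki or any provider SDK. The latency figures are the approximate values from the matrix, "70+" and "100+" are stored as their lower bounds, and "English" is stored as a single language.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Provider:
    name: str
    latency_ms: int      # approximate latency from the feature matrix
    languages: int       # lower bound; English-only is stored as 1
    voice_cloning: bool
    streaming: str


# Values copied from the feature matrix; treat them as rough guidance.
PROVIDERS: List[Provider] = [
    Provider("ElevenLabs", 250, 70, True, "WebSocket"),
    Provider("Deepgram", 75, 1, False, "WebSocket"),
    Provider("Cartesia", 150, 42, True, "WebSocket"),
    Provider("Azure", 200, 100, False, "HTTP"),
    Provider("Inworld", 200, 11, True, "HTTP/WS"),
    Provider("Resemble", 300, 1, True, "WebSocket"),
]


def shortlist(
    max_latency_ms: Optional[int] = None,
    min_languages: int = 1,
    need_cloning: bool = False,
) -> List[str]:
    """Return names of matching providers, fastest first."""
    matches = [
        p for p in PROVIDERS
        if (max_latency_ms is None or p.latency_ms <= max_latency_ms)
        and p.languages >= min_languages
        and (p.voice_cloning or not need_cloning)
    ]
    return [p.name for p in sorted(matches, key=lambda p: p.latency_ms)]
```

For example, `shortlist(need_cloning=True, min_languages=40)` narrows the list to Cartesia and ElevenLabs, while `shortlist(max_latency_ms=100)` leaves only Deepgram. A filter like this only covers the numeric columns; voice quality and pricing still need to be compared on the individual provider pages.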

Setup Overview

All providers follow the same basic setup pattern:
1. Get API Credentials: Sign up with your chosen provider and obtain API keys.
2. Configure in Burki: Add your credentials in the assistant's AI Configuration → TTS tab.
3. Select Voice & Model: Choose from available voices and models for your use case.
4. Fine-tune Settings: Adjust speed, stability, and other provider-specific options.

Provider Deep Dives

🎭 ElevenLabs - Premium Voice Quality

Latest Models: Flash v2.5 (75ms), v3 (70+ languages), Turbo v2.5
Key Features: Advanced voice controls, multilingual support, custom voice creation
Perfect For: Customer service, content creation, multilingual applications
→ Complete ElevenLabs Guide

⚡ Deepgram Aura - Ultra-Fast TTS

Latest Models: Aura-2 (next-gen), Aura (proven)
Key Features: Industry-leading speed, phone optimization, µ-law encoding
Perfect For: Real-time phone calls, live chat, interactive applications
→ Complete Deepgram Guide

🎯 Cartesia Sonic 3 - Multilingual Excellence

Latest Models: sonic-3 (latest), sonic-3-2025-10-27 (stable)
Key Features: 42 languages, voice cloning from ~5 seconds of audio, low-latency WebSocket streaming
Perfect For: Global deployments, multilingual voice agents
→ Complete Cartesia Guide

☁️ Azure Speech - Enterprise Scale

Latest Models: Neural (high-quality), Standard (basic)
Key Features: 500+ voices, 100+ languages, SSML support, Microsoft integration
Perfect For: Enterprise applications, Azure ecosystem users
→ Complete Azure Guide

🎪 Inworld.ai - AI-Powered Expression

Latest Models: TTS-1 (flagship), TTS-1-Max (experimental)
Key Features: Emotional markup, context awareness, 11 languages
Perfect For: Gaming, entertainment, emotional customer support
→ Complete Inworld Guide

πŸŽ™οΈ Resemble AI - Custom Brand Voices

Key Features: WebSocket streaming, unlimited voice creation, business plansPerfect For: Brand consistency, personalized experiences, enterprise→ Complete Resemble Guide

Advanced Topics

πŸŽ›οΈ Voice Tuning

Master stability, similarity, and style controls across all providers

🔧 Troubleshooting

Common issues and solutions with step-by-step fixes

📈 Best Practices

Performance optimization, cost reduction, and production tips

🔗 See Also

Configuration: Learn how to configure TTS in your AI Configuration settings.
Integration: Understand how TTS fits into the overall Architecture of Burki Voice AI.
Call Management: Discover how TTS works with Call Management features.

Quick Start Guide

Recommended Setup:
  • Provider: ElevenLabs or Deepgram
  • Model: Flash v2.5 or Aura-2
  • Voice: Professional (Rachel, Asteria)
  • Settings: Stability 0.5, Speaker Boost ON
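
The recommended setup above pairs naturally into two starting presets. The sketch below writes them out as plain Python data with a small sanity check. All key names (`provider`, `model`, `voice`, `settings`, `speaker_boost`) and the `check_preset` helper are hypothetical and do not reflect Burki's actual configuration schema; in practice these values are entered in the TTS tab. The sketch also assumes that Stability and Speaker Boost apply only to the ElevenLabs preset and that stability is expressed on a 0.0 to 1.0 scale.

```python
from typing import Any, Dict, List

# Hypothetical presets mirroring the recommended setup; not a Burki schema.
PRESETS: Dict[str, Dict[str, Any]] = {
    "elevenlabs": {
        "provider": "ElevenLabs",
        "model": "Flash v2.5",
        "voice": "Rachel",
        "settings": {"stability": 0.5, "speaker_boost": True},
    },
    "deepgram": {
        "provider": "Deepgram",
        "model": "Aura-2",
        "voice": "Asteria",
        "settings": {},  # assumed: no stability/boost controls to set here
    },
}


def check_preset(preset: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable problems; empty means it looks sane."""
    problems: List[str] = []
    for key in ("provider", "model", "voice"):
        if not preset.get(key):
            problems.append(f"missing {key}")
    stability = preset.get("settings", {}).get("stability")
    # Assumed range: stability is a fraction between 0.0 and 1.0.
    if stability is not None and not 0.0 <= stability <= 1.0:
        problems.append("stability must be between 0.0 and 1.0")
    return problems
```

Running `check_preset` over both entries in `PRESETS` returns an empty list for each. Keeping presets as data like this makes it easy to record which voice and model a given assistant was tuned with before adjusting provider-specific options.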
API Rate Limits: Each provider has different rate limits and pricing models. Check the individual provider pages for detailed pricing information.

🚀 Ready to Get Started?

Choose your provider and dive into the detailed setup guides, or check out our Best Practices for optimization tips.