☁️ Azure Speech: Enterprise Scale
Microsoft’s neural TTS service with 500+ voices across 100+ languages. It integrates with the rest of the Azure ecosystem, supports SSML, and offers enterprise-grade reliability. A good fit for organizations already using Microsoft services.
Quick Setup
Create Azure Speech Resource
- Go to Azure Portal
- Create a new Speech resource
- Select your subscription, resource group, and region
- Note your Key and Region from the resource’s Keys and Endpoint page
Configure in Burki
- Go to AI Configuration → TTS tab
- Select Azure Speech as provider
- Enter your API Key and Region (e.g., eastus, westus2)
Free Tier: Azure offers 500,000 characters per month free. Neural voices are available on all tiers.
Available Models
🧠 Neural
High-Quality Neural Voices
Natural intonation and human-like speech
Quality: Premium
Best for: All production applications
📢 Standard
Standard TTS Voices
Basic text-to-speech synthesis
Quality: Good
Best for: Legacy compatibility
Recommendation: Always use Neural voices for the best quality. Standard voices are legacy and should only be used for specific compatibility needs.
Available Voices
English Voices
American English (en-US)
Jenny
Clear & Professional
Perfect for business applications
Voice ID: en-US-JennyNeural
Aria
Warm & Natural
Great for friendly interactions
Voice ID: en-US-AriaNeural
Guy
Natural & Confident
Strong, authoritative voice
Voice ID: en-US-GuyNeural
Davis
Friendly & Approachable
Ideal for customer service
Voice ID: en-US-DavisNeural
Jane
Professional
Clear business voice
Voice ID: en-US-JaneNeural
Jason
Clear
Reliable male voice
Voice ID: en-US-JasonNeural
British English (en-GB)
Sonia
Clear & Professional
British female voice
Voice ID: en-GB-SoniaNeural
Ryan
Warm
British male voice
Voice ID: en-GB-RyanNeural
Other Languages
Spanish (es-ES)
Elvira
Natural
Spanish female voice
Voice ID: es-ES-ElviraNeural
Alvaro
Clear
Spanish male voice
Voice ID: es-ES-AlvaroNeural
French (fr-FR)
Denise
Natural
French female voice
Voice ID: fr-FR-DeniseNeural
Henri
Clear
French male voice
Voice ID: fr-FR-HenriNeural
Arabic (ar-SA)
Salma
Natural
Arabic female voice
Voice ID: ar-SA-SalmaNeural
Hamed
Clear
Arabic male voice
Voice ID: ar-SA-HamedNeural
All Configured Voices
| Voice | Language | Gender | Voice ID | Description |
|---|---|---|---|---|
| Jenny | en-US | Female | en-US-JennyNeural | Clear and professional |
| Aria | en-US | Female | en-US-AriaNeural | Warm and natural |
| Guy | en-US | Male | en-US-GuyNeural | Natural and confident |
| Davis | en-US | Male | en-US-DavisNeural | Friendly and approachable |
| Jane | en-US | Female | en-US-JaneNeural | Professional |
| Jason | en-US | Male | en-US-JasonNeural | Clear |
| Sonia | en-GB | Female | en-GB-SoniaNeural | Clear and professional |
| Ryan | en-GB | Male | en-GB-RyanNeural | Warm |
| Elvira | es-ES | Female | es-ES-ElviraNeural | Natural |
| Alvaro | es-ES | Male | es-ES-AlvaroNeural | Clear |
| Denise | fr-FR | Female | fr-FR-DeniseNeural | Natural |
| Henri | fr-FR | Male | fr-FR-HenriNeural | Clear |
| Salma | ar-SA | Female | ar-SA-SalmaNeural | Natural |
| Hamed | ar-SA | Male | ar-SA-HamedNeural | Clear |
500+ More Voices: Azure offers hundreds of additional voices. Visit the Azure Voice Gallery for the complete list.
Voice Controls
Azure Speech provides advanced voice customization through SSML:
- Speaking Rate
- Pitch
- SSML Support
Speaking Rate: controls speech speed (range: 0.5 to 2.0)
- 0.5: Half speed (very slow)
- 1.0: ✅ Normal speed (Recommended)
- 2.0: Double speed (very fast)
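To make the rate range above concrete, here is a minimal sketch of how a rate multiplier could be expressed in SSML using the `prosody` element. The helper name `build_ssml` and its defaults are hypothetical, and this is not necessarily how Burki builds its requests internally; it only assumes that Azure accepts a plain number in `rate` as a multiplier of the default speed.

```python
from xml.sax.saxutils import escape

# Range taken from the speaking-rate description above.
MIN_RATE, MAX_RATE = 0.5, 2.0


def build_ssml(text: str, voice: str = "en-US-JennyNeural", rate: float = 1.0) -> str:
    """Wrap text in SSML with a prosody rate multiplier (1.0 = normal speed).

    Hypothetical helper for illustration only.
    """
    # Clamp so out-of-range values never reach the service.
    rate = max(MIN_RATE, min(MAX_RATE, rate))
    # Derive the locale from the voice ID: "en-US-JennyNeural" -> "en-US".
    lang = "-".join(voice.split("-")[:2])
    return (
        f'<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="{lang}">'
        f'<voice name="{voice}">'
        f'<prosody rate="{rate:g}">{escape(text)}</prosody>'
        "</voice></speak>"
    )


if __name__ == "__main__":
    print(build_ssml("Thanks for calling. How can I help?", rate=1.2))
```

Escaping the text matters because characters such as `&` or `<` in a transcript would otherwise produce invalid SSML.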
Configuration Options
Audio Format
Azure Speech automatically outputs the appropriate format for your telephony provider:
- Twilio/Telnyx: PCM μ-law @ 8kHz (Raw8Khz8BitMonoMULaw)
- Vonage: PCM 16-bit @ 16kHz (Raw16Khz16BitMonoPcm)
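The provider-to-format mapping above can be sketched as a small lookup table. The names `AudioFormat`, `FORMATS`, and `format_for` are hypothetical and exist only for this illustration; Burki selects the format automatically, so nothing like this needs to be written by hand. The byte-rate calculation shows why the two formats differ in bandwidth: one byte per sample at 8 kHz versus two bytes per sample at 16 kHz.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioFormat:
    azure_name: str        # format name as listed above
    sample_rate_hz: int
    bytes_per_sample: int

    def bytes_per_second(self) -> int:
        # Mono audio: one sample per tick of the sample clock.
        return self.sample_rate_hz * self.bytes_per_sample


# Hypothetical lookup table mirroring the list above.
FORMATS = {
    "twilio": AudioFormat("Raw8Khz8BitMonoMULaw", 8000, 1),
    "telnyx": AudioFormat("Raw8Khz8BitMonoMULaw", 8000, 1),
    "vonage": AudioFormat("Raw16Khz16BitMonoPcm", 16000, 2),
}


def format_for(provider: str) -> AudioFormat:
    """Return the audio format for a telephony provider name (case-insensitive)."""
    try:
        return FORMATS[provider.lower()]
    except KeyError:
        raise ValueError(f"no audio format configured for provider {provider!r}") from None


if __name__ == "__main__":
    for name in ("Twilio", "Vonage"):
        fmt = format_for(name)
        print(name, fmt.azure_name, fmt.bytes_per_second(), "bytes/s")
```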
Configuration in Burki
To use Azure Speech TTS in your assistant:
Get Azure Credentials
Create a Speech resource in the Azure Portal and copy your Subscription Key and Region.
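If you want to confirm that a key and region pair belong together before entering them in Burki, one option is to call the Azure text-to-speech REST endpoint directly. The sketch below only builds the request; the function name `build_tts_request` and the `User-Agent` value are placeholders chosen for this example, and it is not a description of how Burki talks to Azure.

```python
import urllib.request


def build_tts_request(
    key: str,
    region: str,
    ssml: str,
    output_format: str = "raw-8khz-8bit-mono-mulaw",
) -> urllib.request.Request:
    """Build (but do not send) a POST request to the regional Azure TTS endpoint."""
    url = f"https://{region}.tts.speech.microsoft.com/cognitiveservices/v1"
    headers = {
        "Ocp-Apim-Subscription-Key": key,          # the Key from Keys and Endpoint
        "Content-Type": "application/ssml+xml",
        "X-Microsoft-OutputFormat": output_format,
        "User-Agent": "burki-docs-example",        # placeholder application name
    }
    return urllib.request.Request(url, data=ssml.encode("utf-8"), headers=headers, method="POST")


if __name__ == "__main__":
    ssml = (
        '<speak version="1.0" xml:lang="en-US">'
        '<voice name="en-US-JennyNeural">Hello from Azure Speech.</voice></speak>'
    )
    req = build_tts_request("YOUR_KEY", "eastus", ssml)
    # Sending requires a real key; a mismatched key or region typically yields HTTP 401.
    # with urllib.request.urlopen(req) as resp:
    #     audio = resp.read()
    print(req.full_url)
```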
SSML Support
Azure Speech supports SSML for advanced voice control.
Regional Selection
Latency Optimization: Choose the Azure region closest to your deployment for optimal latency.
| Region | Location | Best For |
|---|---|---|
| eastus | East US | North America (East) |
| westus2 | West US 2 | North America (West) |
| westeurope | Netherlands | Europe |
| southeastasia | Singapore | Asia-Pacific |
| australiaeast | Australia East | Australia/Oceania |
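When it is not obvious which region is closest, a rough measurement from the machine that hosts your deployment can help. The sketch below times a TCP connection to each regional endpoint and picks the lowest; the helper names are hypothetical, and connect time is only a proxy for real synthesis latency, so treat the result as a hint rather than a benchmark.

```python
import socket
import time

# Regions from the table above.
REGIONS = ["eastus", "westus2", "westeurope", "southeastasia", "australiaeast"]


def tts_host(region: str) -> str:
    """Hostname of the regional text-to-speech endpoint."""
    return f"{region}.tts.speech.microsoft.com"


def connect_time_ms(host: str, port: int = 443, timeout: float = 3.0) -> float:
    """Time one TCP connect in milliseconds; returns infinity if unreachable."""
    start = time.perf_counter()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return (time.perf_counter() - start) * 1000
    except OSError:
        return float("inf")


def fastest_region(timings_ms: dict[str, float]) -> str:
    """Return the region with the lowest measured time."""
    return min(timings_ms, key=timings_ms.get)


if __name__ == "__main__":
    timings = {region: connect_time_ms(tts_host(region)) for region in REGIONS}
    for region, ms in sorted(timings.items(), key=lambda item: item[1]):
        print(f"{region:15s} {ms:8.1f} ms")
    print("closest:", fastest_region(timings))
```

Run it from the same network as your Burki deployment, since latency measured from a laptop may point to a different region.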
Pricing Overview
| Tier | Characters/Month | Neural Voices | Price |
|---|---|---|---|
| Free | 500,000 | Yes | $0 |
| Standard | Pay-as-you-go | Yes | $16 per 1M chars |
Enterprise: Contact Azure for custom pricing on high-volume usage and reserved capacity.
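As a quick way to budget, the figures in the table can be turned into a small estimate. This is a sketch that uses only the numbers shown above ($16 per 1M characters on Standard, 500,000 characters per month on Free); the function names are hypothetical, and current Azure pricing should be checked before relying on the result.

```python
PRICE_PER_MILLION_USD = 16.0   # Standard tier, from the table above
FREE_TIER_CHARS = 500_000      # Free tier monthly allowance, from the table above


def standard_cost_usd(chars: int) -> float:
    """Estimated monthly cost on the Standard tier for a given character count."""
    return chars / 1_000_000 * PRICE_PER_MILLION_USD


def fits_free_tier(chars: int) -> bool:
    """True if the monthly character count stays within the Free tier allowance."""
    return chars <= FREE_TIER_CHARS


if __name__ == "__main__":
    monthly_chars = 2_500_000  # example volume, not a measured figure
    print(f"Fits free tier: {fits_free_tier(monthly_chars)}")
    print(f"Standard tier estimate: ${standard_cost_usd(monthly_chars):.2f}")
```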
Common Issues & Solutions
Authentication Failed
Problem: API returns 401 Unauthorized
Solutions:
- Verify your Azure Speech Key is correct in Settings → Provider Keys
- Ensure the key is from your Speech resource (not another Azure service)
- Check that the region matches your Speech resource’s region
Voice Not Available
Problem: Selected voice doesn’t work
Solutions:
- Verify the voice ID format (e.g., en-US-JennyNeural)
- Check that the voice is available in your region
- Ensure your subscription tier supports the selected voice
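A format check can catch typos in a voice ID before a call fails. The sketch below assumes the `language-REGION-NameNeural` pattern used by every voice listed on this page; some Azure voices use other shapes, so a rejection here means "double-check the ID", not "the voice does not exist". The names `VOICE_ID` and `parse_voice_id` are hypothetical.

```python
import re

# Pattern for IDs shaped like "en-US-JennyNeural" (an assumption based on this page's voices).
VOICE_ID = re.compile(r"^(?P<lang>[a-z]{2,3})-(?P<region>[A-Z]{2})-(?P<name>[A-Za-z]+)Neural$")


def parse_voice_id(voice_id: str) -> tuple[str, str]:
    """Split a voice ID into (locale, voice name), or raise ValueError."""
    match = VOICE_ID.match(voice_id.strip())
    if match is None:
        raise ValueError(f"unexpected voice ID format: {voice_id!r}")
    return f"{match['lang']}-{match['region']}", match["name"]


if __name__ == "__main__":
    print(parse_voice_id("en-US-JennyNeural"))
    try:
        parse_voice_id("en-us-jenny")
    except ValueError as err:
        print(err)
```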
High Latency
Problem: TTS response is slow
Solutions:
- Select an Azure region closer to your users
- Burki uses streaming synthesis for optimal performance
- Consider caching common phrases
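The caching suggestion above can be sketched with an in-memory LRU cache keyed by voice and text. The `synthesize` callable is a stand-in for whatever function actually produces audio in your own code; it is not a Burki or Azure API, and `make_cached_tts` is a hypothetical name.

```python
from functools import lru_cache
from typing import Callable


def make_cached_tts(synthesize: Callable[[str, str], bytes], max_phrases: int = 256):
    """Wrap a (voice, text) -> audio function so repeated phrases are synthesized once."""

    @lru_cache(maxsize=max_phrases)
    def cached(voice: str, text: str) -> bytes:
        return synthesize(voice, text)

    return cached


if __name__ == "__main__":
    def fake_synthesize(voice: str, text: str) -> bytes:
        # Stand-in for a real TTS call; returns dummy bytes.
        print(f"synthesizing {text!r} with {voice}")
        return text.encode("utf-8")

    tts = make_cached_tts(fake_synthesize)
    tts("en-US-JennyNeural", "Thank you for calling.")
    tts("en-US-JennyNeural", "Thank you for calling.")  # served from the cache
    print(tts.cache_info())
```

This works best for fixed phrases such as greetings and hold messages; text that includes caller-specific details will rarely repeat and gains little from caching.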
Best Practices
See Also
🎯 Multilingual?
Cartesia Sonic 3 - 42 languages with voice cloning
⚡ Need Speed?
Deepgram Aura - Ultra-low ~75ms latency
🔗 Additional Resources
- Azure Portal: portal.azure.com
- Voice Gallery: Azure Voice Gallery
- Documentation: Azure Speech Service Docs
- Pricing: Azure Speech Pricing
🚀 Ready to Use Azure Speech?
Head back to your assistant configuration and set up Azure Speech for enterprise-grade TTS!