
☁️ Azure Speech: Enterprise Scale

Microsoft’s neural TTS service with 500+ voices across 100+ languages. It integrates with the Azure ecosystem, supports SSML, and offers enterprise-grade reliability, which makes it a natural fit for organizations already using Microsoft services.

Quick Setup

1. Create Azure Speech Resource

  1. Go to Azure Portal
  2. Create a new Speech resource
  3. Select your subscription, resource group, and region
  4. Note your Key and Region from the resource’s Keys and Endpoint page

2. Configure in Burki

  1. Go to AI Configuration → TTS tab
  2. Select Azure Speech as provider
  3. Enter your API Key and Region (e.g., eastus, westus2)

3. Choose Voice & Model

Select your preferred neural voice from the dropdown
Free Tier: Azure offers 500,000 characters per month free. Neural voices are available on all tiers.

Available Models

🧠 Neural

High-Quality Neural Voices
Natural intonation and human-like speech
Quality: Premium
Best for: All production applications

📢 Standard

Standard TTS Voices
Basic text-to-speech synthesis
Quality: Good
Best for: Legacy compatibility
Recommendation: Always use Neural voices for the best quality. Standard voices are legacy and should only be used for specific compatibility needs.

Available Voices

English Voices

Jenny

Clear & Professional. Perfect for business applications.
Voice ID: en-US-JennyNeural

Aria

Warm & Natural. Great for friendly interactions.
Voice ID: en-US-AriaNeural

Guy

Natural & Confident. Strong, authoritative voice.
Voice ID: en-US-GuyNeural

Davis

Friendly & Approachable. Ideal for customer service.
Voice ID: en-US-DavisNeural

Jane

Professional. Clear business voice.
Voice ID: en-US-JaneNeural

Jason

Clear. Reliable male voice.
Voice ID: en-US-JasonNeural

Sonia

Clear & Professional. British female voice.
Voice ID: en-GB-SoniaNeural

Ryan

Warm. British male voice.
Voice ID: en-GB-RyanNeural

Other Languages

Elvira

Natural. Spanish female voice.
Voice ID: es-ES-ElviraNeural

Alvaro

Clear. Spanish male voice.
Voice ID: es-ES-AlvaroNeural

Denise

Natural. French female voice.
Voice ID: fr-FR-DeniseNeural

Henri

Clear. French male voice.
Voice ID: fr-FR-HenriNeural

Salma

Natural. Arabic female voice.
Voice ID: ar-SA-SalmaNeural

Hamed

Clear. Arabic male voice.
Voice ID: ar-SA-HamedNeural

| Voice | Language | Gender | Voice ID | Description |
|---|---|---|---|---|
| Jenny | en-US | Female | en-US-JennyNeural | Clear and professional |
| Aria | en-US | Female | en-US-AriaNeural | Warm and natural |
| Guy | en-US | Male | en-US-GuyNeural | Natural and confident |
| Davis | en-US | Male | en-US-DavisNeural | Friendly and approachable |
| Jane | en-US | Female | en-US-JaneNeural | Professional |
| Jason | en-US | Male | en-US-JasonNeural | Clear |
| Sonia | en-GB | Female | en-GB-SoniaNeural | Clear and professional |
| Ryan | en-GB | Male | en-GB-RyanNeural | Warm |
| Elvira | es-ES | Female | es-ES-ElviraNeural | Natural |
| Alvaro | es-ES | Male | es-ES-AlvaroNeural | Clear |
| Denise | fr-FR | Female | fr-FR-DeniseNeural | Natural |
| Henri | fr-FR | Male | fr-FR-HenriNeural | Clear |
| Salma | ar-SA | Female | ar-SA-SalmaNeural | Natural |
| Hamed | ar-SA | Male | ar-SA-HamedNeural | Clear |

500+ More Voices: Azure offers hundreds of additional voices. Visit the Azure Voice Gallery for the complete list.
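
The voice IDs listed above all follow a `locale-NameNeural` shape. As a rough illustration, the sketch below splits such an ID into its locale and voice name. The function name `parse_voice_id` and the regular expression are assumptions made for this example; the pattern only covers the IDs shown on this page, and other Azure voices may use shapes it does not match.

```python
import re

# Assumed pattern for IDs such as "en-US-JennyNeural"; not an official Azure grammar.
VOICE_ID_PATTERN = re.compile(r"^([a-z]{2,3}-[A-Z]{2})-([A-Za-z]+)Neural$")


def parse_voice_id(voice_id: str) -> tuple:
    """Return (locale, name) for a voice ID, or raise ValueError if it does not match."""
    match = VOICE_ID_PATTERN.match(voice_id.strip())
    if match is None:
        raise ValueError(f"unexpected voice ID format: {voice_id!r}")
    return match.group(1), match.group(2)


if __name__ == "__main__":
    print(parse_voice_id("en-GB-SoniaNeural"))
```

A check like this can catch typos such as a lowercase region code before the ID is sent to the provider.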

Voice Controls

Azure Speech provides advanced voice customization through SSML:
Speed controls how fast the voice speaks (range: 0.5 to 2.0):
  • 0.5: Half speed (very slow)
  • 1.0: ✅ Normal speed (Recommended)
  • 2.0: Double speed (very fast)
<prosody rate="+20%">
  Speaking slightly faster than normal.
</prosody>

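The speed setting is expressed as a multiplier, while the SSML example uses a relative percentage. One plausible way to relate the two is sketched below: a multiplier of 1.2 becomes `+20%`. The function name `speed_to_prosody` and the mapping itself are assumptions for illustration; this page does not state how Burki converts the value internally.

```python
from xml.sax.saxutils import escape


def speed_to_prosody(text: str, speed: float = 1.0) -> str:
    """Wrap text in an SSML <prosody> element for a speed multiplier in [0.5, 2.0]."""
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be between 0.5 and 2.0")
    # Assumed mapping: 1.0 -> +0%, 1.2 -> +20%, 0.5 -> -50%.
    percent = round((speed - 1.0) * 100)
    # escape() protects characters such as & and < inside the spoken text.
    return f'<prosody rate="{percent:+d}%">{escape(text)}</prosody>'


if __name__ == "__main__":
    print(speed_to_prosody("Speaking slightly faster than normal.", 1.2))
```

Rejecting values outside the documented range keeps an out-of-range setting from silently producing unnatural speech.
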
Configuration Options

Audio Format

Azure Speech automatically outputs the appropriate format for your telephony provider:
  • Twilio/Telnyx: PCM μ-law @ 8kHz (Raw8Khz8BitMonoMULaw)
  • Vonage: PCM 16-bit @ 16kHz (Raw16Khz16BitMonoPcm)
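
To make the two formats concrete, the sketch below models the provider-to-format lookup described above and derives the raw audio bandwidth of each format. The names `AudioFormat`, `FORMAT_BY_PROVIDER`, and `format_for` are hypothetical; only the two format identifiers come from this page, and Burki's actual selection logic is not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioFormat:
    azure_name: str        # format identifier as listed on this page
    sample_rate_hz: int
    bits_per_sample: int

    @property
    def bytes_per_second(self) -> int:
        # Mono audio: one sample per tick, bits converted to bytes.
        return self.sample_rate_hz * self.bits_per_sample // 8


MULAW_8K = AudioFormat("Raw8Khz8BitMonoMULaw", 8000, 8)
PCM_16K = AudioFormat("Raw16Khz16BitMonoPcm", 16000, 16)

# Hypothetical lookup keyed by lowercase provider name.
FORMAT_BY_PROVIDER = {"twilio": MULAW_8K, "telnyx": MULAW_8K, "vonage": PCM_16K}


def format_for(provider: str) -> AudioFormat:
    """Return the audio format for a telephony provider, or raise ValueError."""
    key = provider.strip().lower()
    if key not in FORMAT_BY_PROVIDER:
        raise ValueError(f"unsupported telephony provider: {provider!r}")
    return FORMAT_BY_PROVIDER[key]


if __name__ == "__main__":
    for name in ("Twilio", "Vonage"):
        fmt = format_for(name)
        print(name, fmt.azure_name, fmt.bytes_per_second, "bytes/s")
```

The 16 kHz PCM stream carries four times as many bytes per second as the 8 kHz μ-law stream, which matters when sizing buffers.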

Configuration in Burki

To use Azure Speech TTS in your assistant:
1. Get Azure Credentials

Create a Speech resource in Azure Portal and copy your Subscription Key and Region.

2. Add to Burki

Go to Settings → Provider Keys → TTS and add your Azure Speech credentials.

3. Configure Assistant

Edit your assistant, select Azure Speech as the TTS provider, and choose a voice.

SSML Support

Azure Speech supports SSML for advanced voice control:
<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">
    <voice name="en-US-JennyNeural">
        <prosody rate="+10%" pitch="+5%">
            Welcome to our service!
        </prosody>
        <break time="500ms"/>
        How can I assist you today?
    </voice>
</speak>
Use SSML tags in your assistant’s responses for fine-grained control over pronunciation, emphasis, and pacing.
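
When SSML is assembled from dynamic text, characters such as `&` and `<` have to be escaped or the document becomes invalid XML. The sketch below builds a document with the same structure as the example above using only the Python standard library. The function name `build_ssml` and its parameters are illustrative assumptions, not part of Burki or the Azure SDK.

```python
from xml.sax.saxutils import escape, quoteattr

SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(text: str, voice: str = "en-US-JennyNeural",
               rate: str = "+0%", pitch: str = "+0%", lang: str = "en-US") -> str:
    """Return an SSML document that speaks `text` with the given voice and prosody."""
    # quoteattr() adds the surrounding quotes and escapes attribute values;
    # escape() handles &, < and > inside the spoken text.
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        f"<voice name={quoteattr(voice)}>"
        f"<prosody rate={quoteattr(rate)} pitch={quoteattr(pitch)}>"
        f"{escape(text)}"
        "</prosody></voice></speak>"
    )


if __name__ == "__main__":
    print(build_ssml("Welcome to our service!", rate="+10%", pitch="+5%"))
```

Building the markup through one helper keeps the escaping in a single place instead of scattering it across response templates.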

Regional Selection

Latency Optimization: Choose the Azure region closest to your deployment for optimal latency.

| Region | Location | Best For |
|---|---|---|
| eastus | East US | North America (East) |
| westus2 | West US 2 | North America (West) |
| westeurope | Netherlands | Europe |
| southeastasia | Singapore | Asia-Pacific |
| australiaeast | Australia East | Australia/Oceania |
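
The region identifier matters because it is part of the service hostname, so a key from one region will not authenticate against another. The sketch below shows how the region and key would combine for a direct call to the Azure text-to-speech REST endpoint, based on the publicly documented URL and header names as best recalled here; verify them against the current Azure reference. The function name `tts_request_parts`, the region check, and the `User-Agent` value are assumptions, and this is not a description of how Burki calls Azure.

```python
import re

# Assumed shape of region identifiers such as "eastus" or "westus2".
REGION_PATTERN = re.compile(r"^[a-z][a-z0-9]+$")


def tts_request_parts(region: str, key: str,
                      output_format: str = "raw-8khz-8bit-mono-mulaw"):
    """Return (url, headers) for a text-to-speech request in the given region."""
    if not REGION_PATTERN.match(region):
        raise ValueError(f"region should look like 'eastus', got {region!r}")
    url = f"https://{region}.tts.speech.microsoft.com/cognitiveservices/v1"
    headers = {
        "Ocp-Apim-Subscription-Key": key,        # key from the Speech resource
        "Content-Type": "application/ssml+xml",  # request body is an SSML document
        "X-Microsoft-OutputFormat": output_format,
        "User-Agent": "example-client",          # placeholder value
    }
    return url, headers


if __name__ == "__main__":
    url, _ = tts_request_parts("westeurope", "<your-key>")
    print(url)
```

Entering a display name such as "East US" instead of the identifier `eastus` is a common cause of failed requests, which is why the sketch validates the region first.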

Pricing Overview


| Tier | Characters/Month | Neural Voices | Price |
|---|---|---|---|
| Free | 500,000 | Yes | $0 |
| Standard | Pay-as-you-go | Yes | $16 per 1M chars |

Enterprise: Contact Azure for custom pricing on high-volume usage and reserved capacity.
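
For rough budgeting, the figures in the table can be turned into a quick estimate. The sketch below assumes the $16 per 1M characters rate shown above, linear billing, and that the Free and Standard tiers are separate plans. The function names are illustrative, and actual Azure pricing may differ by region and over time.

```python
FREE_CHARS_PER_MONTH = 500_000      # free tier allowance from the table
STANDARD_USD_PER_MILLION = 16.0     # Standard tier rate from the table


def fits_free_tier(chars_per_month: int) -> bool:
    """Return True if the monthly volume stays within the free allowance."""
    return chars_per_month <= FREE_CHARS_PER_MONTH


def standard_cost_usd(chars_per_month: int) -> float:
    """Estimate the monthly Standard tier cost, assuming linear per-character billing."""
    if chars_per_month < 0:
        raise ValueError("character count cannot be negative")
    return chars_per_month / 1_000_000 * STANDARD_USD_PER_MILLION


if __name__ == "__main__":
    volume = 2_500_000
    print(fits_free_tier(volume), standard_cost_usd(volume))
```

For example, 2.5 million characters per month exceeds the free allowance and would come to about $40 on the Standard tier under these assumptions.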

Common Issues & Solutions

Problem: API returns 401 Unauthorized
Solutions:
  • Verify your Azure Speech Key is correct in Settings → Provider Keys
  • Ensure the key is from your Speech resource (not another Azure service)
  • Check that the region matches your Speech resource’s region

Problem: Selected voice doesn’t work
Solutions:
  • Verify the voice ID format (e.g., en-US-JennyNeural)
  • Check that the voice is available in your region
  • Ensure your subscription tier supports the selected voice

Problem: TTS response is slow
Solutions:
  • Select an Azure region closer to your users
  • Burki uses streaming synthesis for optimal performance
  • Consider caching common phrases
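
Caching common phrases could look something like the sketch below: an in-memory cache keyed by a hash of the voice and text, so that repeated greetings are synthesized only once. The class `PhraseCache` and the `synthesize` callable are hypothetical; the callable stands in for whatever function actually requests audio, and this is not a description of a built-in Burki feature.

```python
import hashlib
from typing import Callable, Dict


class PhraseCache:
    """In-memory cache of synthesized audio, keyed by voice and text."""

    def __init__(self, synthesize: Callable[[str, str], bytes]):
        self._synthesize = synthesize       # hypothetical (voice, text) -> audio bytes
        self._store: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(voice: str, text: str) -> str:
        # A NUL separator avoids collisions between different (voice, text) pairs.
        return hashlib.sha256(f"{voice}\x00{text}".encode("utf-8")).hexdigest()

    def get(self, voice: str, text: str) -> bytes:
        """Return cached audio, calling the synthesizer only on a miss."""
        key = self._key(voice, text)
        if key in self._store:
            self.hits += 1
        else:
            self.misses += 1
            self._store[key] = self._synthesize(voice, text)
        return self._store[key]


if __name__ == "__main__":
    cache = PhraseCache(lambda voice, text: f"{voice}:{text}".encode("utf-8"))
    cache.get("en-US-JennyNeural", "How can I assist you today?")
    cache.get("en-US-JennyNeural", "How can I assist you today?")
    print(cache.hits, cache.misses)
```

A production cache would also need a size limit or expiry policy, which this sketch leaves out for brevity.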

Best Practices

See Also

🎯 Multilingual?

Cartesia Sonic 3 - 42 languages with voice cloning

⚡ Need Speed?

Deepgram Aura - Ultra-low ~75ms latency

🔗 Additional Resources


🚀 Ready to Use Azure Speech?

Head back to your assistant configuration and set up Azure Speech for enterprise-grade TTS!