🎯 Resemble AI: Custom Brand Voices
Create unlimited custom voices with your brand’s unique sound. WebSocket streaming ensures real-time responses for professional applications.
Quick Setup
Get API Credentials
- Visit Resemble AI and create an account
- Navigate to Settings → API Keys
- Generate an API key with TTS permissions
- Copy your API Key and Project UUID
Create Custom Voice
- Go to Voices in your Resemble dashboard
- Click Create Voice and upload voice samples
- Wait for training completion (~30 minutes)
- Copy the generated Voice UUID
Business Plan Required: WebSocket streaming (required for real-time TTS) is only available on Business plans ($99+/month).
Voice Creation Process
🎙️ Build Your Brand Voice
Resemble AI specializes in creating custom voices that match your brand personality and requirements.
Voice Training Steps
- Voice Samples
- Training Process
- Voice Quality Tips
Upload Requirements:
- Duration: 3-10 minutes of clean audio
- Format: WAV or MP3, 22kHz+ sample rate
- Content: Read diverse sentences for best results
- Quality: Clear speech, minimal background noise
Available Models
🔧 Synthesis Models
Resemble AI focuses on custom voice synthesis rather than multiple models.
Default Synthesis Model
~300ms latencyHigh-quality neural synthesis optimized for custom voicesFeatures:
- Custom voice support
- WebSocket streaming
- Phone-compatible formats
- Twilio integration ready
Model Focus: Unlike other providers, Resemble specializes in voice quality and customization rather than offering multiple model options.
WebSocket Streaming
⚡ Real-Time Streaming
WebSocket streaming enables real-time TTS for live applications like phone calls and interactive experiences.
Streaming Setup
Audio Format Configuration
- Phone Calls (Recommended)
- High Quality
- Balanced
Custom Voice Management
🎛️ Voice Library Management
Organize and manage your custom voices for different use cases and brand requirements.
Voice Categories
Brand Representative Voices
Brand Representative Voices
Use Case: Customer service, sales, brand communicationCharacteristics:
- Professional and approachable tone
- Consistent with brand personality
- Clear pronunciation and pacing
- Suitable for extended conversations
- Use your actual customer service representatives
- Record in professional setting
- Include common business phrases and terminology
- Test with actual customer scripts
Character Voices
Character Voices
Use Case: Gaming, entertainment, interactive mediaCharacteristics:
- Distinctive personality traits
- Appropriate for character backstory
- Emotionally expressive range
- Memorable and engaging
- Work with voice actors who understand the character
- Include emotional range in training samples
- Record character-appropriate content
- Test with actual dialogue scripts
Narrator Voices
Narrator Voices
Use Case: E-learning, audiobooks, documentationCharacteristics:
- Clear and educational tone
- Good pacing for comprehension
- Neutral but engaging delivery
- Suitable for long-form content
- Use experienced narrators or educators
- Include varied sentence structures
- Practice with actual educational content
- Focus on clarity and comprehension
Voice UUID Management
Integration Examples
- Customer Service Bot
- Brand Spokesperson
- Multi-Voice Application
Pricing Structure
💰 Custom Voice Pricing
Resemble AI pricing is based on usage and plan features. WebSocket streaming requires Business plans or higher.
| Plan | Monthly Cost | Characters Included | WebSocket Streaming | Custom Voices |
|---|---|---|---|---|
| Basic | $29 | 200,000 | ❌ | 3 voices |
| Pro | $89 | 800,000 | ❌ | 10 voices |
| Business | $199 | 2,000,000 | ✅ | 25 voices |
| Enterprise | Custom | Custom | ✅ | Unlimited |
WebSocket Requirement: Real-time TTS for phone calls requires Business plan ($199/month) or higher due to WebSocket streaming dependency.
Cost Optimization Tips
Efficient Voice Usage
Efficient Voice Usage
- Voice Reuse: Create versatile voices that work across multiple use cases
- Batch Processing: Use REST API for non-real-time applications to save costs
- Smart Caching: Cache frequently used phrases to reduce API calls
- Context-Aware Selection: Use different voices only when necessary for user experience
Quality Assurance
🎯 Voice Quality Testing
Ensure your custom voices meet production standards with systematic testing approaches.
Testing Framework
Common Quality Issues
Pronunciation Problems
Pronunciation Problems
Issue: Custom voice mispronounces specific wordsSolutions:
- Include problematic words in training data
- Use phonetic spelling in TTS requests
- Create pronunciation guide for domain-specific terms
- Retrain voice with additional samples if needed
Emotional Range Limitations
Emotional Range Limitations
Issue: Voice sounds monotone or lacks expressionSolutions:
- Include more emotional range in training samples
- Use varied sentence types during training
- Consider retraining with more expressive speaker
- Test with TTS-specific emotional markup if available
Troubleshooting
WebSocket Connection Issues
WebSocket Connection Issues
Problem: Cannot establish WebSocket connectionSolutions:
- Verify Business plan subscription
- Check API key permissions for streaming
- Confirm project UUID is correct
- Test connection with WebSocket debugging tools
- Check firewall settings for WebSocket traffic
Voice UUID Not Found
Voice UUID Not Found
Problem: Custom voice UUID returns errorSolutions:
- Verify voice training is completed
- Check voice UUID spelling in configuration
- Confirm voice is associated with correct project
- Contact support if voice disappeared after training
Audio Quality Issues
Audio Quality Issues
Problem: Generated audio has artifacts or poor qualitySolutions:
- Adjust audio format settings (sample rate, precision)
- Test with different output formats
- Check if voice training data was high quality
- Consider retraining voice with better samples
- Verify network stability for streaming
Migration Guide
- From Standard TTS Providers
- Voice Replacement Strategy
Migration Benefits:
- Custom brand voice consistency
- WebSocket streaming for real-time apps
- Unlimited voice creation potential
- Professional voice quality control
- Voice Planning: Decide what custom voices you need
- Training Data: Collect high-quality voice samples
- Voice Creation: Train your custom voices
- Testing: Validate voice quality and performance
- Integration: Update API calls to use custom voice UUIDs
- Monitoring: Implement quality monitoring
🎯 Ready to Create Your Brand Voice?
Set up Resemble AI in your assistant configuration and start building custom voices that represent your brand perfectly!