Overview
60db’s Text-to-Speech (TTS) API converts written text into natural-sounding speech using advanced AI models. Our TTS engine supports multiple voices, languages, and customization options.
Features
Multiple Voices
Choose from 50+ pre-built voices or create custom voices
Real-time Streaming
Stream audio in real-time for low-latency applications
Voice Customization
Adjust speed, pitch, and other parameters
High Quality
Crystal-clear audio with natural intonation
Basic Usage
- JavaScript
- Python
Voice Parameters
Speed
Control the speaking rate of the generated audio:
- 0.5: Half speed (slow)
- 1.0: Normal speed (default)
- 2.0: Double speed (fast)
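To make the effect of the speed values concrete, the sketch below estimates how long a clip runs at a given speed. It assumes speed behaves as a plain rate multiplier and that 0.5 to 2.0 are hard limits; both are assumptions for illustration, and `estimated_duration` is a hypothetical helper, not part of the 60db API.

```python
MIN_SPEED = 0.5  # assumed lower bound (listed above as half speed)
MAX_SPEED = 2.0  # assumed upper bound (listed above as double speed)


def estimated_duration(base_seconds: float, speed: float = 1.0) -> float:
    """Estimate clip length, treating speed as a plain rate multiplier."""
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(
            f"speed must be between {MIN_SPEED} and {MAX_SPEED}, got {speed}"
        )
    return base_seconds / speed


# A clip that lasts 10 seconds at normal speed:
print(estimated_duration(10.0, 0.5))  # 20.0
print(estimated_duration(10.0, 2.0))  # 5.0
```

Validating the value on the client side like this avoids a round trip for requests that would likely be rejected anyway.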
Pitch
Adjust the pitch of the voice.
Enhancement
Enable audio enhancement for better quality.
Output Formats
Supported audio formats:
| Format | Quality | File Size | Use Case |
|---|---|---|---|
| MP3 | Good | Small | Web, mobile apps |
| WAV | Excellent | Large | Professional audio |
| OGG | Good | Small | Web streaming |
| FLAC | Lossless | Medium | High-quality archival |
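If an application picks the format at runtime, the table can be encoded as a small lookup. The sketch below copies the table's values verbatim; the lowercase keys and the helper names `get_format` and `formats_with_size` are hypothetical and do not reflect how the 60db API names its format options.

```python
from typing import Dict, List, NamedTuple


class AudioFormat(NamedTuple):
    quality: str
    file_size: str
    use_case: str


# Values copied from the table above; the lowercase keys are an assumption.
FORMATS: Dict[str, AudioFormat] = {
    "mp3": AudioFormat("Good", "Small", "Web, mobile apps"),
    "wav": AudioFormat("Excellent", "Large", "Professional audio"),
    "ogg": AudioFormat("Good", "Small", "Web streaming"),
    "flac": AudioFormat("Lossless", "Medium", "High-quality archival"),
}


def get_format(name: str) -> AudioFormat:
    """Look up a format case-insensitively, rejecting unsupported names."""
    key = name.lower()
    if key not in FORMATS:
        raise ValueError(f"unsupported format: {name!r}")
    return FORMATS[key]


def formats_with_size(file_size: str) -> List[str]:
    """Return the names of all formats in the given file-size class."""
    return sorted(n for n, f in FORMATS.items() if f.file_size == file_size)


print(get_format("WAV").use_case)   # Professional audio
print(formats_with_size("Small"))   # ['mp3', 'ogg']
```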
Streaming Mode
For long-form content or real-time applications, use streaming.
Benefits of Streaming
- Lower Latency: Start playing audio before generation completes
- Memory Efficient: Process chunks instead of entire file
- Better UX: Progressive audio playback
- Real-time Applications: Ideal for chatbots and voice assistants
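The benefits above come from handling audio chunk by chunk. The following sketch shows that consumption pattern in isolation: `fake_audio_stream` is a stand-in generator rather than a 60db client call, and the chunk size is an arbitrary choice. A real integration would iterate over whatever chunk iterator the SDK or HTTP client exposes.

```python
import io
from typing import BinaryIO, Iterator


def fake_audio_stream(total_bytes: int, chunk_size: int = 4096) -> Iterator[bytes]:
    """Stand-in for a streaming TTS response: yields one chunk at a time."""
    sent = 0
    while sent < total_bytes:
        size = min(chunk_size, total_bytes - sent)
        yield bytes(size)  # placeholder zero bytes; a real stream carries encoded audio
        sent += size


def consume(stream: Iterator[bytes], sink: BinaryIO) -> int:
    """Write each chunk to the sink as it arrives; return total bytes written."""
    written = 0
    for chunk in stream:
        sink.write(chunk)  # a player could begin playback here, on the first chunk
        written += len(chunk)
    return written


buffer = io.BytesIO()
total = consume(fake_audio_stream(10_000), buffer)
print(total)  # 10000
```

Only one chunk is held by the loop at any moment, which is where the memory saving comes from when the sink is a file or an audio device rather than an in-memory buffer.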
Best Practices
Text Formatting
- Use proper punctuation for natural pauses
- Break long texts into paragraphs
- Use SSML tags for advanced control (coming soon)
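One way to apply the paragraph guideline is to split the input on blank lines before sending it for synthesis. This is a minimal sketch: `split_paragraphs` is a hypothetical helper, and treating a blank line as the paragraph boundary is an assumption about the input text.

```python
import re
from typing import List


def split_paragraphs(text: str) -> List[str]:
    """Split text on blank lines and drop empty or whitespace-only pieces."""
    parts = re.split(r"\n\s*\n", text)
    return [part.strip() for part in parts if part.strip()]


article = "First paragraph.\n\nSecond paragraph.\n \nThird paragraph."
print(split_paragraphs(article))
# ['First paragraph.', 'Second paragraph.', 'Third paragraph.']
```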
Voice Selection
- Test multiple voices for your use case
- Consider accent and gender for your audience
- Use custom voices for brand consistency
Performance
- Cache frequently used audio
- Use streaming for content longer than 500 characters
- Batch requests when possible
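The caching and streaming guidelines can be sketched as a small wrapper around whatever function performs synthesis. Everything named here is hypothetical: `get_audio`, `cache_key`, and the `synthesize` callback stand in for a real client call, and the in-memory dictionary is only one possible cache. The 500-character threshold is the figure from the list above.

```python
import hashlib
import json
from typing import Callable, Dict

STREAMING_THRESHOLD = 500  # characters, from the guideline above

_cache: Dict[str, bytes] = {}


def should_stream(text: str) -> bool:
    """Apply the guideline: stream when the text exceeds the threshold."""
    return len(text) > STREAMING_THRESHOLD


def cache_key(text: str, voice: str, speed: float = 1.0, fmt: str = "mp3") -> str:
    """Hash every input that changes the audio, so distinct requests never collide."""
    payload = json.dumps(
        {"text": text, "voice": voice, "speed": speed, "format": fmt},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def get_audio(text: str, voice: str, synthesize: Callable[[str, str], bytes]) -> bytes:
    """Return cached audio when available; otherwise synthesize once and store it."""
    key = cache_key(text, voice)
    if key not in _cache:
        _cache[key] = synthesize(text, voice)
    return _cache[key]


calls = []


def fake_synthesize(text: str, voice: str) -> bytes:
    """Stand-in for a real TTS call; records each invocation."""
    calls.append(text)
    return b"audio:" + text.encode("utf-8")


first = get_audio("Welcome back!", "voice-a", fake_synthesize)
second = get_audio("Welcome back!", "voice-a", fake_synthesize)
print(len(calls))                # 1
print(should_stream("x" * 501))  # True
```

Including voice, speed, and format in the key matters: the same text rendered with different settings is different audio and should not share a cache entry.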
Quality
- Enable enhancement for production use
- Use WAV format for highest quality
- Test with different speed settings