Text-to-Speech
Text to Speech
Convert text to natural-sounding speech
POST
Request
Headers
Bearer token with your API key
application/json
Body
The text to convert to speech (max 5000 characters)
ID of the voice to use (default: system default voice)
Alternative way to specify voice name
Enable audio enhancement for better quality
Speech speed multiplier (0.5 to 2.0)
Voice stability 0-100 (lower = more expressive, higher = more consistent)
Voice similarity 0-100 (how closely the output matches the source voice)
Audio output format: mp3, wav, ogg, flac
Response
Indicates if the request was successful
Status message
Base64-encoded audio data
Audio sample rate in Hz
Duration of the audio in seconds
Audio encoding format
Audio output format