Generate speech
Synthesizes speech from a text string and returns the audio data as a binary stream.
Authorizations
Your API key; get it from your LMNT account page.
Body
The text to synthesize; maximum 5000 characters per request (including spaces).
The voice id of the voice to use for synthesis; voice ids can be retrieved by calls to List voices or Voice info.
Set this to true to generate conversational-style speech rather than reading-style speech. Does not work with the blizzard model.
The file format of the synthesized audio output.
Available options: aac, mp3, mulaw, raw, wav
The desired language of the synthesized speech, as a two-letter ISO 639-1 code. Does not work with professional clones or the blizzard model.
Available options: de, en, es, fr, pt, zh, ko, hi
Produce speech of this length in seconds; maximum 300.0 (5 minutes). Does not work with the blizzard model.
Required range: x < 300
The model to use for synthesis. One of aurora (default) or blizzard. Learn more about models here.
Available options: aurora, blizzard
The desired output sample rate in Hz.
Available options: 8000, 16000, 24000
Seed used to specify a different take; defaults to random
The talking speed of the generated speech, a floating point value between 0.25 (slow) and 2.0 (fast).
Required range: 0.25 < x < 2
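The constraints listed above can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: the function name build_speech_payload and the JSON field names (text, voice, conversational, format, language, length, model, sample_rate, seed, speed) are assumptions, since this page does not show the parameter names. Rejecting blizzard-incompatible options with an error is also a design choice of the sketch, not documented API behavior.

```python
from typing import Any, Dict, Optional

# Limits and option lists copied from the parameter descriptions above.
MAX_TEXT_CHARS = 5000
FORMATS = {"aac", "mp3", "mulaw", "raw", "wav"}
LANGUAGES = {"de", "en", "es", "fr", "pt", "zh", "ko", "hi"}
MODELS = {"aurora", "blizzard"}
SAMPLE_RATES = {8000, 16000, 24000}


def build_speech_payload(
    text: str,
    voice: str,
    *,
    conversational: Optional[bool] = None,
    audio_format: Optional[str] = None,
    language: Optional[str] = None,
    length: Optional[float] = None,
    model: Optional[str] = None,
    sample_rate: Optional[int] = None,
    seed: Optional[int] = None,
    speed: Optional[float] = None,
) -> Dict[str, Any]:
    """Validate the options and return a request body (field names are hypothetical)."""
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if audio_format is not None and audio_format not in FORMATS:
        raise ValueError(f"unsupported format: {audio_format!r}")
    if language is not None and language not in LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    if model is not None and model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    if sample_rate is not None and sample_rate not in SAMPLE_RATES:
        raise ValueError(f"unsupported sample rate: {sample_rate!r}")
    # The page lists the range as x < 300, so the bound is treated as strict.
    if length is not None and not length < 300:
        raise ValueError("length must be below 300 seconds")
    # The page lists the range as 0.25 < x < 2, so both bounds are strict.
    if speed is not None and not 0.25 < speed < 2.0:
        raise ValueError("speed must be between 0.25 and 2.0 (exclusive)")

    # These three options are documented as not working with blizzard.
    if model == "blizzard":
        blocked = {
            "conversational": conversational or None,
            "language": language,
            "length": length,
        }
        used = [name for name, value in blocked.items() if value is not None]
        if used:
            raise ValueError(f"not supported with blizzard: {', '.join(used)}")

    candidates = {
        "text": text,
        "voice": voice,
        "conversational": conversational,
        "format": audio_format,
        "language": language,
        "length": length,
        "model": model,
        "sample_rate": sample_rate,
        "seed": seed,
        "speed": speed,
    }
    # Omit unset options so the server applies its own defaults.
    return {key: value for key, value in candidates.items() if value is not None}
```

Called as build_speech_payload("Hello", "voice-1", audio_format="mp3", speed=1.0), where "voice-1" is a placeholder voice id, the sketch returns a dictionary containing only the four fields that were set.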
Response
The response is of type file.
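Because the audio comes back as a binary stream, it can be written to disk in chunks rather than held in memory. The helper below is an illustrative sketch; save_audio_stream is a hypothetical name, and it assumes the HTTP client in use exposes the response body as a binary file-like object with a read() method.

```python
import shutil
from typing import BinaryIO


def save_audio_stream(stream: BinaryIO, path: str, chunk_size: int = 64 * 1024) -> int:
    """Copy a binary response stream to `path` and return the number of bytes written."""
    with open(path, "wb") as out:
        # copyfileobj reads and writes chunk_size bytes at a time.
        shutil.copyfileobj(stream, out, chunk_size)
        # The file was opened empty, so the position equals the bytes written.
        return out.tell()
```

Any file-like object works here, so the same helper can be exercised with io.BytesIO in tests without making a network call.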