GET
/
v1
/
ai
/
speech
curl --request GET \
  --url https://api.lmnt.com/v1/ai/speech
"HTTP/1.1 200 OK\nContent-Type: audio/wav\nX-Sample-Rate: 24000\nX-Duration-Samples: 57000\n\n<binary data>"

For more detailed output such as the duration of each spoken word, use the Speech POST request.

Query Parameters

X-API-Key
string
required

Your API key; get it from your LMNT account page.

voice
string
required

The voice id of the voice to use for synthesis; voice ids can be retrieved by calls to List voices or Voice info.

text
string
required

The text to synthesize; max 5000 characters per request (including spaces).

language
enum<string>
default:auto

The desired language of the synthesized speech. Two letter ISO 639-1 code. Defaults to auto language detection.

Available options:
auto,
de,
en,
es,
fr,
hi,
id,
it,
ja,
ko,
nl,
pl,
pt,
ru,
sv,
th,
tr,
uk,
vi,
zh
model
enum<string>
default:blizzard

The model to use for synthesis. Learn more about models here.

Available options:
blizzard
format
enum<string>
default:mp3

The file format of the synthesized audio output.

Available options:
aac,
mp3,
mulaw,
raw,
wav
sample_rate
enum<number>
default:24000

The desired output sample rate in Hz. Defaults to 24000 for all formats except mulaw which defaults to 8000.

Available options:
8000,
16000,
24000
seed
integer

Seed used to specify a different take; defaults to random (see here for more details).

Response

200
application/octet-stream

OK

The response is of type file.