Model capabilities

Voice cloning

Create voices to use with LMNT's models with 5-10 seconds of reference speech

Voice cloning is the way you create and save voice prompts to use with LMNT's models. LMNT does the hard work to ensure your prompts are ready to serve your traffic with low latency at scale.

Crafting a good voice prompt

Treat the reference speech like you'd treat a prompt to an LLM: clear, focused, and representative of the style of output you want.

See our voice prompting guide.

Creating a voice

If you're trying things out or only creating a handful of voices, it's easiest to use our Playground.

Otherwise, upload your reference speech prompt through the Voice API and you'll get back a voice object with an id you can use in any speech call.

from lmnt import Lmnt
 
client = Lmnt()
 
with open('reference.mp3', 'rb') as audio:
    voice = client.voices.create(
        name='my-voice',
        enhance=False,
        files=[audio],
    )
 
print(f'Created voice: {voice.id}')

Next steps