Voice cloning

Create voices to use with LMNT's models from 5-10 seconds of reference speech.

Voice cloning is how you create and save voice prompts to use with LMNT's models. LMNT does the hard work to ensure your prompts are ready to serve your traffic with low latency at scale.

Crafting a good voice prompt

Treat the reference speech like you'd treat a prompt to an LLM: clear, focused, and representative of the style of output you want.

See our voice prompting guide.
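
Before uploading, it can help to confirm that a clip falls in the 5-10 second range mentioned at the top of this page. The following is a minimal sketch using only Python's standard wave module. It assumes the reference speech is an uncompressed WAV file, and the helper names (wav_duration_seconds, is_recommended_length) are hypothetical, not part of the LMNT SDK. The range is treated here as guidance from this page, not as a limit the API is known to enforce.

```python
import wave

# Range taken from this page: 5-10 seconds of reference speech.
MIN_SECONDS = 5.0
MAX_SECONDS = 10.0


def wav_duration_seconds(path):
  """Return the duration of a WAV file in seconds (hypothetical helper)."""
  with wave.open(path, 'rb') as wav:
    # Duration is the number of audio frames divided by frames per second.
    return wav.getnframes() / wav.getframerate()


def is_recommended_length(path):
  """Return True if the clip lasts between 5 and 10 seconds, inclusive."""
  return MIN_SECONDS <= wav_duration_seconds(path) <= MAX_SECONDS
```

A script could call is_recommended_length(path) and print a warning before creating the voice. Other formats such as MP3 would need a different decoder than the wave module.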

Creating a voice

If you're trying things out or only creating a handful of voices, it's easiest to use our Playground.

Otherwise, upload your reference speech prompt through the Voice API and you'll get back a voice object with an id you can use in any speech call.

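import asyncio
import sys

from lmnt import AsyncLmnt

async def main():
  # Path to the reference audio file, passed as the first command-line argument.
  path = sys.argv[1]
  client = AsyncLmnt()
  # Open the reference speech in binary mode and upload it to create the voice.
  with open(path, 'rb') as audio:
    voice = await client.voices.create(
      name='my-voice',
      file=audio,
    )
  # The returned id is what you pass as the voice in later speech calls.
  print(f'Created voice: {voice.id}')

asyncio.run(main())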

Next steps