Building with LMNT

Features overview

Explore LMNT's advanced features and capabilities.

LMNT's capabilities are organized into two main areas:

  • Model capabilities: Control how LMNT generates speech so that it matches the feeling you want.
  • Streaming & realtime: Realtime serving for latency-sensitive use cases like voice agents.

If you're new, familiarize yourself with model capabilities first.

Model capabilities

Ways to steer the model to generate speech that matches the feeling you're looking for.

Feature          Description
Voice cloning    Create custom voices from 5–10 seconds of reference speech.
Accents          Steer the accent of generated speech.
Languages        Generate speech in 20 languages with native code-switching.
Word timestamps  Get precise per-word timing to sync subtitles, lip movement, and other modalities.
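
As an illustration of the subtitle use case for word timestamps, here is a minimal sketch that groups per-word timings into SRT cues. The `WordTiming` shape (a word plus start and end times in seconds) is an assumption made for this example, not LMNT's documented response format, and `srt_time` and `words_to_srt` are hypothetical helper names. Adapt the field names and units to whatever the API actually returns.

```python
from dataclasses import dataclass


@dataclass
class WordTiming:
    # Assumed shape for illustration only; not LMNT's documented format.
    text: str     # the spoken word
    start: float  # seconds from the start of the audio (assumed unit)
    end: float    # seconds from the start of the audio (assumed unit)


def srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[WordTiming], max_words: int = 4) -> str:
    """Group consecutive words into SRT cues of at most max_words each."""
    cues = []
    for i in range(0, len(words), max_words):
        group = words[i:i + max_words]
        index = i // max_words + 1  # SRT cue numbers start at 1
        cues.append(
            f"{index}\n"
            f"{srt_time(group[0].start)} --> {srt_time(group[-1].end)}\n"
            f"{' '.join(w.text for w in group)}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(cues)
```

Each cue spans from the start of its first word to the end of its last word, so the subtitle text stays on screen exactly as long as the words are being spoken. A real subtitle pipeline would likely also break cues at punctuation or long pauses rather than at a fixed word count.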

Streaming & realtime

Ways to meet latency deadlines for your specific use cases.

Feature              Description
Speech API           Streams generated speech to you. Full text must be known ahead of time.
Speech Sessions API  Stream in text from an LLM, and LMNT streams speech back to you. Great for voice agents.
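
The difference between the two APIs is easiest to see in the session pattern, where text is written and audio is read concurrently. The sketch below shows that pattern with `asyncio`, using stand-ins throughout: `FakeSpeechSession`, its `append_text`, `finish`, and `audio` methods, and `fake_llm` are all hypothetical names invented for this example and are not LMNT's SDK. The fake session simply echoes each text fragment back as bytes so that the example runs without network access.

```python
import asyncio
from typing import AsyncIterator


class FakeSpeechSession:
    """Hypothetical stand-in for a streaming speech session; not LMNT's SDK."""

    def __init__(self) -> None:
        self._text = asyncio.Queue()

    async def append_text(self, text: str) -> None:
        # Hand a text fragment to the session as soon as it is available.
        await self._text.put(text)

    async def finish(self) -> None:
        # Sentinel: tells the session that no more text is coming.
        await self._text.put(None)

    async def audio(self) -> AsyncIterator[bytes]:
        # Pretend each text fragment becomes exactly one audio chunk.
        while (text := await self._text.get()) is not None:
            yield text.encode("utf-8")


async def fake_llm() -> AsyncIterator[str]:
    """Hypothetical LLM that emits a response a few characters at a time."""
    for token in ["Hello", ", ", "world", "."]:
        await asyncio.sleep(0)  # yield control, as a network read would
        yield token


async def run() -> list[bytes]:
    session = FakeSpeechSession()

    async def writer() -> None:
        # Forward LLM output to the session without waiting for the full text.
        async for token in fake_llm():
            await session.append_text(token)
        await session.finish()

    async def reader() -> list[bytes]:
        # Collect audio as it arrives; a voice agent would play it instead.
        return [chunk async for chunk in session.audio()]

    # Run both directions at once so audio can start before the text ends.
    _, chunks = await asyncio.gather(writer(), reader())
    return chunks
```

Calling `asyncio.run(run())` drives both tasks to completion. With the one-shot Speech API the writer side collapses to a single request carrying the complete text, which is why that API requires the full text up front.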