Submits a request to create a voice with a supplied voice configuration and a batch of input audio data.
Parameters
For unclean audio with background noise, applies processing to attempt to improve quality. Default is false as this can also degrade quality in some circumstances.
One or more input audio files to train the voice in the form of binary wav, mp3, mp4, m4a, or webm attachments.
The display name for this voice
A text description of this voice.
A tag describing the gender of this voice. Has no effect on voice creation.
Returns
Voice where each Voice is:
A text description of this voice.
A tag describing the gender of this voice, e.g. male, female, nonbinary.
The unique identifier of this voice.
The display name of this voice.
The owner of this voice.
Whether this voice has been starred by you or not.
The state of this voice in the training pipeline (e.g., ready, training).
The method by which this voice was created: instant or professional.
A URL that returns a preview speech sample of this voice. The file can be played directly in a browser or audio player.