Handle audio chunk by chunk
Use the real-time speech session API
Use an SDK
Use pcm_s16le or pcm_f32le
pcm_s16le
or pcm_f32le
format. It’s the fastest format we offer and returns 16-bit or 32-bit raw audio.Use async tasks
Keep an open connection
Use servers in the U.S.