Handle audio chunk by chunk
Use the real-time speech session API
Use an SDK
Use raw format
The raw format is the fastest format we offer and returns 16-bit PCM (little-endian) audio at 24 kHz.
Use async tasks
Keep an open connection
Use servers in the U.S.
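
To illustrate the chunk-by-chunk and raw-format tips above, here is a minimal sketch that writes raw PCM chunks to a WAV container as they arrive, instead of waiting for the full response. The sample rate and bit depth come from the raw format description. The mono channel count, the function name `write_pcm_chunks_to_wav`, and the shape of the `chunks` iterable are assumptions made for illustration and are not part of any SDK.

```python
import io
import wave
from typing import BinaryIO, Iterable

SAMPLE_RATE_HZ = 24_000   # raw format: 24 kHz
SAMPLE_WIDTH_BYTES = 2    # raw format: 16-bit PCM, little-endian
CHANNELS = 1              # assumption: mono audio


def write_pcm_chunks_to_wav(chunks: Iterable[bytes], out: BinaryIO) -> int:
    """Write raw PCM chunks to `out` as WAV while they arrive.

    Returns the number of audio frames written. `out` must be seekable,
    because the wave module patches the header on close.
    """
    frame_size = SAMPLE_WIDTH_BYTES * CHANNELS
    pending = b""  # holds a partial frame if a chunk splits a sample
    frames = 0
    with wave.open(out, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav.setframerate(SAMPLE_RATE_HZ)
        for chunk in chunks:
            data = pending + chunk
            usable = len(data) - (len(data) % frame_size)
            wav.writeframes(data[:usable])
            pending = data[usable:]
            frames += usable // frame_size
    return frames


if __name__ == "__main__":
    # Hypothetical stand-in for a streamed response: 3 chunks of silence.
    fake_chunks = [b"\x00\x00" * 1200 for _ in range(3)]
    buffer = io.BytesIO()
    total = write_pcm_chunks_to_wav(fake_chunks, buffer)
    print(f"wrote {total} frames ({total / SAMPLE_RATE_HZ:.3f} s of audio)")
```

Network chunks are not guaranteed to end on a sample boundary, so the sketch carries any trailing odd byte over to the next chunk rather than writing a half sample.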