What audio quality is required for voice samples?

What is the maximum length for a single text-to-speech conversion?

Can LMNT mimic specific emotional tones or speaking styles?