Generates audio from the input text. Returns the audio data as a file, or streams it via server-sent events (SSE).
Model in provider/model format
Text to convert to speech
Set to "sse" to enable streaming
Allowed value: sse
Audio formats: mp3, opus, aac, flac, wav, pcm
Range: 0.25 <= x <= 4
Successful response
The response is of type file.
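The constraints above can be checked on the client before a request is sent. The following is a minimal sketch, not the service's official client. The request keys ("model", "input", "response_format", "speed", "stream_format") are assumed names, since this reference does not list them, and treating the 0.25 to 4 range as a playback speed is likewise an assumption. The model id "acme/tts-1" in the usage note is hypothetical. The SSE helper only follows the generic "data:" line framing of server-sent events and makes no claim about what each event contains.

```python
from typing import Any, Dict, Iterable, Iterator, Optional

# Values taken from the reference text above.
AUDIO_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MIN_VALUE, MAX_VALUE = 0.25, 4.0


def build_speech_request(
    model: str,
    text: str,
    audio_format: Optional[str] = None,
    speed: Optional[float] = None,
    stream: bool = False,
) -> Dict[str, Any]:
    """Validate the arguments and return a JSON-serializable request body.

    All dictionary keys below are ASSUMED names, not confirmed by the reference.
    """
    if "/" not in model:
        raise ValueError("model must be in provider/model format")
    body: Dict[str, Any] = {"model": model, "input": text}
    if audio_format is not None:
        if audio_format not in AUDIO_FORMATS:
            raise ValueError(f"unsupported audio format: {audio_format!r}")
        body["response_format"] = audio_format  # assumed key
    if speed is not None:
        # Assumption: the documented 0.25 <= x <= 4 range applies to speed.
        if not MIN_VALUE <= speed <= MAX_VALUE:
            raise ValueError("speed must satisfy 0.25 <= x <= 4")
        body["speed"] = speed  # assumed key
    if stream:
        body["stream_format"] = "sse"  # assumed key; "sse" is from the reference
    return body


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the joined "data:" payload of each SSE event.

    Events are separated by blank lines; other lines (comments, other
    fields) are ignored. A trailing event without a closing blank line
    is still yielded, which is more lenient than a strict SSE parser.
    """
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("data:"):
            value = line[5:]
            if value.startswith(" "):
                value = value[1:]  # drop the single optional leading space
            buffer.append(value)
        elif line == "" and buffer:
            yield "\n".join(buffer)
            buffer = []
    if buffer:
        yield "\n".join(buffer)
```

For example, build_speech_request("acme/tts-1", "Hello", audio_format="mp3", stream=True) returns a body with the streaming option set, while an unsupported format or an out-of-range value raises ValueError before any network call is made. In the non-streaming case the response is a file, so a caller would typically write the raw response bytes to disk instead of parsing events.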