Documentation Index
Fetch the complete documentation index at: https://docs.getbifrost.ai/llms.txt
Use this file to discover all available pages before exploring further.
Streaming Text Completion
Request text completions with streaming enabled to receive partialtext chunks as they are generated.
Streaming Chat Responses
Receive AI responses in real-time as they’re generated. Perfect for chat applications where you want to show responses as they’re being typed, improving user experience.Note: Streaming requests also follow the default timeout setting defined in provider configuration, which defaults to 30 seconds.
Bifrost standardizes all stream responses to send usage and finish reason only in the last chunk, and content in the previous chunks.
Responses API Streaming
Stream the OpenAI-style Responses API with event-based SSE. This includesevent: lines and does not use the [DONE] marker; the stream ends when the connection closes.
Text-to-Speech Streaming: Real-time Audio Generation
Stream audio generation in real-time as text is converted to speech. Ideal for long texts or when you need immediate audio playback.> audio_stream.txt to redirect output to a file.
Speech-to-Text Streaming: Real-time Audio Transcription
Stream audio transcription results as they’re processed. Get immediate text output for real-time applications or long audio files.--form 'language="en"' or --form 'prompt="context hint"' for better accuracy.
Audio Format Support
Speech Synthesis: Supports"response_format": "mp3" (default) and "response_format": "wav"
Transcription Input: Accepts MP3, WAV, M4A, and other common audio formats
Note: Streaming capabilities vary by provider and model. Check each provider’s documentation for specific streaming support and limitations.
Next Steps
Now that you understand streaming responses, explore these related topics:Essential Topics
- Tool Calling - Enable AI models to use external tools and functions
- Multimodal AI - Process images, audio, and multimedia content
- Provider Configuration - Multiple providers for redundancy
- Integrations - Drop-in compatibility with existing SDKs
Advanced Topics
- Core Features - Advanced Bifrost capabilities
- Architecture - How Bifrost works internally
- Deployment - Production setup and scaling

