Skip to main content

Overview

Parasail is an OpenAI-compatible provider offering high-performance inference. Bifrost delegates to the OpenAI implementation with standard parameter handling. Key characteristics:
  • OpenAI API compatibility - Identical request/response format
  • Full streaming support - Server-Sent Events with usage tracking
  • Tool calling - Complete function definition and execution
  • Parameter filtering - Removes unsupported OpenAI-specific fields
  • Responses API - Fallback to Chat Completions

Supported Operations

OperationNon-StreamingStreamingEndpoint
Chat Completions/v1/chat/completions
Responses API/v1/chat/completions
List Models-/v1/models
Text Completions-
Embeddings-
Speech (TTS)-
Transcriptions (STT)-
Files-
Batch-
Unsupported Operations (❌): Text Completions, Embeddings, Speech, Transcriptions, Files, and Batch are not supported by the upstream Parasail API. These return UnsupportedOperationError.

1. Chat Completions

Request Parameters

Parasail supports all standard OpenAI chat completion parameters. For full parameter reference and behavior, see OpenAI Chat Completions.

Filtered Parameters

Removed for Parasail compatibility:
  • prompt_cache_key - Not supported
  • verbosity - Anthropic-specific
  • store - Not supported
  • service_tier - Not supported

Reasoning Parameter

Reasoning via standard OpenAI format:
{
  "model": "parasail-llama-33-70b-fp8",
  "messages": [...],
  "reasoning_effort": "high"
}
Parasail supports all standard OpenAI message types, tools, responses, and streaming formats. For details on message handling, tool conversion, responses, and streaming, refer to OpenAI Chat Completions.

2. Responses API

Converted internally to Chat Completions:
ResponsesRequest → ChatRequest → ChatCompletion → ResponsesResponse
Same parameter support as Chat Completions.

3. List Models

Lists available Parasail models with capabilities and context information.

Unsupported Features

FeatureReason
Text CompletionsNot offered by Parasail API
EmbeddingNot offered by Parasail API
Speech/TTSNot offered by Parasail API
Transcription/STTNot offered by Parasail API
Batch OperationsNot offered by Parasail API
File ManagementNot offered by Parasail API

Caveats

Severity: Medium Behavior: Cache control directives are removed from messages Impact: Prompt caching features don’t work Code: Stripped during JSON marshaling
Severity: Low Behavior: OpenAI-specific parameters filtered out Impact: prompt_cache_key, verbosity, store removed Code: filterOpenAISpecificParameters
Severity: Low Behavior: User field > 64 characters silently dropped Impact: Longer user identifiers are lost Code: SanitizeUserField enforces 64-char max