Path Parameters
Azure deployment ID
Body
Model identifier in 'provider/model' format (e.g., 'anthropic/claude-2.1')
"anthropic/claude-2.1"
Text prompt for completion
"The benefits of artificial intelligence include"
If set, partial message deltas will be sent, like in ChatGPT. Tokens will be sent as data-only server-sent events as they become available.
Generates best_of completions server-side and returns the 'best' one. See n for comparison.
Echo back the prompt in addition to the completion.
Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim.
-2 <= x <= 2Modify the likelihood of specified tokens appearing in the completion. Accepts a JSON object that maps tokens to an associated bias value from -100 to 100.
Include the log probabilities on the logprobs most likely tokens, as well the chosen tokens.
Maximum number of tokens to generate
x >= 11000
How many completions to generate for each prompt.
Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics.
-2 <= x <= 2This feature is in Beta. If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same seed and parameters should return the same result. Determinism is not guaranteed, and you should refer to the system_fingerprint response parameter to monitor changes in the backend.
Up to 4 sequences where the API will stop generating further tokens.
The suffix that comes after a completion of inserted text.
Controls randomness in the output. Higher values make the output more random, while lower values make it more deterministic.
0 <= x <= 2Controls diversity via nucleus sampling. 0.5 means half of all likelihood-weighted options are considered.
0 <= x <= 1A unique identifier representing your end-user, which can help OpenAI to monitor and detect abuse.
Fallback model names in 'provider/model' format
[
"anthropic/claude-3-sonnet-20240229",
"openai/gpt-4o"
]Response
Azure OpenAI-compatible text completion response
Unique response identifier
"chatcmpl-123"
Response type
text.completion, chat.completion, embedding, speech, transcribe, responses.completion "chat.completion"
Array of completion choices for chat and text completions. Not present for responses type.
Array of embedding objects
Array of messages for responses type.
The conversation ID.
The reason the model stopped generating tokens.
The reason the model stopped generating tokens.
The stop sequence that was generated.
Model used for generation
"gpt-4o"
Unix timestamp of creation
1677652288
Service tier used
System fingerprint

