Create response (OpenAI Responses API)

curl --request POST \
  --url http://localhost:8080/openai/v1/responses \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "gpt-4",
  "input": "<string>",
  "stream": true,
  "instructions": "<string>",
  "max_output_tokens": 123,
  "metadata": {},
  "parallel_tool_calls": true,
  "previous_response_id": "<string>",
  "reasoning": {
    "effort": "none",
    "generate_summary": "auto",
    "summary": "auto",
    "max_tokens": 123
  },
  "store": true,
  "temperature": 1,
  "text": {
    "format": {
      "type": "text",
      "json_schema": {
        "name": "<string>",
        "schema": {},
        "strict": true
      }
    }
  },
  "tool_choice": "none",
  "tools": [
    {
      "type": "function",
      "name": "<string>",
      "description": "<string>",
      "cache_control": {
        "type": "ephemeral",
        "ttl": "<string>"
      },
      "parameters": {
        "type": "<string>",
        "description": "<string>",
        "required": [
          "<string>"
        ],
        "properties": {},
        "enum": [
          "<string>"
        ],
        "additionalProperties": true
      },
      "strict": true,
      "vector_store_ids": [
        "<string>"
      ],
      "filters": {},
      "max_num_results": 123,
      "ranking_options": {},
      "display_height": 123,
      "display_width": 123,
      "environment": "<string>",
      "enable_zoom": true,
      "search_context_size": "<string>",
      "user_location": {},
      "server_label": "<string>",
      "server_url": "<string>",
      "allowed_tools": {},
      "authorization": "<string>",
      "connector_id": "<string>",
      "headers": {},
      "require_approval": {},
      "server_description": "<string>",
      "container": {},
      "background": "<string>",
      "input_fidelity": "<string>",
      "input_image_mask": {},
      "moderation": "<string>",
      "output_compression": 123,
      "output_format": "<string>",
      "partial_images": 123,
      "quality": "<string>",
      "size": "<string>",
      "format": {}
    }
  ],
  "top_p": 123,
  "truncation": "auto",
  "user": "<string>",
  "fallbacks": [
    "<string>"
  ]
}
'

{
  "id": "<string>",
  "background": true,
  "conversation": {},
  "created_at": 123,
  "error": {
    "code": "<string>",
    "message": "<string>"
  },
  "include": [
    "<string>"
  ],
  "incomplete_details": {
    "reason": "<string>"
  },
  "instructions": {},
  "max_output_tokens": 123,
  "max_tool_calls": 123,
  "metadata": {},
  "model": "<string>",
  "output": [
    {
      "id": "<string>",
      "type": "message",
      "status": "in_progress",
      "role": "assistant",
      "content": "<string>",
      "call_id": "<string>",
      "name": "<string>",
      "arguments": "<string>",
      "output": {},
      "action": {},
      "error": "<string>",
      "queries": [
        "<string>"
      ],
      "results": [
        {}
      ],
      "summary": [
        {
          "type": "summary_text",
          "text": "<string>"
        }
      ],
      "encrypted_content": "<string>"
    }
  ],
  "parallel_tool_calls": true,
  "previous_response_id": "<string>",
  "prompt": {},
  "prompt_cache_key": "<string>",
  "reasoning": {
    "effort": "none",
    "generate_summary": "<string>",
    "summary": "auto",
    "max_tokens": 123
  },
  "safety_identifier": "<string>",
  "service_tier": "<string>",
  "status": "completed",
  "stop_reason": "<string>",
  "store": true,
  "temperature": 123,
  "text": {
    "format": {
      "type": "text",
      "name": "<string>",
      "schema": {},
      "strict": true
    },
    "verbosity": "low"
  },
  "top_logprobs": 123,
  "top_p": 123,
  "tool_choice": "none",
  "tools": [
    {
      "type": "function",
      "name": "<string>",
      "description": "<string>",
      "cache_control": {
        "type": "ephemeral",
        "ttl": "<string>"
      },
      "parameters": {
        "type": "<string>",
        "description": "<string>",
        "required": [
          "<string>"
        ],
        "properties": {},
        "enum": [
          "<string>"
        ],
        "additionalProperties": true
      },
      "strict": true,
      "vector_store_ids": [
        "<string>"
      ],
      "filters": {},
      "max_num_results": 123,
      "ranking_options": {},
      "display_height": 123,
      "display_width": 123,
      "environment": "<string>",
      "enable_zoom": true,
      "search_context_size": "<string>",
      "user_location": {},
      "server_label": "<string>",
      "server_url": "<string>",
      "allowed_tools": {},
      "authorization": "<string>",
      "connector_id": "<string>",
      "headers": {},
      "require_approval": {},
      "server_description": "<string>",
      "container": {},
      "background": "<string>",
      "input_fidelity": "<string>",
      "input_image_mask": {},
      "moderation": "<string>",
      "output_compression": 123,
      "output_format": "<string>",
      "partial_images": 123,
      "quality": "<string>",
      "size": "<string>",
      "format": {}
    }
  ],
  "truncation": "<string>",
  "usage": {
    "input_tokens": 123,
    "input_tokens_details": {
      "text_tokens": 123,
      "audio_tokens": 123,
      "image_tokens": 123,
      "cached_tokens": 123
    },
    "output_tokens": 123,
    "output_tokens_details": {
      "text_tokens": 123,
      "accepted_prediction_tokens": 123,
      "audio_tokens": 123,
      "reasoning_tokens": 123,
      "rejected_prediction_tokens": 123,
      "citation_tokens": 123,
      "num_search_queries": 123,
      "cached_tokens": 123
    },
    "total_tokens": 123,
    "cost": {
      "input_tokens_cost": 123,
      "output_tokens_cost": 123,
      "request_cost": 123,
      "total_cost": 123
    }
  },
  "extra_fields": {
    "request_type": "<string>",
    "provider": "openai",
    "model_requested": "<string>",
    "model_deployment": "<string>",
    "latency": 123,
    "chunk_index": 123,
    "raw_request": {},
    "raw_response": {},
    "cache_debug": {
      "cache_hit": true,
      "cache_id": "<string>",
      "hit_type": "<string>",
      "provider_used": "<string>",
      "model_used": "<string>",
      "input_tokens": 123,
      "threshold": 123,
      "similarity": 123
    }
  },
  "search_results": [
    {
      "title": "<string>",
      "url": "<string>",
      "date": "<string>",
      "last_updated": "<string>",
      "snippet": "<string>",
      "source": "<string>"
    }
  ],
  "videos": [
    {
      "url": "<string>",
      "thumbnail_url": "<string>",
      "thumbnail_width": 123,
      "thumbnail_height": 123,
      "duration": 123
    }
  ],
  "citations": [
    "<string>"
  ]
}

POST /openai/v1/responses


Body

application/json

model

string

required

Model identifier

Example:

"gpt-4"

input

required

Input to the model: either a string or an array of messages
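The two accepted forms of `input` can be sketched as follows. This is a minimal illustration that builds the request without sending it. The endpoint URL and the `model` value come from the curl example above; the `role` and `content` keys used for a message are an assumption borrowed from the usual chat message shape and are not spelled out on this page. `build_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
URL = "http://localhost:8080/openai/v1/responses"


def build_request(model, input_value):
    """Return an unsent POST request carrying the two required body fields."""
    if not isinstance(input_value, (str, list)):
        raise TypeError("input must be a string or a list of messages")
    body = json.dumps({"model": model, "input": input_value}).encode("utf-8")
    return urllib.request.Request(
        URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Form 1: plain string input.
as_string = build_request("gpt-4", "Summarize this paragraph.")

# Form 2: array of messages (the role/content keys are an assumption).
as_messages = build_request(
    "gpt-4",
    [{"role": "user", "content": "Summarize this paragraph."}],
)
```

With a server listening on that address, either request could be passed to `urllib.request.urlopen` to send it.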

stream

boolean

instructions

string

System instructions for the model

max_output_tokens

integer

metadata

object

parallel_tool_calls

boolean

previous_response_id

string
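This page lists `previous_response_id` without a description. In the upstream OpenAI Responses API the field links a new request to an earlier response so a conversation can continue; the sketch below assumes this endpoint passes it through with the same meaning. `follow_up_payload` and the sample values are hypothetical.

```python
def follow_up_payload(previous_response, model, next_input):
    """Build a request body that points back at an earlier response."""
    return {
        "model": model,
        "input": next_input,
        # "id" is the top-level identifier in the response body.
        "previous_response_id": previous_response["id"],
    }


# Made-up sample standing in for a parsed response body.
first_response = {"id": "resp_123", "status": "completed"}
payload = follow_up_payload(first_response, "gpt-4", "And in one sentence?")
```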

reasoning

object

store

boolean

temperature

number

Required range: 0 <= x <= 2
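A client can check the documented range before sending a request. The helper below is a small sketch; `check_temperature` is a hypothetical name, and rejecting booleans is a design choice of this example rather than something the page states.

```python
def check_temperature(value):
    """Return the value unchanged if it lies in the documented range [0, 2]."""
    # bool is a subclass of int in Python, so it is rejected explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        raise TypeError("temperature must be a number")
    if not 0 <= value <= 2:
        raise ValueError(f"temperature {value} is outside 0 <= x <= 2")
    return value
```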

text

object
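The request example above nests `name`, `schema`, and `strict` under `text.format.json_schema`. A structured-output configuration following that shape might be built as below. The `"json_schema"` value for `type` is an assumption (the example only shows `"text"`), and `json_schema_text_config` is a hypothetical helper.

```python
def json_schema_text_config(name, schema, strict=True):
    """Build a value for the request's "text" field that asks for JSON output."""
    return {
        "format": {
            # Assumption: the example on this page only shows "text" here.
            "type": "json_schema",
            "json_schema": {"name": name, "schema": schema, "strict": strict},
        }
    }


text_config = json_schema_text_config(
    "city_info",
    {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
)
```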

tool_choice

Available options: none, auto, required

tools

object[]
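The tool object in the request example appears to merge the fields of every tool type into one placeholder. For a `function` tool, only a handful of them look relevant: `type`, `name`, `description`, `parameters`, and `strict`. The sketch below builds such an entry under that assumption; `function_tool` and the weather example are hypothetical, and the `parameters` value follows ordinary JSON Schema conventions.

```python
def function_tool(name, description, properties, required):
    """Build one entry for the request's "tools" array (function type only)."""
    return {
        "type": "function",
        "name": name,
        "description": description,
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": list(required),
            "additionalProperties": False,
        },
        "strict": True,
    }


weather_tool = function_tool(
    "get_weather",
    "Look up the current weather for a city.",
    {"city": {"type": "string"}},
    ["city"],
)

request_body = {
    "model": "gpt-4",
    "input": "What is the weather in Oslo?",
    "tools": [weather_tool],
    "tool_choice": "auto",  # one of: none, auto, required
}
```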

top_p

number

truncation

enum<string>

Available options: auto, disabled

user

string

fallbacks

string[]

Response

Successful response

id

string

background

boolean

conversation

object

created_at

integer

error

object
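According to the response example, the `error` object carries a `code` and a `message`. One way a client might surface it is sketched below. `ResponseError` and `raise_for_error` are hypothetical names, and treating a missing or empty `error` as success is an assumption of this example.

```python
class ResponseError(Exception):
    """Raised when a response body carries a populated "error" object."""

    def __init__(self, code, message):
        super().__init__(f"{code}: {message}")
        self.code = code
        self.message = message


def raise_for_error(response):
    """Return the response unchanged, or raise if it reports an error."""
    error = response.get("error")
    if error:
        raise ResponseError(error.get("code"), error.get("message"))
    return response
```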

include

string[]

incomplete_details

object

instructions

object

max_output_tokens

integer

max_tool_calls

integer

metadata

object

model

string

output

object[]
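Each element of `output` has a `type`, and the response example shows `message` items with a string `content`. A rough way to collect the assistant text is sketched below. It assumes `content` is a plain string as in the example; items whose `content` has another shape are skipped rather than interpreted. `message_text` is a hypothetical helper.

```python
def message_text(response):
    """Concatenate the string content of all "message" items in the output."""
    parts = []
    for item in response.get("output", []):
        content = item.get("content")
        if item.get("type") == "message" and isinstance(content, str):
            parts.append(content)
    return "".join(parts)


# Made-up sample shaped like the response example on this page.
sample = {
    "output": [
        {"type": "message", "role": "assistant", "content": "Hello"},
        {"type": "reasoning", "summary": []},
        {"type": "message", "role": "assistant", "content": ", world"},
    ]
}
text = message_text(sample)
```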

parallel_tool_calls

boolean

previous_response_id

string

prompt

object

prompt_cache_key

string

reasoning

object

safety_identifier

string

service_tier

string

status

enum<string>

Available options: completed, failed, in_progress, canceled, queued, incomplete
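A client that waits on a response usually needs to know whether a status is final. The grouping below is an assumption drawn from the status names, not something this page states: `queued` and `in_progress` are treated as still running, and the other four as finished. `is_finished` is a hypothetical helper.

```python
# Assumed grouping of the six documented status values.
IN_FLIGHT = {"queued", "in_progress"}
FINISHED = {"completed", "failed", "canceled", "incomplete"}


def is_finished(status):
    """Return True for a final status, False while work is still pending."""
    if status in FINISHED:
        return True
    if status in IN_FLIGHT:
        return False
    raise ValueError(f"unknown status: {status!r}")
```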

stop_reason

string

store

boolean

temperature

number

text

object

top_logprobs

integer

top_p

number

tool_choice

Available options: none, auto, required

tools

object[]

truncation

string

usage

object
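The `usage` object in the response example holds token counts, per-category details, and a `cost` breakdown. A compact summary could be pulled out as sketched below. The field names are taken from that example; defaulting absent counts to 0 and an absent cost to `None` is a choice made for this illustration. `usage_summary` is a hypothetical helper.

```python
def usage_summary(response):
    """Pick the headline token and cost figures out of a response body."""
    usage = response.get("usage") or {}
    input_details = usage.get("input_tokens_details") or {}
    cost = usage.get("cost") or {}
    return {
        "input_tokens": usage.get("input_tokens", 0),
        "cached_input_tokens": input_details.get("cached_tokens", 0),
        "output_tokens": usage.get("output_tokens", 0),
        "total_tokens": usage.get("total_tokens", 0),
        "total_cost": cost.get("total_cost"),
    }


# Made-up numbers shaped like the response example on this page.
sample_response = {
    "usage": {
        "input_tokens": 40,
        "input_tokens_details": {"cached_tokens": 10},
        "output_tokens": 25,
        "total_tokens": 65,
        "cost": {"total_cost": 0.002},
    }
}
summary = usage_summary(sample_response)
```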

extra_fields

object

Additional fields included in responses

search_results

object[]

videos

object[]

citations

string[]
