Create response | Respan Docs

Send a response request through the Respan gateway using the OpenAI Responses API format. Supports streaming, tool use, and prompt management.

Respan parameters can be passed the same way as Create chat completion: top-level fields, nested under respan_params, or via X-Respan-Params header.

Request

This endpoint expects an object.

modelstringRequired

Model to use.

inputstring or list of objectsRequired

Input text or array of conversation messages.

instructionsstringOptional

System instructions for the model.

streambooleanOptional

Stream the response as server-sent events.

temperaturedoubleOptional

Sampling temperature (0-2).

max_output_tokensintegerOptional

Maximum tokens to generate.

top_pdoubleOptional

Nucleus sampling parameter.

toolslist of objectsOptional

Tools the model may call.

previous_response_idstringOptional

ID of a previous response for multi-turn conversations.

fallback_modelslist of stringsOptional

Backup models if the primary model fails.

customer_credentialsobjectOptional

Per-customer LLM provider credentials.

credential_overrideobjectOptional

One-off credential overrides per provider.

cache_enabledbooleanOptional

Enable response caching.

cache_ttlintegerOptional

Cache TTL in seconds.

promptobjectOptional

Prompt template config. Properties: prompt_id (required), variables, version, echo. See Prompt management.

retry_paramsobjectOptional

Retry config. Properties: retry_enabled (boolean), num_retries, retry_after (seconds).

disable_logbooleanOptional

When true, omits input/output from the log. Metrics still recorded.

modelslist of objectsOptional

Load balancing model list.

exclude_providerslist of stringsOptional

Providers to exclude from routing.

exclude_modelslist of stringsOptional

Models to exclude from routing.

metadataobjectOptional

Custom key-value metadata attached to the span.

custom_identifierstringOptional

Indexed custom tag for fast querying.

customer_identifierstringOptional

End user identifier for analytics and budgets.

customer_paramsobjectOptional

Extended customer info.

thread_identifierstringOptional

Conversation thread ID.

positive_feedbackbooleanOptional

User feedback. true = liked, false = disliked.

propertiesobjectOptional

Typed metadata preserving native types.

respan_paramsobjectOptional

Namespaced container for all Respan parameters.

Response

Model response

Errors

400

Bad Request Error

401

Unauthorized Error

1	curl -X POST https://api.respan.ai/api/responses \
2	-H "Authorization: Bearer sk_live_xxxxx" \
3	-H "Content-Type: application/json" \
4	-d '{
5	"model": "gpt-4o",
6	"input": "string"
7	}'

1	{
2	"id": "resp_abc123",
3	"object": "response",
4	"created_at": 1709155200,
5	"model": "gpt-4o-mini",
6	"output": [
7	{
8	"type": "message",
9	"role": "assistant",
10	"content": [
11	{
12	"type": "output_text",
13	"text": "Why do programmers prefer dark mode? Because light attracts bugs!"
14	}
15	]
16	}
17	],
18	"usage": {
19	"input_tokens": 12,
20	"output_tokens": 18,
21	"total_tokens": 30
22	}
23	}

Headers

Request

Response

Errors