For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DiscordPlatform
DocumentationIntegrationsAPI referenceSDKsChangelog
DocumentationIntegrationsAPI referenceSDKsChangelog
    • Overview
  • Tracing
  • Gateway
      • OpenAI
      • Anthropic
      • Google Gemini
      • xAI
      • AWS Bedrock
      • Google Vertex AI
      • Azure OpenAI
      • Mistral
      • DeepSeek
      • Groq
      • Cohere
      • Perplexity
      • Together AI
      • Fireworks AI
      • Cerebras
      • OpenRouter
      • Replicate
      • Baseten
      • Moonshot
      • Nebius
      • Novita AI
      • FriendliAI
      • Parasail
      • Tera
      • Inference.net
      • AI21 Labs
      • Reducto
      • Wafer
      • Nextbit
      • Azure DeepSeek
      • Google PaLM
      • Custom Provider
  • Others
  • Migrating
    • Braintrust
    • Langfuse
    • Portkey
LogoLogo
DiscordPlatform
On this page
  • Quick setup
  • Send your first request
  • More integrations
  • Switch models
  • Use your own Tera key (BYOK)
  • Override credentials per model (Optional)
  • Log without proxying (Optional)
GatewayModel Providers

Provider: Tera

Call Tera models through Respan Gateway with unified logs, cost, and latency.
Was this page helpful?
Previous

Provider: Inference.net

Call Inference.net models through Respan Gateway with unified logs, cost, and latency.
Next
Built with
This page is for Respan LLM Gateway users.

Use Respan Gateway to call Tera models (tera/gpt-oss-20b) while keeping unified observability (logs, cost, latency, reliability) in Respan.

Quick setup

1

Get a Respan API key

Sign up and create a key on the API keys page.

2

Add credits (recommended)

Top up credits to pay through Respan. No Tera key required, Respan handles provider auth and billing.

Prefer to route through your own Tera account? See Use your own Tera key.

Send your first request

Pick the integration that matches your stack. The base URL is https://api.respan.ai/api and the only key needed is your RESPAN_API_KEY.

OpenAI SDK
Respan API

Tera is OpenAI-compatible. Point the OpenAI SDK at the Respan gateway and call any Tera model with the tera/ prefix.

1from openai import OpenAI
2
3client = OpenAI(
4 api_key="YOUR_RESPAN_API_KEY",
5 base_url="https://api.respan.ai/api",
6)
7
8response = client.chat.completions.create(
9 model="tera/gpt-oss-20b",
10 messages=[{"role": "user", "content": "Hello, Tera!"}],
11)
12print(response.choices[0].message.content)

More integrations

Tera models work with every Respan gateway integration:

  • OpenAI SDK
  • OpenAI Agents SDK
  • LangChain
  • LangGraph
  • LlamaIndex
  • Pydantic AI
  • Respan native (OTel)

Switch models

Change the model parameter to call any supported model through the same client. Use the tera/ prefix to disambiguate when routing across providers. Browse the full list on the Models page.

1client.chat.completions.create(model="tera/gpt-oss-20b", messages=messages)
2client.chat.completions.create(model="openai/gpt-5.5", messages=messages)
3client.chat.completions.create(model="anthropic/claude-sonnet-4-5", messages=messages)

Use your own Tera key (BYOK)

Credits are the default path. If you’d rather bill Tera directly, attach your own provider key.

Global (UI)
Per-request (Code)
1

Open Providers

Go to the Providers page.

2

Add Tera

Select Tera and paste your tera.api_key.

3

Load balancing (Optional)

Add multiple credential sets and use Load balancing weight to distribute traffic across them.

Override credentials per model (Optional)

Use credential_override when one model on a request should use a different Tera key than the default.

1{
2 "customer_credentials": {
3 "tera": { "api_key": "YOUR_TERA_API_KEY" }
4 },
5 "credential_override": {
6 "tera/gpt-oss-20b": { "api_key": "ANOTHER_TERA_API_KEY" }
7 }
8}

Log without proxying (Optional)

Already calling Tera directly? Send logs to Respan asynchronously to track cost, latency, and performance for those external calls.

1import requests
2
3requests.post(
4 "https://api.respan.ai/api/request-logs/create/",
5 headers={
6 "Authorization": "Bearer YOUR_RESPAN_API_KEY",
7 "Content-Type": "application/json",
8 },
9 json={
10 "model": "tera/gpt-oss-20b",
11 "prompt_messages": [{"role": "user", "content": "Hello, how are you?"}],
12 "completion_message": {"role": "assistant", "content": "Hello from Tera through Respan."},
13 "cost": 0.001,
14 "generation_time": 1.2,
15 "customer_params": {"customer_identifier": "user_123"},
16 },
17)

See the logging guide for the full setup.