For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DiscordPlatform
DocumentationIntegrationsAPI referenceSDKsChangelog
DocumentationIntegrationsAPI referenceSDKsChangelog
    • Overview
  • Tracing
  • Gateway
      • OpenAI Agents
      • Claude Agent SDK
      • Vercel AI SDK
      • Pydantic AI
      • CrewAI
      • Haystack
      • LangChain
      • LangGraph
      • Langflow
      • LlamaIndex
      • AutoGen
      • DSPy
      • Google ADK
      • Smolagents
      • Strands Agents
      • AgentSpec
      • Guardrails
      • Agno
      • MCP
      • BeeAI
      • Pipecat
      • Superagent
  • Others
  • Migrating
    • Braintrust
    • Portkey
    • Langfuse
LogoLogo
DiscordPlatform
On this page
  • Setup
  • Switch models
GatewayAgent Frameworks

MCP (gateway)

Was this page helpful?
Previous

BeeAI (gateway)

Next
Built with

When MCP tool calls trigger LLM completions, route those LLM calls through the Respan gateway to use 250+ models from different providers. Only your RESPAN_API_KEY is needed — no separate provider keys required.

Setup

1

Install packages

$pip install respan-ai openinference-instrumentation-mcp mcp
2

Set environment variables

$export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

No provider key needed — the Respan gateway handles provider authentication.

3

Point your LLM client to the Respan gateway

1import os
2from openai import OpenAI
3
4llm_client = OpenAI(
5 api_key=os.environ["RESPAN_API_KEY"],
6 base_url="https://api.respan.ai/api",
7)
8
9response = llm_client.chat.completions.create(
10 model="gpt-4.1-nano",
11 messages=[{"role": "user", "content": "Use my MCP tool to fetch data."}],
12)
13print(response.choices[0].message.content)

Switch models

Change the model parameter on your LLM client to use 250+ models from different providers through the same gateway.

1response = llm_client.chat.completions.create(model="claude-sonnet-4-5-20250929", messages=messages)
2response = llm_client.chat.completions.create(model="gemini-2.5-flash", messages=messages)

See the full model list.