Set up Respan
- Sign up — Create an account at platform.respan.ai
- Create an API key — Generate one on the API keys page
- Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page
Overview
This cookbook walks through the complete Respan workflow with a real example: a customer support chatbot. By the end, you’ll have:
- LLM calls routed through the gateway
- Prompts managed in the platform
- Full tracing for every conversation
- Automated evaluation on production traffic
Step 1: Set up the gateway
Point your OpenAI SDK at the Respan gateway endpoint so that every request is routed through Respan.

Step 2: Create a prompt template
Go to Prompts and create a prompt:
- Name: support_chatbot
- System message:
- Model: gpt-4o-mini
Step 3: Use the prompt in code
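Respan's actual client surface may differ; as an illustration of the pattern, here is a stdlib-only sketch that pulls a prompt by name over a hypothetical REST route (the path, auth header, and response shape are all assumptions, so consult the Respan API reference):

```python
import json
import os
import urllib.request

# Hypothetical base URL and prompt-retrieval route; the real API paths
# may differ.
RESPAN_API = "https://api.respan.ai/v1"

def build_prompt_request(prompt_name: str, api_key: str) -> urllib.request.Request:
    """Build the HTTP request for the latest deployed version of a prompt."""
    return urllib.request.Request(
        f"{RESPAN_API}/prompts/{prompt_name}",
        headers={"Authorization": f"Bearer {api_key}"},
    )

def fetch_prompt(prompt_name: str) -> dict:
    """Fetch the prompt template at runtime, so edits made in the
    platform take effect without a redeploy."""
    req = build_prompt_request(prompt_name, os.environ["RESPAN_API_KEY"])
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

A chat turn would then fetch support_chatbot, substitute the user message into the returned template, and send the rendered messages through the gateway client from Step 1.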
Fetch the prompt at runtime so you can update it without redeploying.

Step 4: Add tracing
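If you are not using a dedicated tracing SDK, the wrapping can be as simple as a decorator that records a span per call. A self-contained sketch (the span fields and the in-memory SPANS list are illustrative; a real integration would export spans to Respan):

```python
import functools
import time
import uuid

# Illustrative in-memory span store; a real integration would ship these
# to the tracing backend instead.
SPANS: list = []

def trace(name: str):
    """Record a timed span around each call of the wrapped function."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            span = {"id": str(uuid.uuid4()), "name": name,
                    "start": time.time(), "error": None}
            try:
                return fn(*args, **kwargs)
            except Exception as exc:
                span["error"] = repr(exc)
                raise
            finally:
                span["end"] = time.time()
                SPANS.append(span)
        return wrapper
    return decorator

@trace("support_chat_turn")
def handle_turn(user_message: str) -> str:
    # Placeholder for the real chatbot logic (prompt fetch + LLM call).
    return f"(reply to: {user_message})"
```

Nesting decorated functions (one span for the turn, one for the prompt fetch, one for the LLM call) is what produces the full execution tree in the trace view.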
Wrap the chatbot logic with tracing to see the full execution flow.

Step 5: Create an evaluator
Go to Evaluation > Evaluators > + New evaluator:

| Field | Value |
|---|---|
| Name | Support Quality |
| Type | LLM |
| Model | gpt-4o |
| Score type | Numerical (1-5) |
| Passing score | 3 |
| Definition | Rate this customer support response. Consider: (1) Does it answer the question? (2) Is the tone friendly and professional? (3) Is the information accurate? Score: 1=poor, 3=acceptable, 5=excellent. |
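Respan executes this evaluator server-side, but it helps to see what the configuration amounts to: a judge model receives the rubric plus the response, returns a 1-5 score, and the passing threshold turns that into pass/fail. A hypothetical sketch of that logic:

```python
# The judge-call layout below is an assumption; Respan runs the evaluator
# server-side using the definition you configure in the UI.
RUBRIC = (
    "Rate this customer support response. Consider: (1) Does it answer the "
    "question? (2) Is the tone friendly and professional? (3) Is the "
    "information accurate? Score: 1=poor, 3=acceptable, 5=excellent."
)
PASSING_SCORE = 3

def judge_messages(question: str, answer: str) -> list:
    """Messages a gpt-4o judge would receive for one response."""
    return [
        {"role": "system",
         "content": RUBRIC + " Reply with a single integer from 1 to 5."},
        {"role": "user",
         "content": f"Question: {question}\n\nResponse: {answer}"},
    ]

def passes(judge_reply: str) -> bool:
    """Map the judge's numeric reply onto the passing threshold."""
    return int(judge_reply.strip()) >= PASSING_SCORE
```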
Step 6: Run an offline experiment
Before going to production, test your prompt against sample questions:
- Go to Experiments > + New experiment
- Select the support_chatbot prompt
- Add test cases:
| user_message | ideal_output |
|---|---|
| How do I reset my password? | Go to Settings > Security > Reset Password… |
| What’s your refund policy? | We offer full refunds within 30 days… |
| My order hasn’t arrived | Let me check your order status… |
- Run the experiment and evaluate with the “Support Quality” evaluator
- If scores are good, proceed to production
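The experiment UI handles the mechanics, but the go/no-go check it supports is simple arithmetic against the passing score. A sketch, assuming the test cases above and the Support Quality threshold of 3:

```python
# Hypothetical local mirror of the experiment's test cases and scoring rule.
TEST_CASES = [
    {"user_message": "How do I reset my password?",
     "ideal_output": "Go to Settings > Security > Reset Password…"},
    {"user_message": "What’s your refund policy?",
     "ideal_output": "We offer full refunds within 30 days…"},
    {"user_message": "My order hasn’t arrived",
     "ideal_output": "Let me check your order status…"},
]

PASSING_SCORE = 3

def pass_rate(scores):
    """Fraction of evaluated cases at or above the passing score."""
    return sum(score >= PASSING_SCORE for score in scores) / len(scores)
```

If the pass rate is high across all cases, proceed to production; otherwise iterate on the prompt and rerun the experiment first.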
Step 7: Set up online evaluation
Create an automation to continuously evaluate production traffic:
- Condition: metadata.feature = "support_chat"
- Evaluator: Support Quality
- Sampling rate: 0.2 (evaluate 20% of traffic)
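Conceptually, the automation applies a predicate to every incoming log: the condition filters by metadata, and the sampling rate keeps evaluation costs down. A sketch (the field names mirror the condition above; the real evaluation engine runs server-side):

```python
import random

def should_evaluate(log: dict, rate: float = 0.2, rng=random.random) -> bool:
    """Apply the automation's condition and sampling rate to one log.

    `rng` is injectable so the sampling decision can be tested
    deterministically.
    """
    matches = log.get("metadata", {}).get("feature") == "support_chat"
    return matches and rng() < rate
```

At a 0.2 sampling rate, roughly one in five matching conversations gets scored by the Support Quality evaluator, which is usually enough signal to spot regressions without paying for judge calls on every request.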
Step 8: Monitor and iterate
Your production monitoring is now running. Here’s your ongoing workflow:
- Check the dashboard daily — Watch cost, latency, and evaluation scores
- Review low-scoring logs — Filter for conversations that scored below 3
- Add failures to your test dataset — Growing your dataset makes evaluations more robust
- Iterate on the prompt — Edit in the Respan playground, test with experiments
- Deploy updates — Update the active prompt version, monitor the impact
The complete workflow is now a loop: production data feeds evaluation, which feeds prompt improvements, which deploy back to production.