Experiments V2 (Beta)

Run repeatable evaluations over a dataset using Prompt, Completion, or Custom workflows.
  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page

Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.

{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}

Experiments lets you run repeatable evaluations over a dataset and inspect outputs, evaluator scores, and run status. Choose a workflow type based on your use case.


Prompt workflow

Render a saved prompt template with dataset variables, then run LLM calls automatically.
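
As an illustration of the rendering step, each dataset row fills the template's placeholders before the LLM call. This is a minimal sketch only: the Python str.format-style placeholder syntax and the "context"/"question" column names are assumptions, not taken from this page.

# Illustrative only: how a saved template could be rendered once per dataset row.
# Placeholder syntax and column names are assumptions.
template = "Answer the question using the context.\nContext: {context}\nQuestion: {question}"

dataset_rows = [
    {"context": "Experiments run evaluations over a dataset.", "question": "What do experiments run over?"},
    {"context": "Evaluators score each output.", "question": "What do evaluators do?"},
]

for row in dataset_rows:
    rendered_prompt = template.format(**row)  # one rendered prompt per row
    # The platform then sends each rendered prompt to the selected prompt version's model.
    print(rendered_prompt)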

  1. Click New experiment — Go to Experiments and click New experiment.
  2. Select a dataset — Choose the dataset you want to run on.
  3. Select task = Prompt — Pick Prompt as the task type, then choose the prompt and version.
  4. Select evaluators and create — Select evaluators to score outputs, then click Create. Inspect outputs and scores once the run finishes.

Completion workflow

Run direct LLM completions on dataset messages automatically — no prompt templates needed.

  1. Click New experiment — Go to Experiments and click New experiment.
  2. Select a dataset — Choose the dataset you want to run on.
  3. Select task = LLM generation — Pick LLM generation (chat completion), then configure the model and parameters (temperature, max tokens); a sketch of what this task runs per row follows these steps.
  4. Select evaluators and create — Select evaluators, then click Create. Inspect outputs and scores once the run finishes.
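
Conceptually, the Completion task runs one chat completion per dataset row with the model and parameters you configured. The sketch below uses the OpenAI Python SDK purely for illustration; the "messages" column name, the example model, and the exact call Respan makes internally are assumptions.

# Minimal sketch of what a Completion-task run does per row (illustrative, not Respan internals).
# Assumes each dataset row has a "messages" column in chat format.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

row = {"messages": [{"role": "user", "content": "Summarize the Experiments feature in one sentence."}]}

response = client.chat.completions.create(
    model="gpt-4o-mini",       # the model picked in step 3 (example value)
    messages=row["messages"],  # dataset messages, no prompt template involved
    temperature=0.2,           # parameters configured in step 3
    max_tokens=256,
)
output = response.choices[0].message.content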

Custom workflow

Fetch inputs, run your own code/model, then submit outputs back for automatic evaluation.

  1. Click New experiment — Go to Experiments and click New experiment.
  2. Select a dataset — Choose the dataset you want to run on.
  3. Select task = Custom — Pick Custom as the task type, select evaluators, then click Create.
  4. Submit outputs via API — The system creates placeholder rows. Use the API to submit outputs; evaluators run automatically. The UI is used to monitor progress and review results (see the sketch below).
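
A minimal sketch of the Custom loop: fetch the placeholder rows, run your own code or model, then submit each output back so evaluators can run. The endpoint paths, payload fields, and identifiers below are assumptions, not documented Respan API details — check the API reference for the real ones.

# Hypothetical sketch of the Custom workflow loop.
# Endpoint paths, payload fields, and IDs are assumptions, not documented API details.
import requests

BASE_URL = "https://api.respan.ai"              # assumption
HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}
EXPERIMENT_ID = "exp_123"                       # assumption

def my_model(text: str) -> str:
    """Stand-in for your own code or model."""
    return text.upper()

# 1. Fetch the placeholder rows created with the experiment.
rows = requests.get(
    f"{BASE_URL}/experiments/{EXPERIMENT_ID}/rows", headers=HEADERS, timeout=30
).json()

for row in rows:
    # 2. Run your own code/model on the row's input.
    output = my_model(row["input"])

    # 3. Submit the output back; evaluators then run automatically.
    requests.post(
        f"{BASE_URL}/experiments/{EXPERIMENT_ID}/rows/{row['id']}/output",
        headers=HEADERS,
        json={"output": output},
        timeout=30,
    )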


Troubleshooting

No results yet: The experiment may still be processing — wait 5–10 seconds and retry. Also check that your dataset is not empty.

Missing evaluator scores: Confirm the evaluator slug exists and is accessible. Evaluators run asynchronously — poll the log detail endpoint after submission (see the sketch below).

Truncated fields: Use the detail endpoint to retrieve the full span tree and untruncated fields.
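
A simple poll after submitting an output might look like the following. The endpoint path and response field are assumptions, not documented API details.

# Hypothetical polling sketch; the path and "evaluator_scores" field are assumptions.
import time
import requests

BASE_URL = "https://api.respan.ai"              # assumption
HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}

def wait_for_scores(log_id: str, attempts: int = 10, delay: float = 2.0):
    """Poll the log detail endpoint until evaluator scores appear or give up."""
    for _ in range(attempts):
        log = requests.get(f"{BASE_URL}/logs/{log_id}", headers=HEADERS, timeout=30).json()
        if log.get("evaluator_scores"):
            return log
        time.sleep(delay)
    return None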