Prompt workflow
Render a saved prompt template with dataset variables, then run LLM calls automatically.- Via UI
- Via API
Completion workflow
Run direct LLM completions on dataset messages automatically — no prompt templates needed.- Via UI
- Via API
Select task = LLM generation
Pick LLM generation (chat completion), then configure the model and parameters (temperature, max tokens).



Custom workflow
Fetch inputs, run your own code/model, then submit outputs back for automatic evaluation.- Via UI
- Via API
Troubleshooting
Logs list is empty after creation
Logs list is empty after creation
The experiment may still be processing — wait 5–10 seconds and retry. Also check that your dataset is not empty.
Evaluator spans never appear
Evaluator spans never appear
Confirm the evaluator slug exists and is accessible. Evaluators run asynchronously — poll the log detail endpoint after submission.
Inputs/outputs look truncated
Inputs/outputs look truncated
Use the detail endpoint to retrieve the full span tree and untruncated fields.












