On the same platform as tracing, evals, and the gateway, metrics, saved views on logs and traces, and threshold monitors.
Metrics on the dashboard, saved views on filtered surfaces, and monitors when thresholds breach — on the same logs and traces you instrument for debugging.
See requests, tokens, errors, latency, and cost in one place, broken down by model, user, and API key.
See LLM and tool steps in one trace tree on the same traffic, broken down by span and workflow.
Watch error rate, cost, latency, or tokens over time and alert Slack, email, or a webhook on breach.
Metrics, views, and monitors on one surface — tied to tracing, evals, and the gateway.
Performance monitoring for response times and model behavior. Cost management to find expensive prompts. Quality and debugging tie back to full sessions on Logs and Traces.
Build filters on Logs, Traces, Users, or Prompts; then Save as view. Update filters in a view with Save or Save as new when your investigation changes.
Trigger: When [metric] of [source] is [threshold condition] over [time window]. Narrow with Where on model, project, environment, or user. Send test alerts before deploy.
Ingest instrumented traffic, aggregate metrics on the dashboard, then alert with views and monitors when something moves.
Use the Respan SDK or OTLP so every request feeds metrics, logs, and traces on the same platform.
Track requests, tokens, errors, latency, and cost on the dashboard, sliced by model, user, and API key.
Save filters as views, then deploy monitors when error rate, cost, latency, or tokens cross a threshold.
Six gaps teams hit without metrics, saved views, and threshold monitors on the same logs and traces they debug.
A single monthly bill with no per-model or per-feature chart. Use dashboard breakdowns by model, user, API key, and prompt to see what drives spend.
Same status, model, and custom-property filters typed again for each outage. Save as view on Logs or Traces and apply it in one click.
LLM responses can succeed at the protocol level but regress on cost or latency. Monitor cost, tokens, and latency, not only error rate.
Engineers and PMs use different ad hoc filters. Named views (for example production errors or high-cost requests) align the team on one filter set.
Org-wide thresholds page constantly. Add Where conditions on model, project, environment, or user so alerts match the slice you own.
A chart spikes but you cannot open the failing runs. On Respan, the same filters on Logs and Traces back the metrics you chart and monitor.
Instrument traffic, review dashboard metrics, save views on Logs and Traces, then deploy monitors with notifications.
Instrument traffic
Respan SDK or OTLP on LLM clients and agents so logs and traces feed metrics.
Review the dashboard
Track requests, tokens, errors, latency, cost by model, user, API key, prompt.
Save views
Filter Logs or Traces, then Save as view for filters you reuse every week.
Deploy monitors
Set trigger, Where, and notifications, test alert, then Deploy from Monitors.
import os
from openai import OpenAI
from respan import Respan
os.environ["RESPAN_API_KEY"] = "YOUR_RESPAN_API_KEY"
Respan()
client = OpenAI()
response = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "Hello"}],
extra_body={
"customer_identifier": "user_123",
"metadata": {"feature": "support_bot", "environment": "production"},
},
)
print(response.choices[0].message.content)import os
from openai import OpenAI
from respan import Respan
os.environ["RESPAN_API_KEY"] = "YOUR_RESPAN_API_KEY"
Respan()
client = OpenAI()
response = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "Hello"}],
extra_body={
"customer_identifier": "user_123",
"metadata": {"feature": "support_bot", "environment": "production"},
},
)
print(response.choices[0].message.content)Alerts, time windows, and cost visibility: what teams configure before monitors become noise or blind spots.
A monitor fires but on-call cannot find the runs. Pair each monitor with a saved view on Logs or Traces using the same Where conditions.
A five-minute spike missed on a one-hour window. Match the monitor time window to how fast metrics move. Test with Send test alert.
Error rate flat while token spend doubles. Add a cost or token monitor with breakdown by model and feature metadata.
ISO 27001
Respan is fully compliant with ISO 27001, the internationally recognized standard for information security management.
SOC 2
We meet SOC 2 requirements to ensure secure and compliant management of data across all our systems.
GDPR
With operations designed for global compliance, we operate under GDPR - the world's strictest standard for data privacy.
HIPAA
Respan is HIPAA compliant with a Business Associate Agreement available for healthcare organizations.
Observability is the foundation. Three adjacent disciplines build on top of it:
Domain-specific observability patterns and failure modes we have written about:
Related guides: LLM tracing · LLM evals · AI gateway