List LLM metrics

Returns LLM usage metrics (requests, tokens, cost, latency, cache hit rate, etc.) bucketed by time_tick (minute / hour / day). Use metrics_to_aggregate to request a subset of metrics.

Headers

AuthorizationstringRequired

Bearer token. Use Bearer YOUR_API_KEY.

Request

This endpoint expects an object.
end_timedatetimeRequired
Inclusive end of the range, ISO 8601.
start_timedatetimeRequired
Inclusive start of the range, ISO 8601.
filtersobjectOptional

Filter criteria. See Filters API Reference for operator syntax.

metrics_to_aggregatelist of enumsOptional
Optional. If provided, the response includes only these metric fields. If omitted, all LLM metrics are returned.
time_tickenumOptional

Bucket granularity for time-series endpoints. If omitted, inferred from the range. Has no effect on /summary/ endpoints.

Allowed values:

Response

Successful response.
date_groupdatetime
Bucket start time, ISO 8601.
number_of_requestsinteger
total_costdouble
total_prompt_tokensinteger
total_completion_tokensinteger
total_tokensinteger
error_countinteger
error_percentagedouble
average_prompt_tokensinteger
average_completion_tokensinteger
average_tokensinteger
average_costdouble
average_tpsdouble
average_latencydouble
average_ttftdouble
prompt_cache_hit_tokensinteger
reasoning_tokensinteger
cache_hit_percentagedouble
requests_per_seconddouble