Monitor Groq inference with Respan

Overview

Respan routes Groq model calls through the gateway, providing unified observability including logs, cost tracking, latency metrics, and reliability monitoring for ultra-fast inference.

Configure credentials globally or per-request. Multiple API keys are supported with configurable load balancing weights for traffic distribution across credentials.

How it works

Add your Groq API key in the Respan Providers dashboard, or pass credentials per-request using the customer_credentials parameter.

Route requests through the gateway using the OpenAI SDK, LangChain, LlamaIndex, or other supported frameworks. Async logging is also available for direct Groq SDK calls.