Respan routes Groq model calls through the gateway, providing unified observability including logs, cost tracking, latency metrics, and reliability monitoring for ultra-fast inference.
Configure credentials globally or per-request. Multiple API keys are supported with configurable load balancing weights for traffic distribution across credentials.
Add your Groq API key in the Respan Providers dashboard, or pass credentials per-request using the customer_credentials parameter.
Route requests through the gateway using the OpenAI SDK, LangChain, LlamaIndex, or other supported frameworks. Async logging is also available for direct Groq SDK calls.