Free
$0
Per month
- Core features
DeepEval is open-source framework for evaluating LLM outputs with metrics and test cases.
What this tool does well, and the limitations to keep in mind.
Pros
Cons
What's included in each plan, and how the tiers compare.
$0
Per month
$0
Per month
Custom
Contract
Integrate with Respan for enhanced AI workflows
Top companies in Observability, Prompts & Evals you can use instead of DeepEval.
Respan
LLM tracing, evals, and gateway
LangSmith
Trace visualization for LLM chains
Weights & Biases
ML experiment tracking
MLflow
OpenTelemetry-native tracing
Langfuse
Open-source LLM observability
Arize AI
ML observability with LLM support
Datadog LLM
LLM monitoring within Datadog platform
Helicone
Traceloop
OpenTelemetry
Braintrust
Real-time LLM logging and tracing
HoneyHive
Prompt management
Phoenix
OpenTelemetry-based LLM and agent tracing
Promptfoo
Patronus AI
Automated LLM evaluation platform
Portkey
Humanloop
Sentry
Ragas
RAG-specific evaluation framework
LangWatch
Multi-turn agent simulation testing
Galileo AI
LLM output quality evaluation
PromptLayer
Maxim AI
Distributed tracing for LLM and agent apps
Confident AI
DeepEval open-source evaluation framework
Opik
Agenta
Lunary
Future AGI
Multimodal evaluation (text, image, audio, video)
Parea AI
Chamber
ML infrastructure automation
Athina AI
Ashr
Multi-modal synthetic testing
Sentrial
Agent failure root cause analysis
Moda
Hallucination detection
Side-by-side comparisons with other tools in this category.
Companies from adjacent layers in the AI stack that work well with DeepEval.
Last verified: March 10, 2026