Evaluate
Configure and evaluate prompt performance for your LLM
Keys are not stored and are used only for this session
Drop CSV or browse
Required columns: prompt, response
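A quick way to check a file before uploading is to confirm its header contains both required columns. The sketch below is only an illustration and not the tool's own validation: the helper name `missing_columns` is hypothetical, and it assumes the column names are matched exactly and case-sensitively, which the page does not state.

```python
import csv
import io

# Column names taken from the upload hint; exact, case-sensitive matching is an assumption.
REQUIRED_COLUMNS = {"prompt", "response"}


def missing_columns(csv_text: str) -> set[str]:
    """Return the required column names that are absent from the CSV header."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # fieldnames is None for an empty file, so fall back to an empty list.
    header = set(reader.fieldnames or [])
    return REQUIRED_COLUMNS - header


if __name__ == "__main__":
    sample = "prompt,response\nWhat is 2+2?,4\n"
    # An empty set means both required columns are present.
    print(missing_columns(sample))
```

Extra columns are ignored here, so a dataset with additional metadata fields would still pass this check.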
Custom Metrics
Metric Preview
Select or hover over a metric to see its description or template here.
Status
Model —
API Key —
Dataset —
Metrics 0
Est. API Calls 0
Provider Defaults
Select a provider
Leave empty to use provider defaults