Card
- default
evaluation · run 42
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
- elevated
evaluation · run 42
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
- active
selected
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
- inverse-ink
spotlight
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
- success
passed · 3 min ago
All checks passed
Every assertion in the evaluation suite passed on this run.
- warning
warning · 3 min ago
Partial match
3 of 7 assertions passed. Review edge cases before promoting.
- error
failed · 3 min ago
Evaluation failed
The model response did not meet the required quality threshold.
- bezel
glass surface
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
evaluation
Response Quality
Clarity, depth, correctness.