Card

default: evaluation · run 42
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
elevated: evaluation · run 42
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
active: selected
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
inverse-ink: spotlight
Response Quality
Evaluates clarity, depth, and factual correctness of model output.
success: passed · 3 min ago
All checks passed
Every assertion in the evaluation suite passed on this run.
warning: warning · 3 min ago
Partial match
3 of 7 assertions passed. Review edge cases before promoting.
error: failed · 3 min ago
Evaluation failed
The model response did not meet the required quality threshold.
bezel: glass surface
Response Quality
Evaluates clarity, depth, and factual correctness of model output.