Card

default

evaluation · run 42

Response Quality

Evaluates clarity, depth, and factual correctness of model output.

elevated

evaluation · run 42

Response Quality

Evaluates clarity, depth, and factual correctness of model output.

active

selected

Response Quality

Evaluates clarity, depth, and factual correctness of model output.

inverse-ink

spotlight

Response Quality

Evaluates clarity, depth, and factual correctness of model output.

success

passed · 3 min ago

All checks passed

Every assertion in the evaluation suite passed on this run.

warning

warning · 3 min ago

Partial match

3 of 7 assertions passed. Review edge cases before promoting.

error

failed · 3 min ago

Evaluation failed

The model response did not meet the required quality threshold.

bezel

glass surface

Response Quality

Evaluates clarity, depth, and factual correctness of model output.