Commit Graph

3 Commits

Author SHA1 Message Date
Christian Gunderman
f1bb2af6de Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941) 2026-04-08 23:57:26 +00:00
Christian Gunderman
514d431049 Demote unreliable test. (#20571) 2026-02-27 16:48:46 +00:00
N. Taylor Mullen
d45a45d565 chore: strengthen validation guidance in system prompt (#18544) 2026-02-09 05:32:46 +00:00