Commit Graph

7 Commits

Author SHA1 Message Date
Christian Gunderman d7f6d21c10 Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941) 2026-04-08 23:57:26 +00:00
Christian Gunderman 55633ca50b Promote stable tests. (#22253) 2026-03-13 21:32:00 +00:00
Christian Gunderman af24040e5d Promote stable tests to CI blocking. (#20581) 2026-02-27 21:08:12 +00:00
christine betts 00384b9ec1 Disable failing eval test (#19455) 2026-02-18 19:27:21 +00:00
Abhijit Balaji 278120b4be fix(evals): prevent false positive in hierarchical memory test (#18777) 2026-02-11 01:51:05 +00:00
Keith Guerin 98c008cf90 ui: update & subdue footer colors and animate progress indicator (#18570) 2026-02-10 17:36:20 +00:00
joshualitt 8edc496312 feat(core): Render memory hierarchically in context. (#18350) 2026-02-10 02:01:59 +00:00