Christian Gunderman | d7f6d21c10 | Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941) | 2026-04-08 23:57:26 +00:00
Christian Gunderman | 55633ca50b | Promote stable tests. (#22253) | 2026-03-13 21:32:00 +00:00
Christian Gunderman | af24040e5d | Promote stable tests to CI blocking. (#20581) | 2026-02-27 21:08:12 +00:00
christine betts | 00384b9ec1 | Disable failing eval test (#19455) | 2026-02-18 19:27:21 +00:00
Abhijit Balaji | 278120b4be | fix(evals): prevent false positive in hierarchical memory test (#18777) | 2026-02-11 01:51:05 +00:00
Keith Guerin | 98c008cf90 | ui: update & subdue footer colors and animate progress indicator (#18570) | 2026-02-10 17:36:20 +00:00
joshualitt | 8edc496312 | feat(core): Render memory hierarchically in context. (#18350) | 2026-02-10 02:01:59 +00:00