Christian Gunderman
|
f1bb2af6de
|
Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941)
|
2026-04-08 23:57:26 +00:00 |
|
Christian Gunderman
|
fe8d93c75a
|
Promote stable tests. (#22253)
|
2026-03-13 21:32:00 +00:00 |
|
Christian Gunderman
|
05ef2eb362
|
Promote stable tests to CI blocking. (#20581)
|
2026-02-27 21:08:12 +00:00 |
|
christine betts
|
8a8826654c
|
Disable failing eval test (#19455)
|
2026-02-18 19:27:21 +00:00 |
|
Abhijit Balaji
|
b3ecac7086
|
fix(evals): prevent false positive in hierarchical memory test (#18777)
|
2026-02-11 01:51:05 +00:00 |
|
Keith Guerin
|
5920750c24
|
ui: update & subdue footer colors and animate progress indicator (#18570)
|
2026-02-10 17:36:20 +00:00 |
|
joshualitt
|
89d4556c45
|
feat(core): Render memory hierarchically in context. (#18350)
|
2026-02-10 02:01:59 +00:00 |
|