Commit Graph

8 Commits

Author SHA1 Message Date
Alisa 5626611bb8 feat: implement high-signal PR regression check for evaluations (#23937) 2026-04-02 05:14:43 +00:00
Abhi dd538662c7 feat(skills): add behavioral-evals skill with fixing and promoting guides (#23349) 2026-03-23 21:06:43 +00:00
Christian Gunderman 27fa2a685d Add some dos and don'ts to behavioral evals README. (#20629) 2026-03-02 23:14:00 +00:00
Christian Gunderman 94b2cb593e Add slash command for promoting behavioral evals to CI blocking (#20575)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-02-27 19:11:30 +00:00
Christian Gunderman 3593e7236e Slash command for helping in debugging (#17609) 2026-01-27 02:47:04 +00:00
Christian Gunderman 9c3f0fce0b Steer outer agent to use expert subagents when present (#16763) 2026-01-16 16:51:10 +00:00
Christian Gunderman f60b442d39 Aggregate test results. (#16581) 2026-01-14 07:08:05 +00:00
Christian Gunderman b82c66b2d8 Behavioral evals framework. (#16047) 2026-01-14 04:49:17 +00:00