Commit Graph

10 Commits

Author SHA1 Message Date
Christian Gunderman d7f6d21c10 Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941) 2026-04-08 23:57:26 +00:00
Jerop Kipruto d2f0d9dd2d feat(core): prioritize discussion before formal plan approval (#24423) 2026-04-01 15:55:47 +00:00
David Pierce 718659fef4 fix(core): resolve Plan Mode deadlock during plan file creation due to sandbox restrictions (#24047) 2026-03-31 22:06:50 +00:00
ruomeng 7c549901e0 feat(plan): promote planning feature to stable (#24282) 2026-03-31 16:10:13 +00:00
Adib234 ff48c3a10e fix(plan): sandbox path resolution in Plan Mode to prevent hallucinations (#22737)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-03-24 13:19:29 +00:00
ruomeng 7945227bdd feat(plan): support plan mode in non-interactive mode (#22670) 2026-03-18 20:00:26 +00:00
Christian Gunderman af24040e5d Promote stable tests to CI blocking. (#20581) 2026-02-27 21:08:12 +00:00
Jerop Kipruto d8656206d3 fix(core): clarify plan mode constraints and exit mechanism (#19438) 2026-02-18 20:09:59 +00:00
Jerop Kipruto 54f4def89d feat(plan): add positive test case and update eval stability policy (#18457) 2026-02-06 19:45:22 +00:00
Jerop Kipruto d01287b640 feat(plan): add behavioral evals for plan mode (#18437) 2026-02-06 16:51:12 +00:00