Tommaso Sciortino
|
36cc283f33
|
use macos-latest-large runner where applicable. (#25413)
|
2026-04-14 14:05:25 -07:00 |
|
Christian Gunderman
|
d7f6d21c10
|
Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941)
|
2026-04-08 23:57:26 +00:00 |
|
Alisa
|
aa178f547c
|
feat(evals): add reliability harvester and 500/503 retry support (#23626)
|
2026-03-26 01:48:45 +00:00 |
|
Aditya Bijalwan
|
ad8528833e
|
test: add browser agent integration tests (#21151)
|
2026-03-05 13:29:35 +00:00 |
|
Christian Gunderman
|
ccb5c0e679
|
ci(evals): only run evals in CI if prompts or tools changed (#20898)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-03 00:29:31 +00:00 |
|
Christian Gunderman
|
5e583749ae
|
Do not block CI on evals (#20870)
|
2026-03-02 20:31:02 +00:00 |
|
DeWitt Clinton
|
4a156cb7ec
|
Disable expensive and scheduled workflows on personal forks (#20449)
|
2026-02-27 17:40:09 +00:00 |
|
Gal Zahavi
|
045a9b815c
|
fix: action var usage (#20492)
|
2026-02-26 22:06:09 +00:00 |
|
Jerop Kipruto
|
d887c28e43
|
fix(github): resolve actionlint and yamllint regressions from #19443 (#20467)
|
2026-02-26 11:31:31 -08:00 |
|
Google Admin
|
48edb0f728
|
Refactor Github Action per b/485167538 (#19443)
Co-authored-by: Ben Knutson <benknutson@google.com>
|
2026-02-26 12:58:14 -05:00 |
|
Tommaso Sciortino
|
f504f072b3
|
make windows tests mandatory (#20096)
|
2026-02-24 00:06:14 +00:00 |
|
Christian Gunderman
|
b82c66b2d8
|
Behavioral evals framework. (#16047)
|
2026-01-14 04:49:17 +00:00 |
|
N. Taylor Mullen
|
325d96ec5b
|
Optimize CI workflow: Parallelize jobs and cache linters (#16054)
Co-authored-by: matt korwel <matt.korwel@gmail.com>
|
2026-01-07 21:50:22 +00:00 |
|
Tommaso Sciortino
|
f5458103e2
|
Always set pending status in E2E tests (#14756)
|
2025-12-09 02:58:52 +00:00 |
|
Tommaso Sciortino
|
e2a730040d
|
Remove old E2E Workflows (#14749)
|
2025-12-09 00:32:29 +00:00 |
|