Commit Graph

479 Commits

Author SHA1 Message Date
mkorwel 40e7d69d2a test(cli): fix App tests and run only App.test.tsx in CI 2026-04-16 21:23:15 +00:00
mkorwel bcc9c72bb3 ci: run all sequential tests again after fixes 2026-04-16 21:10:50 +00:00
mkorwel fd2875d310 ci: run only SettingsDialog.test.tsx to verify speed 2026-04-16 21:08:46 +00:00
mkorwel b77d861925 ci: run all sequential tests to verify fix 2026-04-16 18:51:16 +00:00
mkorwel 140b10d107 ci: run only the 3 specific suspect tests 2026-04-16 18:01:26 +00:00
mkorwel 0804145b36 ci: run all tests in packages/cli to verify fix 2026-04-16 17:54:50 +00:00
mkorwel 9148016e98 ci: remove test_linux job to isolate suspect test 2026-04-16 16:32:18 +00:00
mkorwel 6c69b51a8b ci: run only render.test.tsx in suspect job 2026-04-16 16:29:45 +00:00
mkorwel f4a564714f ci: make sandbox deps optional in setup action 2026-04-16 16:27:58 +00:00
mkorwel 1e092a4a98 ci: use forks pool for suspect tests 2026-04-16 16:23:56 +00:00
mkorwel 58f4414b35 ci: exclude suspect tests from main linux job 2026-04-16 16:13:39 +00:00
mkorwel a19222f830 ci: add concurrency group to cancel in-progress runs 2026-04-16 16:12:49 +00:00
mkorwel 32d9de8b2d ci: add job for suspect tests 2026-04-16 16:10:50 +00:00
mkorwel 844bb44411 ci: remove windows and bundle jobs to focus on linux 2026-04-16 15:58:30 +00:00
mkorwel 9a347447c2 fix(cli): resolve race conditions in test utils and streamline CI workflow 2026-04-16 15:50:55 +00:00
mkorwel ae3f3b6127 ci: remove mac job and improve CI polling logic 2026-04-16 15:28:00 +00:00
mkorwel ae4b563326 fix(ci): fix infrastructure startup errors and restore sandbox dependencies 2026-04-14 20:57:43 -07:00
mkorwel 9e4c1538f1 feat(ci): build full cross-platform matrix on high-power runners 2026-04-14 20:33:15 -07:00
mkorwel 7621c425b3 fix(ci): ensure core is built for integration test environment 2026-04-14 20:14:50 -07:00
mkorwel 73ecb6816f feat(ci): simplify trial to balance reliability and speed 2026-04-14 20:07:02 -07:00
mkorwel 8ad9efdbb0 feat(ci): implement 'build once, test many' shared artifact architecture 2026-04-14 19:55:42 -07:00
mkorwel de2dfd7559 feat(ci): refine trial to measure both unit speed and bundle integrity 2026-04-14 19:38:50 -07:00
mkorwel a4730c3d87 feat(ci): target CLI unit tests specifically in trial 2026-04-14 19:35:44 -07:00
mkorwel a438c24ee0 feat(ci): switch trial to optimized unit tests 2026-04-14 19:30:11 -07:00
mkorwel c850a720a0 feat(ci): switch trial to integration tests 2026-04-14 16:56:31 -07:00
mkorwel 340f241a7d fix(ci): use valid SHA for download-artifact action 2026-04-14 16:53:13 -07:00
mkorwel dae73612e5 feat(ci): add trial bundling workflow and setup action 2026-04-14 16:49:08 -07:00
Sam Roberts 58901207bd docs: Update docs-audit to include changes in PR body (#25153) 2026-04-10 22:29:04 +00:00
Sam Roberts f387e456be Automated documentation audit (#24567) 2026-04-09 18:45:24 +00:00
Christian Gunderman f1bb2af6de Generalize evals infra to support more types of evals, organization and queuing of named suites (#24941) 2026-04-08 23:57:26 +00:00
Sri Pasumarthi c7b920717f feat(test-utils): add CPU performance integration test harness (#24951) 2026-04-08 21:50:29 +00:00
Sri Pasumarthi 4ebc43bc66 feat(test-utils): add memory usage integration test harness (#24876) 2026-04-08 17:42:18 +00:00
Alisa 846051f716 Alisa/approve button (#24645) 2026-04-07 16:44:07 +00:00
Alisa e74efc5c0b Do not run pr-eval workflow when no steering changes detected (#24621) 2026-04-03 22:54:05 +00:00
Emily Hedlund ca0e6f9bd9 test: fix Windows CI execution and resolve exposed platform failures (#24476) 2026-04-03 15:50:29 +00:00
Alisa 973092df50 feat: implement high-signal PR regression check for evaluations (#23937) 2026-04-02 05:14:43 +00:00
Sehoon Shon f938a3f51d fix(build): upload full bundle directory archive to GitHub releases (#24403) 2026-04-01 16:39:44 +00:00
Tommaso Sciortino 91400c5b0b don't try to remove tags in dry run (#24356) 2026-03-31 12:54:53 -07:00
Sehoon Shon 25af91e4df chore(release): allow bundling for all builds, including stable (#24305) 2026-03-31 18:37:14 +00:00
Emily Hedlund 1edc542b55 ci: install bubblewrap on Linux for release workflows (#24347) 2026-03-31 18:19:48 +00:00
Alisa 2e03e3aed5 feat(evals): add reliability harvester and 500/503 retry support (#23626) 2026-03-26 01:48:45 +00:00
Tommaso Sciortino b91758bf6b Don't try removing a tag from a non-existent release. (#23830) 2026-03-25 22:27:17 +00:00
Alisa 7b710a2790 new linting check: github-actions-pinning (#23808) 2026-03-25 21:43:12 +00:00
Tommaso Sciortino a6a3689298 don't silence errors from wombat (#23822) 2026-03-25 13:38:30 -07:00
Emily Hedlund 109a7dc531 test(core): install bubblewrap on Linux CI for sandbox integration tests (#23583) 2026-03-25 14:29:46 +00:00
Alisa 99e5164c82 Evals: PR Guidance adding workflow (#23164) 2026-03-23 15:07:40 +00:00
Christian Gunderman 28935d1e6b Retry evals on API error. (#23322) 2026-03-21 02:52:19 +00:00
Sam Roberts 4fc059beb5 Update website issue template for label and title (#23036) 2026-03-19 20:51:16 +00:00
Bryan Morgan 17b37144a9 fix(automation): evaluate staleness before checking protected labels (#22561) 2026-03-15 15:50:19 -04:00
Bryan Morgan 5ddb517593 fix(automation): harden stale PR closer permissions and maintainer detection (#22558) 2026-03-15 14:58:56 -04:00