Weekly regression chunks by steveri · Pull Request #2114 · StanfordAHA/aha

steveri · 2026-03-24T20:53:34Z

This is part of my ongoing effort to make weekly regressions more useful and more manageable.

Instead of trying to run the entire suite of weekly apps in a single 2-day chunk, this new code runs it in sequential chunks, where each chunk is a restartable group, e.g. glb_tests, glb_tests_RV, etc. The entire run still takes 2 days, but a single failure at the end no longer requires a whole new 2-day run from the beginning. Also, if a group fails, the run continues on to the remaining groups, so you can optionally restart the failed step/group while the remaining groups continue to run.
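
For readers who want the idea in miniature, here is a rough Python sketch of "run each group in sequence, record failures, keep going." It is not the code in this PR: the function names, the `runner` parameter, and the placeholder command are all assumptions made for illustration, and only the group names glb_tests and glb_tests_RV come from the description above.

```python
import subprocess
import sys

# Group names from the PR description; the full weekly list is not shown here.
GROUPS = ["glb_tests", "glb_tests_RV"]


def run_group(group):
    """Run one group as its own unit and return True on success.

    The command below is a placeholder (it only prints the group name),
    not the real regression invocation.
    """
    cmd = [sys.executable, "-c", f"print('running {group}')"]
    return subprocess.run(cmd).returncode == 0


def run_all(groups, runner=run_group):
    """Run every group in order; a failure is recorded, not fatal."""
    results = {}
    for group in groups:
        try:
            results[group] = bool(runner(group))
        except Exception:
            # Treat a crash in one group like a failure and move on.
            results[group] = False
    return results


def failed_groups(results):
    """Return the groups that would need a restart, in run order."""
    return [group for group, ok in results.items() if not ok]


if __name__ == "__main__":
    outcome = run_all(GROUPS)
    print("groups to restart:", failed_groups(outcome))
```

Because each group is an independent unit, only the entries returned by `failed_groups()` need to be rerun; the groups that already passed are left alone.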

In terms of being more manageable, weekly runs now bypass the weird byzantine regress-metahooks/regression-steps mechanism in favor of a much simpler weekly.yml driver. The new driver gets loaded as soon as pipeline.yml recognizes that we are doing a weekly run and not the normal aha1-9 regressions.

I also took this opportunity to simplify and optimize the way we do E64_supported_test checks.

Summary of changes

New files

generate-weekly-pipeline.sh
weekly.yml: much simpler full regression pipeline, generated with help from the generate-weekly-pipeline script
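
To give a feel for what "generated" means here, the following Python sketch emits Buildkite-style YAML with one step per group. It is only an illustration: the real generator is a shell script whose output is not shown in this PR text, so the function name `weekly_steps`, the step layout, and the placeholder command are all assumptions. The `depends_on` / `allow_failure` attributes are standard Buildkite step options; they make each step wait for the previous one while still running if that one failed.

```python
def weekly_steps(groups):
    """Return Buildkite-style YAML text with one command step per group.

    Each step depends on the previous one with allow_failure set, so the
    steps run in sequence and a failed group does not block later groups.
    """
    lines = ["steps:"]
    previous = None
    for group in groups:
        lines.append(f'  - label: "{group}"')
        lines.append(f'    key: "{group}"')
        # Placeholder command; the real per-group command is not shown here.
        lines.append(f'    command: "echo placeholder for {group}"')
        if previous is not None:
            lines.append("    depends_on:")
            lines.append(f'      - step: "{previous}"')
            lines.append("        allow_failure: true")
        previous = group
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(weekly_steps(["glb_tests", "glb_tests_RV"]), end="")
```

Giving every group its own keyed step is what would make a single group retryable from the Buildkite UI without touching the others.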

Changed files

pipeline.yml: new "Launch Weekly Run" step lets us swap weekly.yml in place of normal aha9 regressions
app: added new --subgroup option to run a single config group standalone
repress.py
- support for new --group(s) option, e.g. "--groups glb_tests,resnet_tests"
- print timing table after every app success or failure
- reduced try/except block extent
repress_info.py: fixed method summarize_and_print_info(), which was supposed to use the timing table read-only but did not
tests.py and regress_util.py: E64_supported_test group properties do not belong with dynamically-loaded executable app groups and directives, so this fixes that.
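
As an illustration of the --group(s) option listed above, here is a minimal Python sketch of parsing a comma-separated group list and using it to filter the available groups. It is not the code from this PR: `parse_groups`, `filter_groups`, and the error handling are hypothetical, and only the option spelling and the example group names come from the description.

```python
import argparse


def parse_groups(argv):
    """Parse --groups (or --group) into a list of group names."""
    parser = argparse.ArgumentParser()
    # Both spellings feed the same destination; the value is comma-separated.
    parser.add_argument("--groups", "--group", dest="groups", default="")
    args = parser.parse_args(argv)
    return [name for name in args.groups.split(",") if name]


def filter_groups(available, wanted):
    """Keep only the requested groups, in their original order.

    An empty request means "no filter": every available group is kept.
    """
    if not wanted:
        return list(available)
    unknown = [name for name in wanted if name not in available]
    if unknown:
        raise ValueError(f"unknown group(s): {', '.join(unknown)}")
    return [name for name in available if name in wanted]


if __name__ == "__main__":
    all_groups = ["glb_tests", "sparse_tests", "resnet_tests"]
    wanted = parse_groups(["--groups", "glb_tests,resnet_tests"])
    print(filter_groups(all_groups, wanted))
```

Filtering against the list of available groups, rather than running whatever was typed, is one way a misspelled group name could fail fast instead of silently running nothing.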

steveri · 2026-03-25T14:48:47Z

This one is ready to go.
I think these changes will make our lives better, at least I hope so!

yuchen-mei · 2026-03-26T01:36:42Z

Hi Steve, thanks for making these changes! We unfortunately have to work on a resubmission of the conference paper, which is due pretty soon. We will review and approve the changes after we finish the resubmission on April 10.

steveri · 2026-03-26T13:16:57Z

> Hi Steve, thanks for making these changes! We unfortunately have to work on a resubmission of the conference paper, which is due pretty soon. We will review and approve the changes after we finish the resubmission on April 10.

Sure, no problem. Let me know if you need/want help with the paper.

steveri · 2026-04-15T15:58:55Z

Okay guys, this PR is still waiting for some kind of action. If we can get it approved and merged before Friday, then I think we have a chance of getting through a weekly regression this weekend, for the first time since November maybe :)

Thanks!

steveri added 30 commits March 13, 2026 07:30

start with an item from the wish list: intermediate timing tables

705f3e9

cleanup

e166e10

Accomodations for e.g. "--group sparse_tests" or "--groups glb_tests,resnet_tests"

5f0be2e

refactor

5bf2603

Use --groups filter to eliminate unwanted test groups

e3f586f

E64 IS NOT A TEST GROUP!!!

1fb752d

Problems with gcc13, try newer Dockerfile

12c62ca

buildkite debugging :(

4f195a3

borrow the faster docker build from the other branch

1072ecf

if ya gonna debug, do it right

4a7003e

retry with all nine

3b2291f

voyager pointiwise by itself maybe

e38d48e

restore and try again from scratch

fa5f7a6

support for config subgroups

2ddb930

try new weekly pipeline maybe

59dbea0

quotes be bad dunno why

f16f19a

misspelled env var

106fc37

skip unnecessary submod checkout ugh

a31e59b

wait!

038ab37

cleanup

389719a

try replace + wait syntax problems maybe

8cd8049

maybe debugging

ef35c8c

trouble checking out the correct repo commit maybe

ad63309

trouble checking out the correct repo commit maybe ii

72dd362

trying a pre-checkout thing

a24b3b8

trying a pre-checkout thing ii

741d58c

forgot to shellcheck my changes oops

8c3a45a

problems with consistent checkout directory(s)

d20b187

trying yaml anchors...this will NEVER work

ed7b7de

no wait on failed step maybe

983e272

steveri added 17 commits March 18, 2026 09:10

comments

964c474

oops accidentally used pointer instead of copy

4b6900e

Merge branch 'master' into weekly-regression-chunks

662e4b6

comments

1869f16

comments

b74851b

cleanup refs to generate-weekly-pipeline

e9d435d

comments_and_cleanup

a0bd86b

comments

7a2bcb4

run failing tests FIRST

3794abe

cleanup and testing and also changed my mind about "1" vs. "0h01"

3491089

cleanup

683625c

cleanup

eb85c8f

cleanup

48ea42d

cleanup

aa9a3b9

cleanup

fd74e60

further reduce try/except region

965ee17

turn off debugging

ce0abd9

steveri requested review from mcoduoza and yuchen-mei March 25, 2026 14:47

steveri wants to merge 47 commits into master from weekly-regression-chunks

2 participants
