-
Notifications
You must be signed in to change notification settings - Fork 180
JP-3775: Expand usage of resource_tracker for benchmarking #9433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
JP-3775: Expand usage of resource_tracker for benchmarking #9433
Conversation
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #9433 +/- ##
==========================================
+ Coverage 77.92% 77.93% +0.01%
==========================================
Files 362 362
Lines 36283 36252 -31
==========================================
- Hits 28272 28253 -19
+ Misses 8011 7999 -12 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
b66e3ae to
6b0918a
Compare
|
Maybe you already plan to do this, but could you add one that covers |
57c4167 to
f6ba9c8
Compare
Done - coverage is effectively complete now. I'll pull this out of draft as review-ready. Latest RT run here: https://github.com/spacetelescope/RegressionTests/actions/runs/14888971507 |
|
I'm a bit puzzled by the summary of the regression test. I only see the output of |
My understanding is that entries will only exist in the RT summary for tests that have an existing value on main - that excludes all tests added in this PR. Of the existing tests on main (~five, added in #9357 ), it appears that the MOS spec3 test went above the threshold set in the workflow: https://github.com/spacetelescope/RegressionTests/blob/01de5ada29e4e8914902576f8ea9558b37b4b494/.github/workflows/test.yml#L465 Interestingly, in the set of RTs I ran before the last set of commits, that test showed marginally higher memory usage - https://github.com/spacetelescope/RegressionTests/actions/runs/14886917550 That test appears to have some variability in memory usage. |
|
Ok that makes sense - it isn't displayed because there's nothing to compare with on main.
It would be nice if we didn't see the report in this case, if this variability is benign. Is there a way to either make the test not change memory usage (I myself have no clue why it would be happening in the first place) or change the tolerance such that we don't see this? Would it help to make the tolerance relative, rather than an absolute number of megabytes? |
Yes, I think a tolerance increase will be a necessary next step. We can probably let the new set of tests inform the update as well. I'm not opposed to a relative tolerance, though I a flat level of 10-100 MB may also be feasible. Curious to see how these tests behave |
d3a8c9d to
42a86bb
Compare
emolter
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall LGTM. I had one comment about perhaps splitting a fixture into spec2 and spec3 components. It's not possible to see what the outputs will look like until merging, but I assume they'll be mostly fine since you've already made a proof-of-concept
|
FWIW: the resource measurements can be seen from the runs by looking at the results.xml directly. For https://github.com/spacetelescope/RegressionTests/actions/runs/14886917550 near the bottom of the summary is a link in the "Artifacts" section for results-jwst-1.16.0-1576-gf6ba9c8ab-Linux-X64-py3.12-1947.xml. Do these look reasonable?
|
|
Thanks Brett that is very helpful. If the runtime changes substantially, will that also cause the line to appear in the summary? Or only the memory usage? |
|
The runtime is highly variable (in part because the tests run in parallel) so it's not shown but is recorded in the |
|
I figured that was probably the case, thanks |
42a86bb to
b2d3e3d
Compare
|
Fresh RTs just in case: https://github.com/spacetelescope/RegressionTests/actions/runs/14911058249 |
emolter
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM pending clean regtest run
|
Regtests were clean; merging. I'll run another set immediately after for a sonarscan smoke test. |
Resolves JP-3775
This PR expands coverage of resource tracking to more regression tests.
Tasks
Build 12.0(use the latest build if not sure)no-changelog-entry-needed)changes/:echo "changed something" > changes/<PR#>.<changetype>.rst(see changelog readme for instructions)docs/pageokify_regteststo update the truth files