JP-3311: OutlierDetectionStep producing extraneous outlier_i2d files #8464

penaguerrero · 2024-05-07T13:12:52Z

This PR addresses intermediate files being left behind after the step is run. I made a series of modifications to produce an output file list and then clean up these files. In the process of creating that list, https://jira.stsci.edu/browse/JP-3039 was resolved as well: the _i2d files were not being saved to the specified output directory.

Checklist for maintainers

added entry in CHANGES.rst within the relevant release section
updated or added relevant tests
updated relevant documentation
added relevant milestone
added relevant label(s)
ran regression tests, post a link to the Jenkins job below.
How to run regression tests on a PR
Make sure the JIRA ticket is resolved properly

penaguerrero · 2024-05-07T15:18:37Z

reg test running: https://plwishmaster.stsci.edu:8081/job/RT/job/JWST-Developers-Pull-Requests/1421/

codecov · 2024-05-07T15:53:48Z

Codecov Report

Attention: Patch coverage is 71.87500%, with 9 lines in your changes missing coverage. Please review.

Project coverage is 56.53%. Comparing base (6580914) to head (2b006d2).
Report is 14 commits behind head on master.

❗ Current head 2b006d2 differs from pull request most recent head 30f6d15. Consider uploading reports for the commit 30f6d15 to get more accurate results

Files	Patch %	Lines
jwst/outlier_detection/outlier_detection_spec.py	12.50%	7 Missing ⚠️
jwst/resample/resample_spec.py	50.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #8464      +/-   ##
==========================================
+ Coverage   56.38%   56.53%   +0.15%     
==========================================
  Files         387      387              
  Lines       38716    38822     +106     
==========================================
+ Hits        21830    21949     +119     
+ Misses      16886    16873      -13

penaguerrero · 2024-05-07T20:10:11Z

fixed failed reg test and rerunning at https://plwishmaster.stsci.edu:8081/job/RT/job/JWST-Developers-Pull-Requests/1425/

penaguerrero · 2024-05-07T22:57:46Z

Happy with results of last reg test run. The failed tests are not related to changes in this PR.

braingram

Thanks for taking this on. I left a few comments/questions (not a full review yet) and I have one general question. My understanding (correct me if I'm wrong) is that the intermediate files are (or perhaps only should be) written only if in_memory is False. The current code (without this PR) doesn't delete these files until the end of the step (which makes "book-keeping" difficult and contributes to the issue addressed in this PR).

Would it be easier to not write out the files if they aren't required, and to delete them as soon as they are no longer useful? I think this is already the case for the median image (it's not written if save_intermediate_results is False). The same can't be said for blot_models, which are always written out. These are no longer needed after the call to detect_outliers and could be deleted there if save_intermediate_results is False. Similarly, the drizzled_models are not needed after the call to create_median and could be deleted there.

What do you think of this alternative approach? It should avoid needing to add the new parameter to the spec and the output_list. Will it be compatible with the other issue fixed in this PR (fixing the output directory for the i2d files)?
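The alternative described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the jwst implementation: `remove_files`, `write_placeholder`, `run_detection`, and the file names are hypothetical. Only the `save_intermediate_results` flag and the ordering (drizzled models unused once the median exists, blot models unused once outliers are flagged) come from the discussion.

```python
import os
import tempfile


def remove_files(paths):
    """Delete each file in ``paths`` that still exists (hypothetical helper)."""
    for path in paths:
        if os.path.isfile(path):
            os.remove(path)


def write_placeholder(directory, name):
    """Stand-in for writing an intermediate product; returns the file path."""
    path = os.path.join(directory, name)
    with open(path, "w") as handle:
        handle.write("placeholder")
    return path


def run_detection(directory, n_inputs, save_intermediate_results=False):
    """Toy workflow that cleans up each intermediate set once it is unused."""
    drizzled = [write_placeholder(directory, f"exp{i}_outlier_i2d.fits")
                for i in range(n_inputs)]
    # ... a median would be computed from the drizzled files here ...
    if not save_intermediate_results:
        remove_files(drizzled)  # not needed once the median exists

    blotted = [write_placeholder(directory, f"exp{i}_blot.fits")
               for i in range(n_inputs)]
    # ... outliers would be flagged against the blotted files here ...
    if not save_intermediate_results:
        remove_files(blotted)  # not needed once outliers are flagged

    # Report whatever is still on disk after the toy step finishes.
    return sorted(os.listdir(directory))


with tempfile.TemporaryDirectory() as workdir:
    leftover = run_detection(workdir, n_inputs=3)
```

Because each set of files is removed at the point where it stops being useful, no separate list of output names has to be carried to the end of the step.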

jwst/outlier_detection/outlier_detection_spec.py

jwst/outlier_detection/outlier_detection_step.py

braingram · 2024-05-08T15:11:11Z

jwst/resample/resample.py

                output_model.save(output_name)
+                if self.mk_output_list:
+                    self.output_list.append(output_name)


It looks like output_list contains the same items as output_models. What required adding the new list?

The list contains the strings of the full paths of the files.

Isn't that the same thing that's in output_models? line 315 contains:

self.output_models.append(output_name)

which appends the same variable as:

self.output_list.append(output_name)

Yes, I agree that the two lines append the same name. However, when you print self.output_models you get a model container and not a list of strings.

Thanks! I did miss that point, and it looks like there is an issue with resample (one that exists on main) where it creates the container with an unknown argument with presumably the expectation that the models are not used:

jwst/jwst/resample/resample.py

Line 186 in 9e857a5

self.output_models = ModelContainer(open_models=False)

As resample appends filenames it's possible to either:

disable return_open (which I expect is what open_models was intended to do), so that indexing the container returns the filenames

access _models (which will contain the filenames)

I went with the second approach in penaguerrero#1
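The two options can be illustrated with a toy container. This sketch is an assumption-laden stand-in, not the real ModelContainer: `ToyContainer` and its wrapping behaviour are invented for illustration, and only the names `return_open` and `_models` are taken from the comment above.

```python
class ToyContainer:
    """Toy stand-in for a container of models; NOT the real ModelContainer."""

    def __init__(self, return_open=True):
        self.return_open = return_open
        self._models = []  # raw entries, here plain filename strings

    def append(self, item):
        self._models.append(item)

    def __getitem__(self, index):
        entry = self._models[index]
        if self.return_open and isinstance(entry, str):
            # A real container would open the file; the toy just wraps the name.
            return {"opened_from": entry}
        return entry


container = ToyContainer()
container.append("/tmp/out/exp1_outlier_i2d.fits")

opened = container[0]                # a wrapped "model", not a string
filenames = list(container._models)  # second option: read the raw entries

container.return_open = False
as_filename = container[0]           # first option: indexing yields the string
```

Reading the raw entries relies on a private attribute, which is the trade-off of the second option; the first option keeps to the public indexing interface but changes how the container behaves for every caller.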

jwst/outlier_detection/outlier_detection.py

jwst/outlier_detection/outlier_detection_spec.py

hbushouse · 2024-05-08T19:27:53Z

CI style check is failing.

hbushouse

Looks good now.

braingram

Thanks for making the changes. I don't think I did a good job explaining the alternative approach. I opened a PR against your branch to hopefully illustrate the idea:
penaguerrero#1
It:

removes the mk_output_list parameter from the step spec
removes the changes to resample
removes output_list
deletes drizzled_models after they are no longer needed
deletes blot_models after they are no longer needed

I ran the outlier detection unit tests but not the full unit tests (or regtests) for the PR against your branch. Please take a look at the PR.

It seems possible to fix this issue without changes to resample (to help keep the API for the 2 steps separate).

Would you add a unit test that fails on jwst master but passes with this PR, both to illustrate the fix and to make sure the upcoming outlier detection and resample changes do not re-introduce these issues?
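A test of the kind requested here might look roughly like the sketch below. It is only an outline under stated assumptions: `fake_outlier_step` is a hypothetical stand-in for running the real step, and the file name is invented. The `*_outlier_i2d.fits` pattern and the `save_intermediate_results` flag are taken from the discussion.

```python
import glob
import os
import tempfile


def fake_outlier_step(output_dir, save_intermediate_results=False):
    """Hypothetical stand-in for running the step; writes, then cleans up."""
    path = os.path.join(output_dir, "exp1_outlier_i2d.fits")
    with open(path, "w") as handle:
        handle.write("placeholder")
    if not save_intermediate_results:
        os.remove(path)


def leftover_i2d_files(output_dir):
    """Return any *_outlier_i2d.fits files remaining in ``output_dir``."""
    return sorted(glob.glob(os.path.join(output_dir, "*_outlier_i2d.fits")))


def test_no_leftover_intermediate_files():
    # With intermediate results disabled, nothing should remain on disk.
    with tempfile.TemporaryDirectory() as output_dir:
        fake_outlier_step(output_dir, save_intermediate_results=False)
        assert leftover_i2d_files(output_dir) == []


def test_intermediate_files_kept_on_request():
    # With intermediate results enabled, the file lands in the output directory.
    with tempfile.TemporaryDirectory() as output_dir:
        fake_outlier_step(output_dir, save_intermediate_results=True)
        assert len(leftover_i2d_files(output_dir)) == 1
```

Checking the specified output directory in the second test also exercises the output-location behaviour that the other ticket describes.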

delete files after they're no longer useful

penaguerrero · 2024-05-09T12:50:23Z

started a new reg test run at https://plwishmaster.stsci.edu:8081/job/RT/job/JWST-Developers-Pull-Requests/1431/

penaguerrero · 2024-05-09T15:45:58Z

failed tests are unrelated to this PR

hbushouse · 2024-05-09T19:40:00Z

@braingram @penaguerrero Is everyone happy now? Are the changes in resample/resample_spec still necessary or not?

braingram · 2024-05-09T19:44:42Z

Would you add a unit test that fails on jwst master but passes with this PR, both to illustrate the fix and to make sure the upcoming outlier detection and resample changes do not re-introduce these issues?

@penaguerrero is a test for JP-3311 doable as part of this PR?

I think the resample changes are needed for JP-3039.

hbushouse · 2024-05-10T12:35:44Z

Given that this PR contains a fix for a persistent regtest error I'd like to go ahead and get this merged ASAP. We can always submit a unit test or regtest for this condition in a separate PR. Is there anything that anyone thinks really needs changing or updating yet in this PR before merging?

braingram · 2024-05-10T12:42:18Z

Is the regtest fix the changes in test_nirspec_mos_spec3? Are those related to the outlier detection changes? If not, wouldn't a different PR that describes those changes make sense?

I'll dismiss my review and leave it up to @hbushouse but I believe the outlier detection changes warrant a test. For example, I do not know if the changes in my PR against the source branch for this PR even addressed the issue.

braingram · 2024-05-10T12:43:20Z

I don't see a button to dismiss my own review so I marked it as 'approved' to remove my changes request.

made a series of modifications to produce an output file list

2c56f9b

penaguerrero requested a review from a team as a code owner May 7, 2024 13:12

github-actions bot added outlier_detection resample labels May 7, 2024

penaguerrero and others added 2 commits May 7, 2024 09:16

added PR explanation

c0dd793

Merge branch 'master' into jp3311

ff6f5d5

Merge branch 'master' into jp3311

caa9013

hbushouse requested a review from braingram May 7, 2024 19:17

fixing silly mistake in reg test

5679f6a

github-actions bot added NIRSPEC spec3 pipeline regression_testing labels May 7, 2024

Merge branch 'master' into jp3311

744a608

Merge branch 'master' into jp3311

90d817f

Merge branch 'master' into jp3311

5333b50

hbushouse added this to the Build 11.0 milestone May 8, 2024

braingram reviewed May 8, 2024

cleaning up code

3337fb5

melanieclarke mentioned this pull request May 8, 2024

JP-3613: Update error arrays to match NaN in data #8463

Merged

7 tasks

hbushouse reviewed May 8, 2024

jwst/outlier_detection/outlier_detection.py

jwst/outlier_detection/outlier_detection_spec.py

penaguerrero and others added 3 commits May 8, 2024 14:56

Merge branch 'master' into jp3311

fcd0645

minor change

6de36aa

minor change

3f919d9

penaguerrero and others added 2 commits May 8, 2024 15:35

minor change

fee7539

Merge branch 'master' into jp3311

cff4ead

hbushouse approved these changes May 8, 2024

hbushouse requested a review from braingram May 8, 2024 20:30

penaguerrero and others added 2 commits May 8, 2024 16:55

Merge branch 'master' into jp3311

70c67c8

delete files after they're no longer useful

f5d0884

braingram requested changes May 8, 2024

penaguerrero and others added 2 commits May 9, 2024 08:33

Merge pull request #1 from braingram/jp3311_bjg

7eb54ac

delete files after they're no longer useful

minor change

2b006d2

Merge branch 'master' into jp3311

30f6d15

braingram self-requested a review May 10, 2024 12:42

braingram approved these changes May 10, 2024

hbushouse merged commit 8b254ae into spacetelescope:master May 10, 2024
23 of 24 checks passed

This was referenced May 10, 2024

OutlierDetectionStep producing extraneous outlier_i2d files #7747

Closed

Output files *_outlier_i2d.fits not saved to the specified output_dir #8063

Closed

penaguerrero mentioned this pull request May 13, 2024

adding tests for changes in JP-3311 and minor fixes #8481

Merged

8 tasks

braingram mentioned this pull request May 14, 2024

JP-3584: Use rolling window median for TSO outlier detection #8473

Merged

8 tasks

braingram mentioned this pull request Jul 21, 2024

Output files *_outlier_i2d.fits not saved to the specified output_dir #7419

Closed
