Splitting part of source_detection.ipynb off into separate imaging.ipynb example notebook #610
Conversation
Codecov Report
Attention: Patch coverage is …

@@            Coverage Diff             @@
##             main     #610      +/-   ##
==========================================
+ Coverage   68.63%   68.78%   +0.15%
==========================================
  Files          53       54       +1
  Lines        5714     5748      +34
==========================================
+ Hits         3922     3954      +32
- Misses       1792     1794       +2

Flags with carried forward coverage won't be shown.
Hmm, I have mixed feelings about this PR. I agree with the split, because otherwise there is too much workflow in a single notebook. However, the reasons for my mixed feelings are:
- imaging.ipynb is not included in the tests.
- This PR introduces a lot of duplication, which makes testing longer, maintainability harder, and is potentially confusing for a user (I would be confused and double-check whether the duplicated code is really the same or not).

I think a better solution would be to split all notebooks into smaller parts. Each notebook can start with an instantiation of the corresponding class (e.g. Visibility) from a file. Here's a suggestion on how to separate the notebooks:
- Visibility simulation
- Imaging
- Source detection

Such a solution would also make it clear that you can, for example, take visibilities from any archive, not just Karabo-simulated visibilities.

For implementation & testing, such an approach would need some modifications (apart from splitting the notebooks themselves & testing them). I suggest the following:
- Take the relevant file/dir paths (input and/or output products) from env vars starting with the prefix KARABO_, and set the variable(s) before launching a notebook test in the corresponding test_notebooks.py test.
- Adapt clean_disk in conftest.py to clean up after each module or session, not after each function. This should ensure cleanup (I think also if a test fails?).
- Make the tests ordered, because they would rely on the execution of the previous test(s). Tools which can help are pytest-order & pytest-dependency.
- Maybe something else I didn't think of?

What are your thoughts on that?
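To illustrate the env-var hand-off between ordered notebook tests, here is a minimal stdlib-only sketch. The variable name KARABO_VIS_PATH, the file names, and the function bodies are hypothetical; in a real test suite these would be pytest tests ordered via pytest-order, each actually executing a notebook.

```python
import os
import tempfile

# Hypothetical sketch: each ordered test publishes the path of its
# output product in a KARABO_-prefixed env var so the next test can
# consume it instead of re-running the simulation.

def test_visibility_simulation() -> str:
    """Stand-in for executing the visibility-simulation notebook."""
    out_path = os.path.join(tempfile.gettempdir(), "example_visibilities.dat")
    with open(out_path, "w") as f:
        f.write("placeholder visibility data")  # real test: run the notebook
    os.environ["KARABO_VIS_PATH"] = out_path    # hand the product onward
    return out_path

def test_imaging() -> str:
    """Stand-in for the imaging notebook, consuming the previous product."""
    vis_path = os.environ["KARABO_VIS_PATH"]    # set by the previous test
    assert os.path.exists(vis_path), "simulation test must run first"
    return vis_path
```

The same pattern extends to a third source-detection step consuming the imaging output.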
In addition, writing down the purpose of this PR in the description would be more helpful than pointing to a JIRA ticket that requires permissions to access.
I added the corresponding test and a small description in the original comment, as those are no-brainers. I copied that format from someone who must've been equally lazy, my bad. As for the rest, it might be best to wait until Michel is back, since I did this implementation per his instructions. I do agree that the duplication is bad, for the reasons you listed. When discussing this with Michel, we could not come up with a better solution, as running the initial simulation would be part of the expected workflow of a scientist doing either imaging or source detection.
Sure, we can wait. I also discussed this matter with Mathias Graf, and we more or less arrived at the solution I proposed. And since the changes are small (at least if I've considered every aspect of it, which might not be the case), it's hard for me to see why we shouldn't take the suggested approach.
The idea was for this PR to be just about splitting up source detection and imaging, but since the topic of simulation-code duplication came up multiple times during the last weeks (e.g. Lukas' metadata example script, IIRC), it might indeed be a good idea to address it right now. I could imagine a slightly simpler solution than Lukas suggested: what about just having a saved FITS file that lives in the repo, and replacing the simulation code with simply loading that file? That would save us the env vars, cleanup, and ordered tests.
Depends on the size. It also means that we'd have to update serialized files whenever internals change. Why don't we add production of example data to the Karabo API? A function à la …
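Such a helper could look roughly like this; the name make_example_data, its signature, and the file-based caching are purely illustrative assumptions, not existing Karabo API:

```python
import os
import tempfile
from typing import Optional

def make_example_data(cache_dir: Optional[str] = None) -> str:
    """Produce a small example data product, reusing a cached copy if present.

    The idea: run a cheap simulation once and cache its result on disk,
    so repeated callers across the code base pay the simulation cost
    only the first time.
    """
    cache_dir = cache_dir or os.path.join(
        tempfile.gettempdir(), "karabo_example_data"
    )
    os.makedirs(cache_dir, exist_ok=True)
    out_path = os.path.join(cache_dir, "example_visibilities.dat")
    if not os.path.exists(out_path):
        # Placeholder for a small, fast simulation run.
        with open(out_path, "wb") as f:
            f.write(b"simulated visibilities")
    return out_path
```

Because the product is generated by current code rather than checked in, it never goes stale when internals change, unlike a serialized file in the repo.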
Sounds like a good solution. It can be a small simulation, so it doesn't take much time when run repeatedly in different parts of the code base.
Not sure... because to me it seems like the solution I proposed is way easier to implement and doesn't require any dedicated file-generation function, just a …

However, since you both don't agree, I'm not really sure what to say other than that I disagree. What matters to me in the end is to have minimal code duplication, for maintainability & example clarity.
Looks good apart from the points addressed in my comments, thanks for your work.
LGTM
@Lukas113 merging is blocked because of your change request in August. Do you want to have another look, or alternatively withdraw the change request?
The purpose of this PR is to split the examples on how to use Karabo for imaging out of the notebook that provides the source detection examples, leaving only the WSClean algorithm in the source detection notebook.
This corresponds to https://jira.skatelescope.org/browse/CHOC-18