Unify functionality of super and hyperrun #871

dachengx · 2024-08-19T03:31:08Z

What is the problem / what does the code in this PR do

Caveat: sometimes the saved superurn with _combining_subruns = True and _combining_subruns = False are different when the combined subruns under _combining_subruns = False has "long range force", like the example in test_only_combining_superruns. However, this is not usually the case in real data processing. But users need to be clear about what they are doing.

Sorry for the back-and-forth, after this PR:

When run_id starts with _ and _combining_subruns is True when get_iter, the context will make the targets and combine them according to the run metadata.
When run_id starts with _ and _combining_subruns is False when get_iter, the context will make the subruns' targets untill the dependency tree starts to support superrun (allow_superrun is True). The first plugin whose allow_superrun is True will merge the depends_on of subruns according to the run metadata, and then the following processing will all be done in the scope of superrun.

To achieve these, the main change is that we handle superrun, especially the subruns in Plugin, not Rechunker anymore.

Also, completely deprecate storage_converter.

Can you briefly describe how it works?

Change the logic in get_components.

Can you give a minimal working example (or illustrate with a figure)?

The most important test might be test_only_combining_superruns and test_loaders_and_savers in TestSuperRuns. Please look at them.

components = self.context.get_components(self.superrun_name, "peak_classification")
# Because records is not allow_superrun
assert "records" in components.loaders
# Because though we call for peak_classification,
# peaks already allow_superrun
assert "peaks" not in components.loaders
# peaks and lone_hits should all be saved
assert "peaks" in components.savers
assert "lone_hits" in components.savers
# of course peak_classification should be saved
assert "peak_classification" in components.savers

# When we make superrun, subruns of the targeted data_type should
# be first made individually and combined.
components = self.context.get_components(
    self.superrun_name, "peak_classification", _combining_subruns=True
)
assert len(components.loaders) == 1
assert "peak_classification" in components.loaders

When _combining_subruns is True, context will process each subrun to the targeted data_type, and then combine subruns.

Please include the following if applicable:

Update the docstring(s)
Update the documentation
Tests to check the (new) code is working as desired.
Does it solve one of the open issues on github?

Please make sure that all automated tests have passed before asking for a review (you can save the PR as a draft otherwise).

coveralls · 2024-08-22T01:37:53Z

coverage: 90.283% (+0.5%) from 89.802%
when pulling 35662fb on unify_super_hyperrun
into 95f8ca2 on master.

…s `True`

dachengx added 11 commits August 18, 2024 22:29

Unify functionality of super and hyperrun

f34774b

Check subruns and superrun information

00cd4c0

Separate workflow of combining subruns and process superrun

b0bf27c

Little debug

2faccad

Merge superrun updating function into _fix_output

dfbaf18

Simplify code and add back a check

42f99ff

Completely remove hyperruns and add more tests about superrun processing

d25af02

Use _combining_subruns to indicate only combining subruns

993964a

Minor change

d175bdf

Minor change

bd14cb6

First unify time-offset

f575cf3

dachengx marked this pull request as ready for review August 22, 2024 02:19

dachengx added 5 commits August 21, 2024 21:22

Assert data_type not stored

042ebfe

No more hyperrun

1f61b41

Debug

c3fe256

Raise error when call for multiple target when _combining_subruns i…

654e740

…s `True`

More tests about subruns information

35662fb

dachengx merged commit ac232cc into master Aug 22, 2024
8 checks passed

dachengx deleted the unify_super_hyperrun branch August 22, 2024 07:32

This was referenced Aug 25, 2024

Be compatible with new Plugin.run_id XENONnT/straxen#1410

Merged

Add combining into the DataKey #886

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify functionality of super and hyperrun #871

Unify functionality of super and hyperrun #871

dachengx commented Aug 19, 2024 •

edited

Loading

coveralls commented Aug 22, 2024 •

edited

Loading

Unify functionality of super and hyperrun #871

Unify functionality of super and hyperrun #871

Conversation

dachengx commented Aug 19, 2024 • edited Loading

coveralls commented Aug 22, 2024 • edited Loading

dachengx commented Aug 19, 2024 •

edited

Loading

coveralls commented Aug 22, 2024 •

edited

Loading