Implement Hyperrun #838

dachengx · 2024-05-02T20:42:32Z

What is the problem / what does the code in this PR do

This PR implements hyperruns. Hyperruns are a kind of superrun: processing a hyperrun also depends on run_metadata, which is defined in the same way as for superruns (see https://straxen.readthedocs.io/en/latest/tutorials/SuperrunsExample.html#Define-a-superrun:).

If a hyperrun has run_id __000000, the plugin.run_id of the plugins used in processing will be 000000, and the subruns of the hyperrun will be mixed (concatenated) during processing.

A new attribute, allow_hyperrun, is added to the strax.Plugin class. A data_type that depends on a data_type whose allow_hyperrun is True must itself have allow_hyperrun set to True. In other words: if your parent is True, you must be True as well.
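The propagation rule for allow_hyperrun can be sketched in plain Python, independent of strax. Everything below (the dependency map, the flag dictionaries, and the helper name check_allow_hyperrun) is hypothetical and only illustrates the rule as described above; it is not how strax implements the check.

```python
# Hypothetical toy model of the allow_hyperrun rule; NOT strax code.
def check_allow_hyperrun(depends_on, allow_hyperrun):
    """Return (child, parent) pairs where the parent allows hyperruns
    but the child does not, i.e. where the rule is violated."""
    violations = []
    for data_type, parents in depends_on.items():
        for parent in parents:
            parent_ok = allow_hyperrun.get(parent, False)
            child_ok = allow_hyperrun.get(data_type, False)
            if parent_ok and not child_ok:
                violations.append((data_type, parent))
    return violations


# Assumed dependency chain: ranges -> sum -> other
depends_on = {"ranges": [], "sum": ["ranges"], "other": ["sum"]}

flags_ok = {"ranges": True, "sum": True, "other": True}
flags_bad = {"ranges": True, "sum": False, "other": True}

ok_violations = check_allow_hyperrun(depends_on, flags_ok)
bad_violations = check_allow_hyperrun(depends_on, flags_bad)
print(ok_violations)
print(bad_violations)
```

In the second configuration only sum is reported, because it is the only data_type whose direct parent is True while it is False itself.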

Can you briefly describe how it works?

The key feature of hyperrun is that the chunks of subruns can be loaded together.

For superruns, the subruns are still made and loaded separately. For hyperrun, we can really treat the combination of subruns as a single run in the context of processing.

For example, in the added test, the data_type sum depends on ranges. The "data" in ranges is just an ordered sequence of numbers with an offset, like [0, 1, 2, 3, 4, 5, 6, 7, 8, 9] and [10, 11, 12, 13, 14, 15, 16, 17, 18, 19]. The sum plugin computes ranges["data"] + ranges["data"][::-1], so for these two inputs you will get [9] * 10 or [29] * 10.

First, note that we will have 3 runs, which serve as the subruns. The "data" in the ranges of the 3 subruns is assigned to be [0, 1, 2, 3, 4, 5, 6, 7, 8, 9], [10, 11, 12, 13, 14, 15, 16, 17, 18, 19], and [20, 21, 22, 23, 24, 25, 26, 27, 28, 29].

If the loaded sum is from a superrun, the sum will be [9] * 10 + [29] * 10 + [49] * 10, because the subruns are still calculated separately.
If the loaded sum is from a hyperrun, the sum will be [29] * 30, because the subruns are first combined into "data" of length 30, and only then is ranges["data"] + ranges["data"][::-1] run. So the subruns are really loaded as if they were the same run.
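The arithmetic above can be reproduced with plain NumPy, without strax. This is only a sketch of the numbers: the function name mirrored_sum and the variable names are made up for illustration, and the real test uses strax plugins rather than bare arrays.

```python
import numpy as np

# "data" of the three subruns, as described above
subruns = [np.arange(0, 10), np.arange(10, 20), np.arange(20, 30)]


def mirrored_sum(data):
    """Same expression as in the text: data + data[::-1]."""
    return data + data[::-1]


# Superrun-like: compute per subrun, then concatenate the results
superrun_like = np.concatenate([mirrored_sum(d) for d in subruns])

# Hyperrun-like: concatenate the subruns first, then compute once
hyperrun_like = mirrored_sum(np.concatenate(subruns))

print(superrun_like)
print(hyperrun_like)
```

The only difference between the two results is the order of "concatenate" and "compute", which is exactly the distinction between superruns and hyperruns described here.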

Can you give a minimal working example (or illustrate with a figure)?

The example is in the added test.

Please include the following if applicable:

Update the docstring(s)
Update the documentation
Tests to check the (new) code is working as desired.
Does it solve one of the open issues on github?

Please make sure that all automated tests have passed before asking for a review (you can save the PR as a draft otherwise).

coveralls · 2024-05-03T01:47:32Z

coverage: 90.719% (-0.5%) from 91.17%
when pulling a007517 on hyperrun
into 954bc39 on master.

yuema137 · 2024-05-07T23:12:06Z

Hi @dachengx this PR looks good to me, but I'm a little bit confused:

Why do we add hyperrun on top of superrun? It seems to me that their duties largely overlap, and that the restrictions and checks for hyperrun could be applied to superrun as well.
It would be great if you could provide a scenario in which we need this feature.

dachengx · 2024-05-08T02:29:12Z

I will write a test module for this PR. Sorry, actually this PR is not fully ready. In the test, I will show how hyperruns and superruns are different.

dachengx · 2024-05-11T09:25:54Z

@yuema137 the test is added

yuema137

Hi @dachengx, I read the changes carefully, and now I think I understand this much better:

In get_components, the treatment of a hyperrun is actually similar to that of a normal run, which means there is only a single loader. For superruns, a loader is defined for each subrun.
The difference is that for hyperruns more checks are done to guarantee that the targets to load satisfy the requirements for hyperruns.
Therefore a hyperrun is precisely equivalent to a single run from the point of view of strax/straxen. So when you use an ExhaustPlugin, the processor will regard several runs as a single one and give you the whole chunk. For normal plugins, there should be no difference.

The test works fine for me. Please merge it if my understanding is correct. Otherwise please let me know.

zihaoxu98 · 2024-05-20T00:28:32Z

Thanks for this hyper interesting PR. Although I cannot follow all of the code, my understanding from the test case is that we always need ExhaustPlugin when we want to concatenate the chunks and then do the computation, right? I am asking because I found that the documentation of ExhaustPlugin is missing, and the test gives the wrong result if I change it to a normal Plugin.

yuema137 · 2024-05-21T22:45:40Z

@zihaoxu98 The purpose of Hyperrun is to let strax regard the run list as a single run (for a superrun this is not the case, as strax still treats the subruns as separate runs). It doesn't make observable changes for normal plugins because the data is still chunked, and the computation is done chunk by chunk.
Only with ExhaustPlugin, which tries to combine whatever chunks it can find within a single run, is the computation performed on the combined big chunk as we expect. So basically, ExhaustPlugin merges all possible chunks in a single run, and Hyperrun breaks the boundary between runs.
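A rough NumPy sketch of why the difference only shows up for computations that need the whole input. This is a toy model, not strax: the chunk list, the helper names per_chunk and merged, and the two example computations are assumptions for illustration. per_chunk stands in for chunk-by-chunk processing, and merged stands in for what an ExhaustPlugin-style computation sees.

```python
import numpy as np

# Toy stand-in for the chunks of a run list; NOT strax chunks.
chunks = [np.arange(0, 5), np.arange(5, 10), np.arange(10, 15), np.arange(15, 20)]


def doubled(data):
    """Elementwise computation: the result does not depend on chunking."""
    return 2 * data


def mirrored_sum(data):
    """Needs the whole input: the result depends on what is merged."""
    return data + data[::-1]


def per_chunk(func, chunks):
    # Chunk-by-chunk processing, results concatenated afterwards
    return np.concatenate([func(c) for c in chunks])


def merged(func, chunks):
    # All chunks combined first, then one computation
    return func(np.concatenate(chunks))


same_for_elementwise = np.array_equal(
    per_chunk(doubled, chunks), merged(doubled, chunks)
)
same_for_whole_input = np.array_equal(
    per_chunk(mirrored_sum, chunks), merged(mirrored_sum, chunks)
)
print(same_for_elementwise, same_for_whole_input)
```

The elementwise computation gives the same answer either way, while the whole-input computation does not, which matches the statement that normal chunk-wise plugins see no observable change.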

Implement Hyperrun

1375186

dachengx marked this pull request as ready for review May 2, 2024 20:50

Check hyperrun in dependency tree

5a37cc5

dachengx marked this pull request as draft May 2, 2024 21:01

dachengx mentioned this pull request May 3, 2024

Implement Hyperruns XENONnT/axidence#65

Merged

Debug for _target_should_be_saved

c5ebcf6

dachengx marked this pull request as ready for review May 3, 2024 04:12

Minor debug

32f5aa9

yuema137 self-requested a review May 7, 2024 22:41

yuema137 reviewed May 7, 2024

strax/chunk.py (outdated, resolved)

yuema137 reviewed May 7, 2024

strax/context.py (resolved)

Remove unnecessary codes

4672055

dachengx added 2 commits May 11, 2024 04:14

Add test of hyperruns

1f1370b

Merge branch 'hyperrun' of github.com:AxFoundation/strax into hyperrun

a007517

yuema137 self-requested a review May 14, 2024 02:53

yuema137 approved these changes May 14, 2024


dachengx merged commit d66cd35 into master May 14, 2024
8 of 9 checks passed

dachengx deleted the hyperrun branch May 14, 2024 05:27

dachengx added a commit to XENONnT/straxen that referenced this pull request May 16, 2024

Fix a bug in test caused by AxFoundation/strax#838

b0bc81c

dachengx mentioned this pull request May 16, 2024

Fix a bug in test XENONnT/straxen#1381

Merged

4 tasks

dachengx added a commit to XENONnT/straxen that referenced this pull request May 16, 2024

Fix a bug in test caused by AxFoundation/strax#838 (#1381)

b7e3db1
