Decision tree step incorrectly assumes a sorted component table #923

handwerkerd · 2023-01-05T21:52:09Z

Summary

There is a step in the decision tree where the first X remaining components should be used for something where the components are sorted by variance. MEICA sorted components by variance but tedana does not. That means tedana is incorrectly using a semi-random subset of components instead of the highest variance components.

Additional Detail

These are the key lines

tedana/tedana/selection/tedica.py

Lines 388 to 392 in f00cb25

    
           # Find components to ignore 
        
           # Ignore high variance explained, poor decision tree scored components 
        
           new_varex_lower = stats.scoreatpercentile( 
        
               comptable.loc[unclf[:num_acc_guess], "variance explained"], LOW_PERC 
        
           )

This will be fixed in #756 but we might want to also fix in the current code so that pre and post decision-tree- modularization results will perfectly match.

This is a bug, but it will not affect the denoised time series. The new_varex_lower threshold is used to decide if components are accepted or ignored With the current version, it's possible for more components than intended to end up ignored but they'll still be retained in the denoised time series.

Next Steps

Confirm this is a real bug
I'm fairly sure MEICA sorted components with the highest variance being component 0 so the highest variance components should be retained (the comment in the code might be wrong). Look back at some MEICA outputs to confirm this correct
Decide if we want to fix this in the current Main so that it will match the modularized output.

The text was updated successfully, but these errors were encountered:

tsalo · 2023-01-05T22:02:42Z

This sounded familiar, so I searched around and found #295. Do you know if that PR fixed the issue?

handwerkerd · 2023-01-05T22:24:31Z

I did also remember having this issue before. Thank you for connecting these two. You solved the same issue in a different part of the code.

handwerkerd added bug issues describing a bug or error found in the project priority: medium Should get addressed soon effort: low Theoretically less than a day's work impact: medium Improves code/documentation functionality for some users labels Jan 5, 2023

handwerkerd mentioned this issue Jan 8, 2023

Sorting varex for decision tree criterion I011 #924

Merged

jbteves closed this as completed in #924 Feb 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decision tree step incorrectly assumes a sorted component table #923

Decision tree step incorrectly assumes a sorted component table #923

handwerkerd commented Jan 5, 2023

tsalo commented Jan 5, 2023

handwerkerd commented Jan 5, 2023

Decision tree step incorrectly assumes a sorted component table #923

Decision tree step incorrectly assumes a sorted component table #923

Comments

handwerkerd commented Jan 5, 2023

Summary

Additional Detail

Next Steps

tsalo commented Jan 5, 2023

handwerkerd commented Jan 5, 2023