-
Notifications
You must be signed in to change notification settings - Fork 374
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Refactor structure awareness in packm_blk_var1.c. (#707)
Details: - Factored some of the structure awareness out of the loop in bli_packm_blk_var1(). So instead of having a single loop with conditionals in the body to handle various kinds of structure (and stored/unstored submatrix placement), we now have a conditional branch to handle various structure/storage scenarios with a loop in each section. This change was originally motivated to choose slab or round- robin partitioning (in the context of triangular matrices) based on the structure of the entire block (or panel) being packed rather than each micropanel individually. Previously, the code would attempt to limit rr to the portion of the block that intersects the diagonal and use slab for the remainder. However, that approach was not well-thought out and in many situations this would lead to inferior load balancing when compared to using round-robin for the entire block (or panel). This commit has the added benefit of incurring less overhead during the packing process now that each of the new loops is simpler.
- Loading branch information
1 parent
f956b79
commit b6735ca
Showing
1 changed file
with
87 additions
and
87 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters