Merge release/2.6 into google/2.6 #15759
mjmac
commented
Jan 22, 2025
- DAOS-16477 pool: Rename Suspect state to Dead (#15716)
- DAOS-16469 dtx: properly handle DTX partial commit - b26 (#15399)
- DAOS-16868 bio: Skip LED reset on absent bdev (#15718) (#15733)
- DAOS-16897 object: add N+3 EC object class (#15649) (#15730)
- DAOS-16374 vos: integer overflow on evt recx trace (#15439) (#15533)
Change the name to more closely reflect the underlying SWIM status, and reduce user confusion. An engine that has been marked DEAD by SWIM cannot participate in pool services, and has most likely already SIGKILL-ed itself. Update "engines become unavailable" in admin/troubleshooting.md to link to the new "suspect ranks" description. Signed-off-by: Li Wei <[email protected]> Signed-off-by: Michael MacDonald <[email protected]> Signed-off-by: Wang Shilong <[email protected]> Co-authored-by: Michael MacDonald <[email protected]> Co-authored-by: Li Wei <[email protected]>
When a DTX leader globally commits a DTX, some DTX participants may be unable to commit the DTX entry because of issues such as network or space trouble. In that case, the DTX leader needs to keep the active DTX entry persistently for further commit/resync. That does not mean the modification attached to the DTX entry on the leader target cannot be committed; instead, the leader can commit the modification and keep only the DTX header. The header is enough for the DTX leader to do further DTX commit/resync to handle the formerly failed DTX participants. The benefit is that VOS aggregation on the leader target will not be affected by remote DTX commit failure. Signed-off-by: Fan Yong <[email protected]>
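The partial-commit idea in the message above can be modeled with a small sketch. This is an illustration in Python, not DAOS code: `LeaderDtxEntry`, `leader_commit`, `resync`, and the `commit_on_participant` callback are all hypothetical names chosen for this example, and the real logic lives in the C DTX/VOS layers.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class LeaderDtxEntry:
    """Hypothetical model of the leader's view of one DTX entry."""
    dtx_id: str
    participants: List[str]
    # True once the leader's own modification has been committed locally.
    modification_committed: bool = False
    # Participants that still need a commit; this stands in for the DTX header.
    pending: Set[str] = field(default_factory=set)


def leader_commit(entry: LeaderDtxEntry,
                  commit_on_participant: Callable[[str], bool]) -> bool:
    """Commit globally; keep only the header if some participants fail."""
    failed = {p for p in entry.participants if not commit_on_participant(p)}
    # The local modification is committed regardless of remote failures,
    # so aggregation on the leader is not blocked by this entry.
    entry.modification_committed = True
    entry.pending = failed
    # True means nothing is pending and the entry can be dropped entirely.
    return not failed


def resync(entry: LeaderDtxEntry,
           commit_on_participant: Callable[[str], bool]) -> bool:
    """Retry the commit on participants that failed earlier."""
    entry.pending = {p for p in entry.pending if not commit_on_participant(p)}
    return not entry.pending
```

In this model, a failed participant leaves the leader with a committed modification plus a retained header, and a later `resync` clears the header once that participant commits.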
Signed-off-by: Tom Nabarro <[email protected]>
Currently, DAOS supports EC (Erasure Coding) object classes with redundancy levels of N+1 and N+2. In certain scenarios, users may wish to use N+3 for enhanced redundancy and safety. Generally, DAOS’s EC and rebuild mechanisms are designed to handle various parity levels effectively. With the introduction of new object classes, comprehensive testing should be conducted to ensure that these changes do not introduce any unexpected issues or disrupt existing functionality. Extend test cases to cover EC_4P3X object classes as minimum test coverage. Signed-off-by: Wang Shilong <[email protected]>
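As a rough illustration of what moving from N+1 or N+2 to N+3 costs and buys, the arithmetic can be sketched as below. The helper `ec_profile` is a hypothetical name for this example, and the sketch assumes an MDS-style code in which p parity cells tolerate the loss of any p cells per stripe.

```python
def ec_profile(data_cells: int, parity_cells: int) -> dict:
    """Summarize an N+P erasure-coding layout (illustrative only)."""
    if data_cells <= 0 or parity_cells < 0:
        raise ValueError("data_cells must be positive, parity_cells non-negative")
    total = data_cells + parity_cells
    return {
        "total_cells": total,
        # Assumption: an MDS code survives the loss of any `parity_cells` cells.
        "tolerated_failures": parity_cells,
        # Raw space consumed per unit of user data.
        "storage_overhead": total / data_cells,
    }


# Compare 4+1, 4+2 and 4+3 layouts side by side.
for parity in (1, 2, 3):
    print(f"4+{parity}:", ec_profile(4, parity))
```

Under these assumptions a 4+3 layout stores 7 cells per stripe, which is 1.75 times the user data, in exchange for tolerating three concurrent failures.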
The evt recx trace is used for VOS aggregation debugging, and it is currently reset in the akey iteration callback. However, the akey iteration callback can be skipped in some cases: for example, when evt aggregation hits an aborted recx, it starts over at the evtree level without resetting the recx trace, which can lead to integer overflow of the 'int ap_trace_count'. This patch moves the ap_trace_count reset to merge window open/close to ensure the evt recx trace is always reset properly. Signed-off-by: Niu Yawei <[email protected]> Co-authored-by: Niu Yawei <[email protected]>
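The effect of where the counter is reset can be shown with a toy simulation. This is a Python sketch, not the VOS aggregation code: `simulate_trace_peak` and its parameters are hypothetical, and only the name `ap_trace_count` comes from the commit message.

```python
def simulate_trace_peak(restarts: int, recx_per_pass: int,
                        reset_on_window_open: bool) -> int:
    """Return the largest value a trace counter reaches across restarts."""
    ap_trace_count = 0  # reset once, as the akey iteration callback would do
    peak = 0
    for _ in range(restarts):
        # Each restart models aggregation starting over at the evtree level,
        # which does not go back through the akey iteration callback.
        if reset_on_window_open:
            ap_trace_count = 0  # the patched behaviour: reset with the window
        for _ in range(recx_per_pass):
            ap_trace_count += 1
            peak = max(peak, ap_trace_count)
    return peak


# Without the window-level reset the counter grows with every restart.
print(simulate_trace_peak(5, 10, reset_on_window_open=False))
# With the reset tied to the merge window it is bounded by one pass.
print(simulate_trace_peak(5, 10, reset_on_window_open=True))
```

In the unpatched model the counter grows without bound as restarts accumulate, which is how a C `int` could eventually overflow; tying the reset to the merge window keeps it bounded by the size of one pass.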
Errors are component not formatted correctly, Ticket number prefix incorrect, PR title is malformatted. See https://daosio.atlassian.net/wiki/spaces/DC/pages/11133911069/Commit+Comments, Unable to load ticket data
Skip-nlt: true Features: pool Change-Id: I9e7d49d5c31dc048546de2010a88fcca5ff4a9d2 Signed-off-by: Michael MacDonald <[email protected]>
477cb4f to 7faea33
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15759/2/execution/node/1120/log