Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bs kvg improve model #41

Merged
merged 15 commits into from
Dec 2, 2024
Merged

Bs kvg improve model #41

merged 15 commits into from
Dec 2, 2024

Conversation

bshifaw
Copy link
Collaborator

@bshifaw bshifaw commented Nov 7, 2024

  • Adding option to search all short reads, not only the reads that fall within regions of the reference the long reads have aligned to.
  • Breaking detect_relavent_loci function into subfunctions
  • Fix: Handling short read sample records that might have a missing chrom info

kvg and others added 8 commits October 21, 2024 12:02
…the tree each time.

The final merging pass iterates over loci and tries to merge overlapping intervals within each contig. However, the code inserts current_interval directly into new_tree only if there are no overlaps. If there are overlaps, it calculates min_start and max_end to create a merged_interval. If two or more intervals overlap, the code will keep overwriting new_tree in each iteration, potentially losing previously calculated merged intervals from earlier iterations.
found_items
.load(Ordering::Relaxed)
.to_formatted_string(&Locale::en),
tid_to_chrom.get(&read.tid()).unwrap(),
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code was panicking when working a particular sample that may have records missing chromosome info so a default unknown name was added for these situation.

Searching for similar reads (2,479 found, most recent at chrUn_JTFH01001822v1_decoy:470) ... [00:33:20] 737,000,000                   
thread '<unnamed>' panicked at src/hidive/src/rescue.rs:107:55:
called `Option::unwrap()` on a `None` value
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I saw that too. I just put in an "unwrap_or" and supply a contig name of "unknown" if we don't have anything in the tid_to_chrom map.

@bshifaw bshifaw changed the base branch from kvg_improve_model to main December 2, 2024 18:37
bshifaw added 2 commits December 2, 2024 15:36
# Conflicts:
#	src/hidive/src/main.rs
#	src/hidive/src/rescue.rs
#	src/skydive/src/stage.rs
#	train.sh
@kvg kvg merged commit fa10056 into main Dec 2, 2024
1 check passed
@kvg kvg deleted the bs_kvg_improve_model branch December 2, 2024 23:40
kvg added a commit that referenced this pull request Dec 3, 2024
* Auto-detect where to search in a genome to rescue reads

* Accumulate the merged intervals continuously rather than overwriting the tree each time.
The final merging pass iterates over loci and tries to merge overlapping intervals within each contig. However, the code inserts current_interval directly into new_tree only if there are no overlaps. If there are overlaps, it calculates min_start and max_end to create a merged_interval. If two or more intervals overlap, the code will keep overwriting new_tree in each iteration, potentially losing previously calculated merged intervals from earlier iterations.

* added fetches option to rescue.rs, option search all, contig, contig-and-interval, unmapped

---------

Co-authored-by: Kiran Garimella <[email protected]>
Co-authored-by: bshifaw <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants