Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reference Genome #140

Open
nsmt89 opened this issue Jun 18, 2019 · 3 comments
Open

Reference Genome #140

nsmt89 opened this issue Jun 18, 2019 · 3 comments

Comments

@nsmt89
Copy link

nsmt89 commented Jun 18, 2019

How to run chloroExtractor while providing reference genome for assembly? I tried run it with Illumina paired end data it gave fasta file with more than one contigs. The length for those contigs is quite short from supposedly chloroplast length (about 20k).

@greatfireball
Copy link
Member

greatfireball commented Jun 20, 2019

Dear nsmt89,

Unfortunately, chloroExtractor does not support a reference based assembly, due to its underlying assembler spades. In general it would be possible to use another/additional references during the filter step, but the final assembly is limited by spades. Nevertheless, using an individual reference while filtering would avoid to throw away important read pairs . In case you want to try that we will provide you with an how-to . Just let us know.

@nsmt89
Copy link
Author

nsmt89 commented Jan 20, 2020

Hi,
Sorry for taking some time to answer you offer. Yes I would like to use individual reference while filtering. Can you guide me how can I do that?

Thank you

@iimog
Copy link
Member

iimog commented Jan 27, 2020

Hi @nsmt89
the way to go is creating your own config file e.g. with --create-config and then editing that file, in particular the --ref-cluster entry in the scale_reads.pl call. There you can provide your own reference, it is important to note that this is only used for read scaling not for a reference guided assembly or scaffolding as @greatfireball already noted. It might still increase your result. Another point where you can inject your own reference is the --blastdb option of find_cyclic_graph.pl but this is only for filtering of the assembled contigs. Remember to pass the adjusted config file to chloroExtractor via the --config option.

Two more notes mostly directed at @greatfireball:

  1. It might be possible to provide a reference to spades as --[un]trusted-contigs
  2. We can consider accepting a reference for scaffolding e.g. with RaGOO

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants