Hello,
I was wondering whether it is possible to save the k-mer index table generated from the input sequences to disk and load it again later, in order to speed up clustering. My use case is large datasets with cd-hit-2d: one input dataset would be provided by the user (so its index table would always be computed on the fly), and the other would come from a prepared selection of datasets. For the latter, I would like to precompute the index tables to speed up the overall comparison. I don't know how much of the total runtime is spent building the index tables, but I would expect it to be considerable for large datasets. Please correct me if I'm wrong.
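To illustrate the kind of workflow I have in mind, here is a minimal Python sketch. It is not CD-HIT's actual code or on-disk format: the function names (`build_kmer_index`, `save_index`, `load_index`), the dictionary layout, and the use of `pickle` are all assumptions made purely for illustration.

```python
import pickle
from collections import defaultdict


def build_kmer_index(sequences, k=5):
    """Map each k-mer to {sequence_id: occurrence_count}.

    Hypothetical stand-in for a k-mer index table; the real
    CD-HIT structure may look quite different.
    """
    index = defaultdict(dict)
    for seq_id, seq in enumerate(sequences):
        # Slide a window of length k over the sequence.
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            index[kmer][seq_id] = index[kmer].get(seq_id, 0) + 1
    return dict(index)


def save_index(index, path):
    """Serialize a precomputed index to disk."""
    with open(path, "wb") as fh:
        pickle.dump(index, fh, protocol=pickle.HIGHEST_PROTOCOL)


def load_index(path):
    """Load a previously saved index instead of rebuilding it."""
    with open(path, "rb") as fh:
        return pickle.load(fh)
```

In this picture, the prepared datasets would each go through `build_kmer_index` and `save_index` once, and every later comparison would call `load_index` for the prepared side while building the index for the user's dataset on the fly.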
Please advise whether this is possible at all, or whether it could be done by tweaking the code.
Thank you