Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: sequence run manager support for library linking #760

Open
reisingerf opened this issue Dec 9, 2024 · 3 comments
Open

feat: sequence run manager support for library linking #760

reisingerf opened this issue Dec 9, 2024 · 3 comments
Labels
epic pipeline Workflow/Pipeline Manager

Comments

@reisingerf
Copy link
Member

Ideally we'd be able to link sequence runs to libraries and vice versa.

This linking information would likely have to come from the SampleSheet of each sequence run. There is also other useful / important information in the SampleSheet that makes it worth retrieving and storing in the OrcaBus back-end.

(at a later stage we may even add information from RunInfo.xml and RunParameters.xml but that's another issue)

The ICAv2 BSSH sequencing event should carry a URL to retrieve details of the run and can hopefully be used to identify and retrieve the SampleSheet. The SS info can then be stored by the Sequence Run Manager and a link to the libraries can be established.

@alexiswl
Copy link
Member

alexiswl commented Dec 9, 2024

The bssh fastq copy does exactly this already.
We could add in the same logic into the BCLConvert Manager.

@reisingerf
Copy link
Member Author

Yes, I am aware... I was hoping we could steal some of the Stacky bits ;-)
Not sure I'd put all that into the BCL Convert Manager though...

The reasons for adding it to the Sequence Run Manager are:

  • the SS and linked libraries are initially a property related to the sequencing run
  • the SRM comes earlier in the chain (and data would be available earlier)
  • the SRM would store this even if no conversion would ever happen
  • we have already a SRM back-end with persistent storage and API
  • the BCL Convert Manager is more of an execution service, which usually don't have a persistent back-end (at least not an exposed one anyway)

An argument can be made that the SS is mainly used for conversion and could change even though the run does not.
However, the SS is still tightly bound to the run info and I keep the option of multiple SS per run on the SRM. The BCL manager could then reference the SS in the data payload of its WRSC...

Hope that makes sense.
Open for discussion though!

@alexiswl
Copy link
Member

alexiswl commented Dec 9, 2024

Yes makes sense, happy for this to put all the way into the Sequence Run Manager

@victorskl victorskl added epic pipeline Workflow/Pipeline Manager labels Dec 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic pipeline Workflow/Pipeline Manager
Projects
None yet
Development

No branches or pull requests

3 participants