-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[2Q2024] Update CXG conversion script [previously - Change cxg
conversion script to include uns
data]
#6659
Comments
Also Jason from the Lattice Team said that they expect to get 5had updated to conform with schema 4.2, but we'll still need to convert them to cxg (manually?) |
Thanks for the advanced ping, @tihuan. By 'collaborate', did you just mean some consultation and maybe PR review? The spatial work for Data Platform isn't scheduled to start until Mar 4, earliest. Also note:
|
Thanks so much for all the context, @metakuni ! I think mostly I just want to make sure that we don't do double work 😆 @kaloster has done some investigation on how to update the So maybe we should just talk about how we should update the conversion script when Data Platform is ready to work on it! |
@tihuan - Just FYI - Applications is responsible for the CXG validation and conversion per a decision that Arathi made prior to 4.0. |
Hey @brianraymor ! Ooh thanks for the headsup 💡 I see, so I should see who on the Applications team know more about validation and conversion, and go from there 😄 In that case, @metakuni sorry for the false ping! 🙏 |
No worries, @tihuan ! As for whom on the Apps team... I believe that person was @joyceyan. 😏 For the schema 4.0 migration, she worked on the Apps portion. |
One more thing worth noting: We'll have to coordinate when to merge and deploy your CXG conversion changes since that's part of the 5.1 schema, and we need to release all of those changes simultaneously. Hopefully, that won't complicate your development or testing schedules. |
Awesomee thanks for saving me time figuring out who I should reach out 😆 Okay! Yeah coordination sounds good! I heard that Jason from the Lattice Team said that all the h5ad files should be updated to conform with 5.1 by Match 29, but I suppose we'd still need to deploy the new 5.1 schema conversion script to convert h5ad to cxg on Data Portal, and trigger the data processing pipeline to make sure everything is up to date? |
Yes - Jason was updating me on their readiness for re-curation for platform planning purposes. The existing visium datasets will fail validation if they're not updated.
Absolutely. When we migrate the data corpus to a new schema version, we need to re-create all the artifacts like seurat downloads and CXG(s) because the underlying data has changed (new/modified/deprecated metadata fields, replaced/obsolete ontology terms, new dataset citations, etc.) |
@tihuan I'm intimately familiar with the CXG converter and also orchestrated a CXG remastering effort for the explorer performance improvements. Feel free to ping me as a consultant for this work if needed. |
Oh you're a savior, @atarashansky 🤩🙌 Thanks so much! We'll definitely need your help with this lol Can you help us with a quick rundown on the steps? So far the pieces I know are:
And the goal is to update the data portal datasets with the latest schema, so we can enable Explorer spatial mode I'm assuming the steps would be something like:
Something like that? I don't know if 5.1 schema is backward compatible, if not, it feels like more coordination will be needed lol CC: @kaloster @seve something we need to keep an eye on for rolling out the changes! |
Assigning this to me for now to drive the discussion with the schema team on potentially having them include uns data in the conversion script. Will re-assign back to Ronen, depending on the discusison outcome |
RE
The migration of of the corpus from 5.0 -> 5.1 is automated. Curators do not download, revise, and upload datasets. There is no manual trigger for CXG and Seurat conversions. We can review the process on the DP-Apps call. If you're curious, please see: https://docs.google.com/document/d/1QNWicEb_C7W5B-L6Lb9jbnX897nUnoamEdaebznB_jw/edit#heading=h.jcf27gbiblyg |
Reached out to Dan about the collab possibility |
Chatting with BrianR on Slack for help now! |
Sync'd with BrianR on this! He confirmed that the APS team is indeed responsible for updating and maintaining any APS needed fields in the schema. And once Dan is done with Schema 5.0 migration, he'll schedule time for a DPS/APS sync, so we can coordinate on Schema 5.1 rollout |
Putting this back to Disorder until DPT is ready for collab! |
cxg
conversion script to include uns
datacxg
conversion script to include uns
data
cxg
conversion script to include uns
datacxg
conversion script to include uns
data]
I've updated the PR title and description given the meeting yesterday. Thanks so much again for the thorough presentation and info, @brianraymor !! |
=== OLD TICKET DESCRIPTION === DoD:
|
CC: @kaloster please help review the ticket and see if anything else is needed. Thanks so much! |
Is there any additional processing/filtering required for these new
|
Oh great question! I might need @kaloster to help answer this question 😆 |
Hey team! Please add your planning poker estimate with Zenhub @kaloster @seve |
@seve gentle ping thank you! |
@kaloster just missing your vote now 😊 Thanks! |
@kaloster should we close this ticket in favor of #839 https://app.zenhub.com/workspaces/single-cell-5e2a191dad828d52cc78b028/issues/gh/chanzuckerberg/single-cell-explorer/839 ? Thank you! |
Context:
DoD:
The text was updated successfully, but these errors were encountered: