MGI xrefs failing GO checks #408
@pgaudet I've temporarily put this in the "low-hanging fruit" project in the spec and prioritize section.
From db.xrefs file:
The code validates against the id_syntax field. A new entry has to be added with
@mugitty Hm. I think that entry is actually correct. If you look at something like SGD or WB, you can see that they match in intention, even though MGI is a special case. (The field names
@kltm, the code is using id_syntax, which is specified in https://github.com/geneontology/go-site/blob/master/metadata/db-xrefs.schema.yaml. If internal_id_syntax is to be used, then it has to be added to db-xrefs.schema.yaml. Currently, id_syntax does not match example_id. The code handles multiple id_syntax entries. It does not use example_id.
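As a rough illustration of what "validating against id_syntax, with multiple entries" could look like, here is a minimal sketch. It is not the ontobio code: the table layout, the function name `matches_id_syntax`, and the patterns themselves are assumptions made up for the example, not values copied from db-xrefs.yaml.

```python
import re

# Assumption: a prefix -> list of id_syntax patterns table, loosely modelled
# on db-xrefs.yaml. The patterns are illustrative, not the real metadata.
ASSUMED_ID_SYNTAX = {
    "MGI": [r"MGI:[0-9]{5,}"],
    "EXAMPLE": [r"[0-9]{4}", r"X[0-9]{6}"],  # hypothetical multi-entry database
}


def matches_id_syntax(prefix, local_id, table=ASSUMED_ID_SYNTAX):
    """Return True if local_id fully matches any pattern for prefix.

    Returns None when the prefix has no patterns, so callers can tell
    "unknown database" apart from "known database, bad identifier".
    """
    patterns = table.get(prefix)
    if patterns is None:
        return None
    return any(re.fullmatch(p, local_id) is not None for p in patterns)


print(matches_id_syntax("MGI", "MGI:1919005"))
print(matches_id_syntax("MGI", "1919005"))
```

Under these assumed patterns, the local identifier passes only if it still carries its own MGI: part, which is why the way the identifier is split before the check matters.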
@mugitty Yes, that is correct:
If the metadata is correct, we need to look at effecting the change we want, with the metadata we have, in the code. The MGI:MGI doubling has always caused problems...
@pgaudet, the internal representation of MGI was updated due to geneontology/go-site#91. I will update the code to handle what is in db-xrefs.yaml as well as the internal representation.
From @ValWood at pombase/pombase-chado#1224
@kltm's response:
Looking at https://github.com/geneontology/go-site/blob/master/metadata/rules/gorule-0000027.md . Okay, "soft" warning, so no data filtering.
The moment of failure is likely here:
https://github.com/biolink/ontobio/blob/master/ontobio/io/assocparser.py#L835
Special casing for MGI leading into it is:
https://github.com/biolink/ontobio/blob/master/ontobio/io/assocparser.py#L802-L806
So, it looks like MGI:MGI:1919005 would be clipped to MGI and 1919005, the latter of which would fail when checking against the regexp. The options here would be: MGI:MGI:MGI:1919005 (I know what the knock-on effect would be: hilarity)
Either way, @pgaudet, this is probably best approached as a GO QC bug for the moment (although a "light" one as no fix or filtering is done) and added to the QC worklist.
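The clipping problem described above can be reproduced with a small sketch. This is not the code at the linked assocparser.py lines; `clip_first_and_last`, `split_on_first_colon`, and the `ASSUMED_MGI_ID_SYNTAX` pattern are hypothetical stand-ins chosen to show why a doubled prefix can end up failing the regexp check.

```python
import re

# Assumption: the MGI local identifier is expected to keep its own "MGI:" part.
# The real pattern lives in db-xrefs.yaml and may differ.
ASSUMED_MGI_ID_SYNTAX = re.compile(r"MGI:[0-9]{5,}")


def clip_first_and_last(curie):
    """Hypothetical clipping: keep only the first and last colon-separated parts."""
    parts = curie.split(":")
    return parts[0], parts[-1]


def split_on_first_colon(curie):
    """Alternative: split once, so the local identifier keeps any inner colons."""
    prefix, local_id = curie.split(":", 1)
    return prefix, local_id


def is_valid_mgi_local_id(local_id):
    return ASSUMED_MGI_ID_SYNTAX.fullmatch(local_id) is not None


for splitter in (clip_first_and_last, split_on_first_colon):
    prefix, local_id = splitter("MGI:MGI:1919005")
    print(splitter.__name__, prefix, local_id, is_valid_mgi_local_id(local_id))
```

With the assumed pattern, the clipped form (MGI, 1919005) fails the check while the split-once form (MGI, MGI:1919005) passes, which matches the failure mode described in the comment.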