-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(ingestion/tableau): Fix tableau custom sql lineage gap #10359
fix(ingestion/tableau): Fix tableau custom sql lineage gap #10359
Conversation
assert mcp.entityUrn == expected_entity_urn | ||
|
||
csql_urn = "urn:li:dataset:(urn:li:dataPlatform:tableau,09988088-05ad-173c-a2f1-f33ba3a13d1a,PROD)" | ||
expected_upstream_table = "urn:li:dataset:(urn:li:dataPlatform:bigquery,my_bigquery_project.invent_dw.UserDetail,PROD)" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we need to fix the urn lowercasing behavior here - it's not really clear to me why that changed to begin with
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We are considering bigquery as platform with case sensitive tables, hence urn contain table without lowercase.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok that's fine
@@ -1693,7 +1693,7 @@ def parse_custom_sql( | |||
) -> Optional["SqlParsingResult"]: | |||
database_info = datasource.get(c.DATABASE) or { | |||
c.NAME: c.UNKNOWN.lower(), | |||
c.CONNECTION_TYPE: "databricks", | |||
c.CONNECTION_TYPE: datasource.get(c.CONNECTION_TYPE) or "databricks", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think the default of "databricks" makes sense - that should be removed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just wondering if removal of default as "databricks" doesn't affect any other thing. Might have some reason behind adding default as "databricks" in PR #9838
Let me know if it still needs to remove.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes let's remove it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one last improvement, but otherwise looks good
@@ -1693,7 +1693,7 @@ def parse_custom_sql( | |||
) -> Optional["SqlParsingResult"]: | |||
database_info = datasource.get(c.DATABASE) or { | |||
c.NAME: c.UNKNOWN.lower(), | |||
c.CONNECTION_TYPE: "databricks", | |||
c.CONNECTION_TYPE: datasource.get(c.CONNECTION_TYPE) or "databricks", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes let's remove it
assert mcp.entityUrn == expected_entity_urn | ||
|
||
csql_urn = "urn:li:dataset:(urn:li:dataPlatform:tableau,09988088-05ad-173c-a2f1-f33ba3a13d1a,PROD)" | ||
expected_upstream_table = "urn:li:dataset:(urn:li:dataPlatform:bigquery,my_bigquery_project.invent_dw.UserDetail,PROD)" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok that's fine
Checklist