Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: Unclear error messages from Py4j tables crawler #3248

Open
1 task done
nfx opened this issue Nov 12, 2024 · 0 comments
Open
1 task done

[BUG]: Unclear error messages from Py4j tables crawler #3248

nfx opened this issue Nov 12, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@nfx
Copy link
Collaborator

nfx commented Nov 12, 2024

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

workflow is failing between these lines
https://github.com/databrickslabs/ucx/blob/main/src/databricks/labs/ucx/hive_metastore/tables.py#L526-L545
something related to shared state of spark session.

InternalError: failed-table-crawl: listing databases -> catalog : An error occurred while calling o400.sharedState. Trace: py4j.security.Py4JSecurityException: Method public org.apache.spark.sql.internal.SharedState 
org.apache.spark.sql.SparkSession.sharedState() is not whitelisted on class class org.apache.spark.sql.SparkSession at py4j.security.WhitelistingPy4JSecurityManager.checkCall(WhitelistingPy4JSecurityManager.java:473) at py4j.Gateway.invoke(Gateway.java:305) at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132) at py4j.commands.CallCommand.execute(CallCommand.java:79) at py4j.ClientServerConnection.waitForCommands(ClientServerConnection.java:199) at py4j.ClientServerConnection.run(ClientServerConnection.java:119) at java.lang.Thread.run(Thread.java:750)

Expected Behavior

clear error message that assessment workflow has to be run before fetching the list of tables. technically, we could also run the assessment job from within TablesCrawler.

Steps To Reproduce

No response

Cloud

AWS

Operating System

macOS

Version

latest via Databricks CLI

Relevant log output

No response

@nfx nfx added the bug Something isn't working label Nov 12, 2024
@nfx nfx added this to UCX (weekly) Nov 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: No status
Development

No branches or pull requests

1 participant