-
Notifications
You must be signed in to change notification settings - Fork 421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pyarrow engine not supporting schema overwrite with Append mode #2654
Comments
👋 The Additionally I wanted to mention that:
this options introduces table corruption risk if there are ever two processes which try to concurrently modify the same Delta table. |
The reason for not using engine='rust' is we want to have this delta table mapped to Trino (for Grafana integration). But the mapping between delta table with rust and Trino was not compatible and throwing the below error. So we had to move to Pyarrow with which we could map to Trino. |
Also, you mentioned about storage_options introducing table corruption risk if there are ever two processes which try to concurrently modify the same Delta table so we initially had optimize.compact() on the delta table and this was causing the partition URL encoding to have spaces randomly (opened a github issue for this: #2634), could the 'storage_options' be associated to this random URL encoding with spaces? |
@rtyler Any update on the above comment for using the pyarrow engine for writing data. |
We are going to deprecate the pyarrow engine eventually, so schema evolution won't be supported there. If you are encountering issues with Trino when reading tables created by rust engine, then please create a separate issue with an MRE |
Environment
Delta-rs version: 0.17.4
Bug
We have a pandas df with 66 columns that is being written to a delta table with a pre-defined schema in pyarrow engine and now we have a new df with 67 columns, and the schema_mode = overwrite is not supported with 'append' mode.
Below is an basic example df of updating table schema with pyarrow engine and append mode: and the error we see while updating table schema
It would be great if we can have overwrite the existing schema of the delta table with append mode that does not effect the existing data. Can you please let me know if we can update schema while appending data?
The text was updated successfully, but these errors were encountered: