Rust writer not encoding correct URL for partitions in delta table #2634
Comments
Confirmed that this also seems to happen in 0.18.0. This breaks compatibility with our Trino connector, which we use to power dashboard visualizations. Ideally, Trino wouldn't care about spaces (it has a lot of other issues, like not being compatible with timezone-aware timestamps).
@ion-elgreco the issue we see here is with the PyArrow engine.
@ion-elgreco Any update on this issue?
@gprashmi sorry, I don't have time to look into it unfortunately.
@ion-elgreco do you have any other suggestions on how to get the correct URL encoding when optimizing the table?
I'll take a look at this.
I'm late in responding, but thank you so much for addressing this @ion-elgreco!
Thank you very much @ion-elgreco for addressing the issue.
Environment
Delta-rs version: 0.17.4
Bug
We write data to a Delta table using delta-rs with the PyArrow engine, with DayHour as the partition column. However, when we run optimize.compact() on the table, the partition URLs of the newly written files are not properly encoded: as shown in the image below, the new partition paths (for the .zstd.parquet files) contain literal spaces.
Could you please let me know how we can run optimize.compact() without ending up with partitions that contain spaces?
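To make the difference concrete, here is a minimal sketch of what I mean by an encoded versus an unencoded partition path. It uses only Python's standard library and is not the delta-rs implementation; the helper `partition_path` and the sample value `2024-06-25 13` are hypothetical, chosen only because a DayHour value with a space between the day and the hour would reproduce the symptom.

```python
from urllib.parse import quote, unquote


def partition_path(column: str, value: str, encode: bool = True) -> str:
    """Build a Hive-style partition directory name such as ``DayHour=...``.

    Hypothetical helper for illustration only; not part of delta-rs.
    """
    # quote() percent-encodes every character outside its safe set,
    # so a space in the partition value becomes %20.
    rendered = quote(value, safe="") if encode else value
    return f"{column}={rendered}"


# Assumed sample value: a day and an hour separated by a space.
raw = partition_path("DayHour", "2024-06-25 13", encode=False)
encoded = partition_path("DayHour", "2024-06-25 13")

print(raw)      # path with a literal space, like the ones we see after compact
print(encoded)  # percent-encoded path, like the ones we expected

# Decoding the encoded value gives back the original partition value.
assert unquote(encoded.split("=", 1)[1]) == "2024-06-25 13"
```

As far as I can tell, the paths written on the initial write look like the second form and the paths written after optimize.compact() look like the first, which is what our Trino connector trips over.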