Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SchemaError occurs during table optimisation after upgrade to v0.18.1 #2731

Closed
r1fad opened this issue Aug 5, 2024 · 4 comments
Closed

SchemaError occurs during table optimisation after upgrade to v0.18.1 #2731

r1fad opened this issue Aug 5, 2024 · 4 comments
Assignees
Labels
binding/rust Issues for the Rust crate bug Something isn't working

Comments

@r1fad
Copy link

r1fad commented Aug 5, 2024

Environment

Delta-rs version: v0.18.1

Binding: rust


Bug

What happened: In my project, there is a service that runs on a scheduled basis to compact and optimise delta tables. After upgrading delta-rs to v0.18.1 (from v0.17.1) we see an error saying Delta(Arrow { source: SchemaError("Could not find column keys") }). There is no column called 'keys' in our datasets so I am do not understand why we get this error. The error comes from deltalake-core-0.18.1/src/operations/cast.rs:150.

What you expected to happen:
No schema errors should be reported when upgrading to a new version of delta-rs.

How to reproduce it:
Not entirely sure how to reproduce it since we do not have a column called 'keys' in our datasets.

More details:
This error does not occur in the version we run in production which is v0.17.1 (commit hash 25962a0). The look back window on the compactor service is 2 days. So after 2 days of running the new version, we no longer see the SchemaError which leads me to believe that the v0.18.1 cannot optimise tables that contain data that was inserted by v0.17.1.

@r1fad r1fad added the bug Something isn't working label Aug 5, 2024
@r1fad r1fad changed the title SchemaError occurs during table optimization SchemaError occurs during table optimisation after upgrade to v0.18.1 Aug 5, 2024
@r1fad
Copy link
Author

r1fad commented Aug 6, 2024

@ion-elgreco how can I incorporate your fix into my project?

@ion-elgreco
Copy link
Collaborator

@ion-elgreco how can I incorporate your fix into my project?

You will have to wait for the next release when we bump kernel

@r1fad
Copy link
Author

r1fad commented Aug 6, 2024

@ion-elgreco any idea on when the next release of delta-rs is coming? I imagine this fix is important for quite a lot of people. And the change introduced earlier with the kernel is a breaking change.

I also noticed that after downgrading my project to v0.17.1, it can no longer optimise tables that contain data inserted by v0.18.1. This issue should disappear after 2 days though because lookback window is 2 days

@rtyler rtyler added the binding/rust Issues for the Rust crate label Aug 10, 2024
@ion-elgreco
Copy link
Collaborator

@r1fad somewhere today or this weekend it will be released

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
binding/rust Issues for the Rust crate bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants