Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: ensure metadata cleanup do not corrupt tables without checkpoints #2044

Merged
merged 5 commits into from
Jan 7, 2024

Conversation

Blajda
Copy link
Collaborator

@Blajda Blajda commented Jan 6, 2024

Description

When metadata cleanup is executed on a delta table without checkpoints it will corrupt the table and prevent further loading. This is a high risk for people who use delta-rs since our writers do not automatically create checkpoints.

Related Issue(s)

@github-actions github-actions bot added binding/rust Issues for the Rust crate crate/core labels Jan 6, 2024
@Blajda
Copy link
Collaborator Author

Blajda commented Jan 6, 2024

If there is a desire to refactor this operation into DeltaOps then I'd be happy to accommodate that request too.

@ion-elgreco
Copy link
Collaborator

@Blajda maybe let's get the fix in first depending on how much time it takes to make it a DeltaOps, this seems quite severe, so we ideally should push a release soon

@Blajda
Copy link
Collaborator Author

Blajda commented Jan 7, 2024

@Blajda maybe let's get the fix in first depending on how much time it takes to make it a DeltaOps, this seems quite severe, so we ideally should push a release soon

Yes I think that's the best approach for this.

@github-actions github-actions bot added the binding/python Issues for the Python package label Jan 7, 2024
@Blajda Blajda marked this pull request as ready for review January 7, 2024 00:47
Copy link
Collaborator

@ion-elgreco ion-elgreco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Blajda could you add one more python test to cover the case where no checkpoint is added?

@ion-elgreco ion-elgreco enabled auto-merge (squash) January 7, 2024 01:43
@ion-elgreco ion-elgreco merged commit 25040b8 into delta-io:main Jan 7, 2024
23 checks passed
r3stl355 pushed a commit to r3stl355/delta-rs that referenced this pull request Jan 10, 2024
delta-io#2044)

# Description
When metadata cleanup is executed on a delta table without checkpoints
it will corrupt the table and prevent further loading. This is a high
risk for people who use delta-rs since our writers do not automatically
create checkpoints.

# Related Issue(s)
- closes delta-io#2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
binding/python Issues for the Python package binding/rust Issues for the Rust crate crate/core
Projects
None yet
Development

Successfully merging this pull request may close these issues.

No snapshot or version 0 found, perhaps /Users/watsy0007/resources/test_table/ is an empty dir?
2 participants