-
Notifications
You must be signed in to change notification settings - Fork 421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[discussion] move delta log handling to new struct DeltaLog
and harmonize operations
#661
Comments
Hi @roeap, I'm interested in performing the work of factoring out the vacuum code into a struct related to line item 2. It would also be good opportunity to add commit information since it is now supported. I like a |
@Blajda - obviously any contribuiton is highly welcome :). Currently I am working on #632, where I am thinking a bit about how to structure our codebase to more efficiently handle out table state / log... One thing that keeps on coming up is that it might be beneficial to move the object store and stare around, rather then the whole table. The proposed delta log would probably be more or less immutable (except for a cache maybe)... I guess what I am saying is, I'd be very keen on seeing an implementation for vacuum, but am not sure yet which way we end up agree an pursuing :). |
I think a re-oganization is a good idea. I'd like to see some examples of how usage would change, though. Since many operations depend on having an up-to-date |
You are very much correct. Most of that thinking evolved while working on #632. Essentially the I will try to finish an initial mergeable version of #632, and then try and explore some of the options we have ... |
Closing this, since discussions advanced quite a bit, and this is now obsolete. |
Description
As we add more and more high level operations (vacuum, optimize, writes, ....) our
DeltaTable
struct is continuously growing in complexity, and maybe it's time for us to have another look at our APIs.The main proposal would be
DeltaLog
struct. This would likely also make things like Support file index #528 easier.into_future
feature. Additionally using a builder pattern to configure the specific operation.DeltaTable
APIs could then look somewhat like the current pyspark apiswhat do you think? @houqp, @rtyler @xianwill @wjones127 @mosyp @fvaleye
The text was updated successfully, but these errors were encountered: