CDC support in deltalog when writing delta table #2720
Comments
We already have CDC write support for update operations; @rtyler has started looking at MERGE.
That's awesome! Could you share some links?
It would be awesome to have it for MERGE!
According to the below, at least that's what Databricks claims it can do. I tried setting up a table with I'm not sure if my experiment was wrong, but do you know if this is supported?
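@waddahAldrobi It's probably something on your side; this code works fine:

from deltalake import DeltaTable
import polars as pl

df = pl.DataFrame({
    "foo": [1, 2], "bar": ['1', '2']
})

# Append the same frame twice with the change data feed enabled,
# so the table ends up with two commits (versions 0 and 1).
for i in range(2):
    df.write_delta('test_table', mode='append', delta_write_options={"configuration": {"delta.enableChangeDataFeed": "true"}, "engine": "rust"})

dt = DeltaTable("test_table")
# Load the change data feed and convert it to a Polars DataFrame.
pl.from_arrow(dt.load_cdf())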
shape: (4, 5)
┌─────┬─────┬──────────────┬─────────────────┬─────────────────────────┐
│ foo ┆ bar ┆ _change_type ┆ _commit_version ┆ _commit_timestamp       │
│ --- ┆ --- ┆ ---          ┆ ---             ┆ ---                     │
│ i64 ┆ str ┆ str          ┆ i64             ┆ datetime[ms]            │
╞═════╪═════╪══════════════╪═════════════════╪═════════════════════════╡
│ 1   ┆ 1   ┆ insert       ┆ 1               ┆ 2024-08-10 17:42:19.836 │
│ 2   ┆ 2   ┆ insert       ┆ 1               ┆ 2024-08-10 17:42:19.836 │
│ 1   ┆ 1   ┆ insert       ┆ 0               ┆ 2024-08-10 17:42:15.639 │
│ 2   ┆ 2   ┆ insert       ┆ 0               ┆ 2024-08-10 17:42:15.639 │
└─────┴─────┴──────────────┴─────────────────┴─────────────────────────┘
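
For anyone reading the output, here is a minimal sketch of how the change rows break down per commit. It uses only the standard library and hard-codes the four rows transcribed from the table printed in this comment (timestamps omitted). The names `cdf_rows` and `changes_per_commit` are illustrative and not part of any library. In a real pipeline the rows would come from the change data feed rather than a literal list.

```python
from collections import Counter

# Rows transcribed from the change data feed output shown in this comment:
# (foo, bar, _change_type, _commit_version). Timestamps are left out.
cdf_rows = [
    (1, "1", "insert", 1),
    (2, "2", "insert", 1),
    (1, "1", "insert", 0),
    (2, "2", "insert", 0),
]

# Count change rows per (commit version, change type).
changes_per_commit = Counter(
    (version, change_type) for _, _, change_type, version in cdf_rows
)

for (version, change_type), count in sorted(changes_per_commit.items()):
    print(f"version {version}: {count} {change_type} row(s)")
```

Each of the two append commits contributes two `insert` rows, which is what you would expect when the same two-row frame is appended twice.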
Thanks @ion-elgreco, this is what we needed! 🙏
Description
The current delta-rs version does not support CDC in deltalog. It says here that CDC support will be added in Version 4. What's the current timeline for this feature? How many months or years might it take to add support for CDC?
Use Case
We need to read CDC from the deltalog of a delta table written using delta-rs.
Related Issue(s)