Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: add page on why to use delta lake #2076

Merged
merged 6 commits into from
Jan 18, 2024
Merged

Conversation

MrPowers
Copy link
Contributor

This documentation page explains why to use Delta Lake.

Let me know if you have any suggestions on key features I missed.

I also added some tabbed code blocks to show how to create tabbed snippets for python & Polars!

Copy link

ACTION NEEDED

delta-rs follows the Conventional Commits specification for release automation.

The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification.

@MrPowers MrPowers changed the title Add documentation page on why to use Delta Lake docs: add page on why to use delta lake Jan 14, 2024
@ion-elgreco
Copy link
Collaborator

@MrPowers can you resolve the merge conflict?

* They are executed in a serial manner and don’t conflict with other transactions
* They don’t corrupt a table or violate table constraints

Data lakes don’t support transactions, so the write operations are dangerous:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bit of weird sentence to read, maybe "Data lakes don't support transactions with the following characteristics"

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, updated this.


Delta tables can be queried with a variety of different languages. This project provides APIs for Rust and Python users and does not depend on Java or Scala. This project is a great alternative for users like that Rust, pandas, Polars, DuckDB, or DataFusion.

The Delta Spark connector has APIs in Java, Scala, Python, and R.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe it can be confusing here, why we are talking about Delta Spark connector ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yea, good call, removed this.


## Support for many languages

Delta tables can be queried with a variety of different languages. This project provides APIs for Rust and Python users and does not depend on Java or Scala. This project is a great alternative for users like that Rust, pandas, Polars, DuckDB, or DataFusion.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"for users like that Rust" "that" seems not needed here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, removed this.

@rtyler rtyler enabled auto-merge (rebase) January 15, 2024 18:54
@MrPowers MrPowers requested a review from ion-elgreco January 16, 2024 16:29
@rtyler rtyler merged commit 2ae5be1 into delta-io:main Jan 18, 2024
23 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants