Delta Lake

Delta Lake is a storage layer that brings scalable, ACID transactions to Apache Spark and other big-data engines.

See the Delta Lake Documentation for details.

See the Quick Start Guide to get started with Scala, Java and Python.

Latest Binaries

See the online documentation for the latest release.

API Documentation

Compatibility

Compatibility with Apache Spark Versions

See the online documentation for the releases and their compatibility with Apache Spark versions.

API Compatibility

There are two types of APIs provided by the Delta Lake project.

- Spark-based APIs - You can read and write Delta tables through the DataFrameReader/Writer (i.e., spark.read, df.write, spark.readStream and df.writeStream). Options to these APIs will remain stable within a major release of Delta Lake (e.g., 1.x.x).
- Direct Java/Scala/Python APIs - The classes and methods documented in the API docs are considered stable public APIs. All other classes, interfaces and methods that may be directly accessible in code are considered internal and are subject to change across releases.
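As an illustration of the Spark-based APIs, here is a minimal Python sketch that writes a small DataFrame in the `delta` format and reads it back. The function name `write_then_read` and the `path` argument are hypothetical; `spark` is assumed to be a SparkSession that has already been configured with the Delta Lake package, which this sketch does not do.

```python
def write_then_read(spark, path):
    """Write a tiny DataFrame as a Delta table at `path`, then read it back.

    Assumption: `spark` is a SparkSession already set up with Delta Lake.
    """
    # spark.range produces a single-column DataFrame named "id".
    df = spark.range(0, 5)

    # DataFrameWriter: select the "delta" format and overwrite any existing table.
    df.write.format("delta").mode("overwrite").save(path)

    # DataFrameReader: load the same location as a Delta table.
    return spark.read.format("delta").load(path)
```

Only the format name and the reader/writer entry points are used here, which are the parts the stability statement above covers.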

Data Storage Compatibility

Delta Lake guarantees backward compatibility for all Delta Lake tables (i.e., newer versions of Delta Lake will always be able to read tables written by older versions of Delta Lake). However, we reserve the right to break forward compatibility as new features are introduced to the transaction protocol (i.e., an older version of Delta Lake may not be able to read a table produced by a newer version).

Breaking changes in the protocol are indicated by incrementing the minimum reader/writer version in the Protocol action.
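To make the version check concrete, the sketch below parses one JSON line from a transaction log and compares the `minReaderVersion` and `minWriterVersion` fields of a Protocol action against the versions a client implements. The function name `check_protocol` and the supported version constants are assumptions for illustration, and the sketch ignores any other protocol fields; it is not how Delta Lake itself performs the check.

```python
import json

# Hypothetical versions implemented by this illustrative client.
SUPPORTED_READER_VERSION = 1
SUPPORTED_WRITER_VERSION = 2


def check_protocol(action_line,
                   reader_version=SUPPORTED_READER_VERSION,
                   writer_version=SUPPORTED_WRITER_VERSION):
    """Return read/write compatibility for one log line, or None if the
    line is not a Protocol action."""
    action = json.loads(action_line)
    protocol = action.get("protocol")
    if protocol is None:
        return None
    return {
        # A client can read only if it implements at least the minimum reader version.
        "can_read": protocol["minReaderVersion"] <= reader_version,
        # Likewise for writing and the minimum writer version.
        "can_write": protocol["minWriterVersion"] <= writer_version,
    }
```

For example, a table whose Protocol action raises `minReaderVersion` above what the client supports would yield `can_read: False`, which corresponds to the forward-compatibility break described above.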

Roadmap

For a detailed timeline, see the project roadmap.

Building

Delta Lake is compiled using SBT.

To compile, run

build/sbt compile

To generate artifacts, run

build/sbt package

To execute tests, run

build/sbt test

Refer to SBT docs for more commands.

Transaction Protocol

The Delta Transaction Log Protocol document provides a specification of the transaction protocol.

Requirements for Underlying Storage Systems

Delta Lake ACID guarantees are predicated on the atomicity and durability guarantees of the storage system. Specifically, we require the storage system to provide the following.

- Atomic visibility: There must be a way for a file to be visible in its entirety or not visible at all.
- Mutual exclusion: Only one writer must be able to create (or rename) a file at the final destination.
- Consistent listing: Once a file has been written in a directory, all future listings for that directory must return that file.
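The three requirements can be illustrated on a local POSIX-style filesystem. The sketch below is an assumption-laden example, not Delta Lake's storage implementation: `try_commit` and `commit_file_name` are hypothetical helpers. The payload is written in full to a temporary file and then published with `os.link`, which fails if the destination already exists, so a commit file is either fully visible or absent and only one writer can claim a given version.

```python
import os
import tempfile


def commit_file_name(version):
    # Zero-padded, 20-digit file name for a log entry.
    return f"{version:020d}.json"


def try_commit(log_dir, version, payload):
    """Try to publish `payload` (bytes) as the commit for `version`.

    Returns True if this writer created the file, False if another
    writer had already created it.
    """
    os.makedirs(log_dir, exist_ok=True)
    # Write the complete payload to a temporary file first.
    fd, tmp_path = tempfile.mkstemp(dir=log_dir, prefix=".tmp-")
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(payload)
            f.flush()
            os.fsync(f.fileno())
        try:
            # Hard-linking is atomic and refuses to replace an existing file:
            # this gives atomic visibility and mutual exclusion together.
            os.link(tmp_path, os.path.join(log_dir, commit_file_name(version)))
            return True
        except FileExistsError:
            return False
    finally:
        # The temporary name is no longer needed in either case.
        os.remove(tmp_path)
```

After a successful call, `os.listdir(log_dir)` includes the new commit file, which is the local analogue of consistent listing. Object stores and distributed filesystems differ in which of these guarantees they provide natively.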

See the online documentation on Storage Configuration for details.

Concurrency Control

Delta Lake ensures serializability for concurrent reads and writes. Please see Delta Lake Concurrency Control for more details.

Reporting issues

We use GitHub Issues to track community-reported issues. You can also contact the community to get answers.

Contributing

We welcome contributions to Delta Lake. See our CONTRIBUTING.md for more details.

We also adhere to the Delta Lake Code of Conduct.

License

Apache License 2.0, see LICENSE.

Community

There are two channels of communication within the Delta Lake community.

- Public Slack Channel
  - Register here
  - Log in here
- Public Mailing list
