Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Predicate Pushdown #101

Closed
rajasekarv opened this issue Mar 17, 2022 · 2 comments
Closed

Feature Request: Predicate Pushdown #101

rajasekarv opened this issue Mar 17, 2022 · 2 comments

Comments

@rajasekarv
Copy link

It will be hugely beneficial to have more abstract and more functional predicate pushdown filters while reading row groups and while reading individual records.

Ref: https://github.com/apache/parquet-mr/blob/master/parquet-column/src/main/java/org/apache/parquet/filter2/predicate/FilterPredicate.java

@rajasekarv rajasekarv changed the title Feature Reques: Predicate Pushdown Feature Request: Predicate Pushdown Mar 17, 2022
@jorgecarleitao
Copy link
Owner

Quite a coincidence!

We already support filter pushdown at row group level and page level.

The page level filter still requires reading the page header atm.

We added support to read bloom filters yesterday, and there is a PR to read column indexes and page indexes.

I am working on allowing to use these indexes to be used on our page iterators, as this requires piping things around.

Happy to hear ideas on how the API could look like for an end user.

@jorgecarleitao
Copy link
Owner

With #102 landed, all types of filter push-down in parquet's spec are fully supported in main. Closing as done.

Thanks again @rajasekarv for your interest.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants