-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generalize object_store integration to all supported cloud providers #5959
Comments
This is running in some small issues here apache/arrow-rs#3419 |
Hi, awesome feature! I see that it has made it's way into the rust codebase already, is this the issue to watch for Python API support, or is there a separate ticket for that? |
In progress, no ticket. There were some limitations in the GCP interface on the object_store side, so I got sidetracked for the last 2 weekends. |
Gotcha, thanks for the update. This a super powerful/useful feature. Looking forward to trying it out on the Python side. |
Made a bit more progress on this, see #6426 |
This has been implemented. |
Problem description
The current integration of the
object_store
crate enables parquet download from S3. Generalize the integration so that parquet files can be downloaded from any of the supportedobject_store
cloud providers.Change the python API to defer to cloud download operations to the rust side for the supported cloud providers. The rationale for this is that the Polars planner can optimize the downloads in some important use cases:
/field=value/
)The text was updated successfully, but these errors were encountered: