This repository has been archived by the owner on Feb 18, 2024. It is now read-only.
Make Parquet read sync and async apis consistent #669
Labels
enhancement
An improvement to an existing feature
In v0.7.0 I could stream in pages of a Parquet column chunk in an async context, then move the data into a dedicated thread pool to perform the CPU-intensive work.
However, as of v0.8.0 page_iter_to_array has been replaced by column_iter_to_array while the async api does not expose a corresponding get_column_stream (only get_page_stream). Is there a better way to load and parse a parquet file from S3? Or, are APIs just out of sync?
The text was updated successfully, but these errors were encountered: