Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Parquet row groups are meant to contain large numbers of rows. This helps amortize statistics, metadata, and IO overheads, and make the best use of dictionary encoding.
Currently, every call to ArrowWriter::write creates a new row group, which is unfortunate because small batches then produce small row groups.
Describe the solution you'd like
ArrowWriter should only close a row group once it exceeds the configured WriterProperties::max_row_group_size.
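
To make the requested behaviour concrete, here is a minimal sketch of the buffering logic. It does not use the parquet crate: `Batch`, `BufferedWriter`, and `demo` are hypothetical names invented for illustration, and only row counts are tracked. It also assumes that an incoming batch is split so that a row group never holds more than `max_row_group_size` rows, which is one possible reading of the proposal rather than a settled design.

```rust
/// Hypothetical stand-in for a record batch; only the row count matters here.
struct Batch {
    num_rows: usize,
}

/// Hypothetical writer that buffers rows across `write` calls and only
/// closes a row group once the configured limit is reached.
struct BufferedWriter {
    max_row_group_size: usize,
    buffered_rows: usize,
    /// Row counts of the row groups closed so far.
    closed_row_groups: Vec<usize>,
}

impl BufferedWriter {
    fn new(max_row_group_size: usize) -> Self {
        assert!(max_row_group_size > 0, "limit must be positive");
        Self {
            max_row_group_size,
            buffered_rows: 0,
            closed_row_groups: Vec::new(),
        }
    }

    /// Buffer the batch, closing a row group each time the limit is hit.
    /// Assumption: batches are split at the limit instead of overshooting it.
    fn write(&mut self, batch: &Batch) {
        let mut remaining = batch.num_rows;
        while remaining > 0 {
            let space = self.max_row_group_size - self.buffered_rows;
            let take = remaining.min(space);
            self.buffered_rows += take;
            remaining -= take;
            if self.buffered_rows == self.max_row_group_size {
                self.flush();
            }
        }
    }

    /// Close the current row group if it contains any rows.
    fn flush(&mut self) {
        if self.buffered_rows > 0 {
            self.closed_row_groups.push(self.buffered_rows);
            self.buffered_rows = 0;
        }
    }

    /// Flush the final partial row group and return all row group sizes.
    fn close(mut self) -> Vec<usize> {
        self.flush();
        self.closed_row_groups
    }
}

/// Write four small batches with a limit of 100 rows per row group.
fn demo() -> Vec<usize> {
    let mut writer = BufferedWriter::new(100);
    for n in [30, 50, 40, 10] {
        writer.write(&Batch { num_rows: n });
    }
    writer.close()
}
```

With the current behaviour described above, these four writes would yield four row groups. In the sketch they yield two: one full group of 100 rows and a final partial group of 30 rows that is flushed on close.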