Skip to content
This repository has been archived by the owner on Feb 18, 2024. It is now read-only.

Releases: jorgecarleitao/arrow2

v0.3.0

11 Aug 09:13
Compare
Choose a tag to compare

This is the first Rust implementation to support reading parquet on wasm, which opens up a whole new range of possibilities by allowing reading parquet directly on a browser without having to communicate data to a server.

This is also the first time an implementation of arrow in Rust offers native async support to read parquet, enabling consumers to perform ranged queries against blob storage without blocking. Check out an example here

Finally, this implementation now offers a Scalar API that we can build upon to support arrow's more exotic types such as maps and unions.

Thanks a lot to @sundy-li , @PsiACE , @ritchie46, @ghuls and @Dandandan for the contributions and discussions and to @Darksonn for all the patience and help that unblocked me when working on the async parquet support.

Full Changelog

Breaking changes:

  • Renamed sum to sum_primitive #273
  • Moved trait Index from array::Index to types::Index #272
  • Added optional projection to IPC FileReader #271
  • Added optional page_filter to parquet's RecordReader and get_page_iterator #270
  • Renamed parquets' CompressionCodec to Compression #269

New features:

Fixed bugs:

Enhancements:

  • Improve the performance in cast kernel using AsPrimitive trait in generic dispatch #252
  • Poor performance in sort::sort_to_indices with limit option in arrow2 #245
  • Support loading Feather v2 (IPC) files with more than 1 million tables #231
  • Migrated to parquet2 v0.3 #265 (jorgecarleitao)
  • Added more tests to cast and min/max #253 (jorgecarleitao)
  • Prettytable is unmaintained. Change to comfy-table #251 (PsiACE)
  • Added IndexRange to remove checks in hot loops #247 (jorgecarleitao)
  • Make merge_sort_slices MergeSortSlices public #243 (sundy-li)

Documentation updates:

Closed issues:

  • Allow projection pushdown to IPC files #261
  • Add support to write dictionary-encoded pages #211
  • Make IpcWriteOptions easier to find. #120

v0.2.0

30 Jul 12:28
Compare
Choose a tag to compare

Changelog

v0.2.0 (2021-07-30)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

  • Avoid materialization of indices in filter_record_batch for single arrays #234
  • Add integration tests for writing to parquet #80
  • Short-circuited boolean evaluation in GrowableList #228 (ritchie46)
  • Add extra inlining to speed up take #226 (Dandandan)
  • Removed un-needed unsafe #225 (jorgecarleitao)

Documentation updates:

  • Add documentation to guide #96
  • Add git submodule command to correct the test doc #223 (sundy-li)
  • Added badges to README #216 (sundy-li)
  • Clarified differences with arrow crate #210 (alamb)
  • Clarified differences with arrow crate #209 (alamb)

* This Changelog was automatically generated by github_changelog_generator