Skip to content
This repository has been archived by the owner on Feb 18, 2024. It is now read-only.

Writing compressed dictionary in parquet corrupts the files #667

Closed
jorgecarleitao opened this issue Dec 9, 2021 · 2 comments
Closed

Writing compressed dictionary in parquet corrupts the files #667

jorgecarleitao opened this issue Dec 9, 2021 · 2 comments
Labels
bug Something isn't working

Comments

@jorgecarleitao
Copy link
Owner

See pola-rs/polars#2009 .

@jorgecarleitao
Copy link
Owner Author

Minimal repro try to roundtrip the following:

        let keys = PrimitiveArray::<i64>::from_slice([0, 1, 2, 3, 4]);
        let values = Arc::new(Utf8Array::<i32>::from_slice(&[
            "AA", "AB", "AC", "AD", "AE",
        ]));
        let array = Arc::new(DictionaryArray::<i64>::from_data(keys, values)) as Arc<dyn Array>;

        Encoding::RleDictionary,
        WriteOptions {
            write_statistics: false,
            compression: Compression::Snappy,
            version: Version::V1,
        },

@jorgecarleitao jorgecarleitao changed the title Roundtrip of dictionary in parquet panics Roundtrip of compressed dictionary in parquet panics Dec 9, 2021
@jorgecarleitao
Copy link
Owner Author

Closed by jorgecarleitao/parquet2#72

@jorgecarleitao jorgecarleitao changed the title Roundtrip of compressed dictionary in parquet panics Writing compressed dictionary in parquet corrupts the files Dec 9, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant