Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add .parquet files data preview #13

Closed
RandomFractals opened this issue May 28, 2019 · 14 comments
Closed

add .parquet files data preview #13

RandomFractals opened this issue May 28, 2019 · 14 comments
Labels
enhancement New feature or request

Comments

@RandomFractals
Copy link
Owner

see finos/perspective#342 for more info

@RandomFractals RandomFractals added the enhancement New feature or request label May 28, 2019
RandomFractals added a commit that referenced this issue Jun 1, 2019
compressed data file reads
@gbrueckl
Copy link

Any time frame when this will be available?
Would like to refer to your extension when it comes to viewing parquet files

@RandomFractals
Copy link
Owner Author

RandomFractals commented Feb 25, 2020

I have not looked into it in a while. Last time I checked nodejs parquet lib had some issues and required native modules which I can't support in vscode ...

@gbrueckl
Copy link

Looks like there is a new package to work with parquet in TypeScript
https://github.com/kbajalc/parquets

Would this work?

@RandomFractals
Copy link
Owner Author

that might work. I'll take it for a spin this weekend.

Thanks for linking me to it.

@gbrueckl
Copy link

gbrueckl commented Apr 9, 2020

any update to this?

@RandomFractals
Copy link
Owner Author

hoping to look more into this in fall.

@RandomFractals
Copy link
Owner Author

@gbrueckl good news. I tried parquets lib you suggested and looks like we can load one of the sample parquet data files now.

I'll see if I can find more parquet data files online and if it works with more examples, I'll package and ship this soon.

image

@JacekPliszka
Copy link

One small favour - could you add .pq extension for parquet as well?

@RandomFractals
Copy link
Owner Author

RandomFractals commented Mar 1, 2021

@gbrueckl thanks for the test files. I hope they get loaded b/c I tried some from https://github.com/apache/parquet-testing and did not have much luck with the data files from that repo due to encoding or shortcomings of the parquets and the underlying parquetjs lib we are using.

@JacekPliszka I'll map .pq extension to parquet data provider. Is that a standard short file extension for parquet files? I have not seen it before.

Let me know if you can get your parquet files to load. This release is super alpha and I am sure there is much more we'll need to tweak to make it work with snappy compression, etc.

@JacekPliszka
Copy link

@RandomFractals I think only .parquet is "standard" but you allow .parq which I've never seen while .pq you can find used by different people on the internet. Possibly because this is a standard import in Python:

import pyarrow.parquet as pq

@RandomFractals
Copy link
Owner Author

btw, @gbrueckl I tried parquet files you shared. None of them load. I get page dictionary errors. will have to dig into that more later.

@peder1001
Copy link

Any updates on whether you will support parquet?

@RandomFractals
Copy link
Owner Author

@peder1001 most likely in the new Tabular Data Viewer extension I started working on this year, which does include Tabulator and Perspective viewer. You can try it with CSV data files from marketplace for now:

https://marketplace.visualstudio.com/items?itemName=RandomFractalsInc.tabular-data-viewer

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants