-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow overriding schema when reading/scanning JSON files #8279
Comments
Have you got the file that produced this error? |
Yes, working on a reproducible example with a minimal sample of the data that I can share. Thanks for answering! |
I should have some time to answer properly on this on this weekend, thank you for your patience. EDIT: I was super busy lately, I should have time this week to go back on this, thanks again. |
Heya, sorry for taking so long. I took some time today to work this out. 1. Do not use
|
Relates to pola-rs#8279. I'm not 100% sure about the Python schema type annotation, there are a few different variations in this file but this seems to make the most sense? Happy to adjust though.
Problem description
I tried opening a JSON line files with both
read_ndjson
andscan_ndjson
; even though I setinfer_schema_length=0
there was a reading error:@ghuls suggested to open a PR to request schema overriding when reading/scanning JSONL.
As a side note, I could not understand why the error was raised even though schema inference was disabled. Is this also a bug to fix? It seems related to #3942? Would love to help, but I don't really know Rust. I cannot share the data publicly but I guess I could ask if I could send the assignee a small sample to work with.
The text was updated successfully, but these errors were encountered: