-
Notifications
You must be signed in to change notification settings - Fork 421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while writing Pandas DataFrame to Delta Lake (S3) #2051
Comments
@vinamrgrover please provide the deltalake version you're using, also share a minimal reproducible example |
Version : I already shared a reproducible example above |
@vinamrgrover a reproducible example should also include a sample dataframe.. |
You are likely passing something that is not a pandas dataframe, I can write a pandas dataframe with write_deltalake. import pandas as pd
from deltalake import write_deltalake
df = pd.DataFrame({'foo':['test']})
write_deltalake('test_table_PATH', data=df, mode='overwrite', overwrite_schema=True) |
Please don't mark it as completed, I better know I have passed Pandas DataFrame in |
write_deltalake
isn't working as expectedI encountered an error while calling
write_deltalake
method:ValueError : you must provide schema if data is iterable
Even though it worked perfectly with a PyArrow Table, it didn't work for Pandas DataFrame.
How to reproduce it:
The text was updated successfully, but these errors were encountered: