Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Caching #59

Closed
wants to merge 3 commits into from
Closed

Caching #59

wants to merge 3 commits into from

Conversation

StijnKas
Copy link
Collaborator

This PR adds caching of tables, currently implemented in the ADMDatamart class. Simply call save_data(path) which will save the model data and predictor data as .feather files to the path supplied in the function. The default scanning table names in ADMDatamart now also recognise these cached files.

Also, I changed the default importing behaviour by using Polars rather than PyArrow, and added logging during that importing function in cdh_utils.

@StijnKas
Copy link
Collaborator Author

StijnKas commented Dec 1, 2022

Running into some Polars issues, namely the fact that it can't handle Pega's OOTB scientific notation format when reading from json. I've filed an issue there - as reading with polars is quite a bit faster than the previous pandas-based method. So I think i'll wait for that to be resolved before continuing with this PR, especially since these are not essential changes.

@StijnKas
Copy link
Collaborator Author

StijnKas commented Dec 5, 2022

Underlying issue seems to be fixed in the backend json parser lib, so once that's propagated to Polars I can continue this work. Will probably be a few days still.

@StijnKas
Copy link
Collaborator Author

Decided to split this up into two separate PRs: #63 & #61, since #60 was causing too many merge conflicts. All items here have been included in those.

@StijnKas StijnKas closed this Dec 12, 2022
@StijnKas StijnKas deleted the caching branch December 12, 2022 14:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant