Spherical padding and faster tests #45

slevang · 2024-09-06T15:51:18Z

Spherical padding

This is a first attempt at automatic handling of the boundaries of a spherical domain: the source grid is padded appropriately, and its longitude coordinate is shifted to match the target grid when needed.
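
To make the idea concrete, here is a minimal NumPy sketch of the two steps on a toy global grid. It is only an illustration, not the code in this PR: `shift_longitude` and `pad_longitude` are hypothetical helper names, the grid is assumed to be global and evenly spaced, and only the longitude wrap is shown (the poles are not handled).

```python
import numpy as np

def shift_longitude(lon, data, target_min):
    """Re-express longitudes in [target_min, target_min + 360) and re-sort.

    Hypothetical helper: `data` is assumed to have longitude as its last axis.
    """
    shifted = (lon - target_min) % 360 + target_min
    order = np.argsort(shifted)
    return shifted[order], data[..., order]

def pad_longitude(lon, data, n=1):
    """Wrap n columns from each edge around to the opposite side."""
    lon_pad = np.concatenate([lon[-n:] - 360, lon, lon[:n] + 360])
    data_pad = np.concatenate([data[..., -n:], data, data[..., :n]], axis=-1)
    return lon_pad, data_pad

# Source grid on 0..360, target convention assumed to be -180..180.
lon = np.arange(0.0, 360.0, 90.0)            # 0, 90, 180, 270
data = np.array([[10.0, 20.0, 30.0, 40.0]])  # (lat, lon)

lon_s, data_s = shift_longitude(lon, data, target_min=-180.0)
lon_p, data_p = pad_longitude(lon_s, data_s, n=1)
```

After padding, the first and last columns repeat values from the opposite edge, so an interpolation or overlap computation near the longitude seam sees real neighbours instead of a hard boundary.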

Faster tests

I modified and cleaned up the existing tests, mainly by making the datasets dask-backed so that the regridding functions stay lazy for all the attribute checks. I also removed some unneeded copies, compute calls, etc. The tests now run in about 17 seconds.

With the addition of the spherical padding logic I was also able to remove some aspects of the tests where we avoided checking for good matches at the boundaries.

Conservative NaN tracking

I did some benchmarking, and the version of NaN tracking that also tracks independently across non-grid dimensions doesn't actually have much of a run-time penalty, so I went ahead and made this small change. The only issue is that it can induce a very large memory penalty if the input data isn't chunked, because the weights arrays can become huge if they are broadcast with all the other input dimensions. I added a note to the docstring about this but perhaps we want a more formal warning, or to explicitly add some chunking in certain cases?
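
For a rough sense of scale, the back-of-envelope below uses made-up grid sizes and assumes dense float64 weights broadcast against a single non-grid dimension (time). The function name is hypothetical and the actual arrays in the implementation may look different.

```python
BYTES_PER_FLOAT64 = 8

def broadcast_weights_bytes(n_source, n_target, n_other):
    """Bytes needed for a dense (n_other, n_target, n_source) float64 array."""
    return n_source * n_target * n_other * BYTES_PER_FLOAT64

# Hypothetical sizes: 720 source cells mapped to 360 target cells along one axis.
weights_only = broadcast_weights_bytes(720, 360, 1)
unchunked = broadcast_weights_bytes(720, 360, 1000)  # all 1000 time steps at once
per_chunk = broadcast_weights_bytes(720, 360, 10)    # time chunked in blocks of 10

print(f"weights alone:   {weights_only / 1e6:.1f} MB")
print(f"unchunked input: {unchunked / 1e9:.2f} GB")
print(f"10-step chunk:   {per_chunk / 1e6:.1f} MB")
```

Under these assumptions the broadcast array grows linearly with the size of the non-grid dimension, which is why chunking along that dimension keeps the peak footprint bounded.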

TODO / open questions

tests added
fix typing
add examples to docstrings
add args to make the padding behavior configurable?

BSchilperoort

Thanks for making these improvements!

It would probably be good to add a single workflow runner with micromamba, so that the comparison with ESMF will run on the CI. I could do this in a separate PR.

> The only issue is that it can induce a very large memory penalty if the input data isn't chunked, because the weights arrays can become huge if they are broadcast with all the other input dimensions. I added a note to the docstring about this but perhaps we want a more formal warning, or to explicitly add some chunking in certain cases?

This is probably quite unintuitive to users. Was the reason for removing this tracking mostly just to simplify the code? If it does allow for a much lower memory footprint (without users needing to improve their chunking), it could be interesting to leave the non-grid-dims tracking in.

> I modified and cleaned up the existing tests, mainly by making the datasets dask-backed so that the regridding functions stay lazy for all the attribute checks. I also removed some unneeded copies, compute calls, etc. The tests now run in about 17 seconds.

Awesome!

slevang · 2024-09-19T21:02:02Z

> This is probably quite unintuitive to users. Was the reason for removing this tracking mostly just to simplify the code? If it does allow for a much lower memory footprint (without users needing to improve their chunking), it could be interesting to leave the non-grid-dims tracking in.

It does simplify the code. But primarily, getting rid of the aggregation of the valid fraction over non-grid dims allows us to truly track isolated NaN fractions, so that you can have NaN in just a single time slice, for example, and still get the correct output. I think what I have now is the most sensible baseline option, but I have some other ideas for improving conservative performance, so I'll follow up in a different branch to avoid adding too much to this PR.
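
A toy NumPy version of the idea, for illustration only. This is not the code in the PR: the weights, the data, and the `min_valid_frac` threshold name are all made up.

```python
import numpy as np

# weights[i, j]: fraction of target cell i covered by source cell j.
weights = np.array([[0.5, 0.5, 0.0, 0.0],
                    [0.0, 0.0, 0.5, 0.5]])

# Two time slices; only the second has a NaN, in a single source cell.
data = np.array([[1.0, 3.0, 5.0, 7.0],
                 [1.0, np.nan, 5.0, 7.0]])  # (time, source)

min_valid_frac = 0.5  # hypothetical threshold for keeping a target cell

valid = ~np.isnan(data)
weighted_sum = np.where(valid, data, 0.0) @ weights.T  # (time, target)
valid_frac = valid.astype(float) @ weights.T           # (time, target)

# Renormalise by the valid fraction, guarding against division by zero.
safe = np.where(valid_frac > 0, valid_frac, 1.0)
result = np.where(valid_frac >= min_valid_frac, weighted_sum / safe, np.nan)
```

Because `valid_frac` keeps the time axis, the NaN in the second slice only affects that slice's first target cell. Aggregating the valid fraction over time would instead apply a single mask to every slice.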

slevang · 2024-09-19T21:10:38Z

I made a few small changes/refactors which I guess cancelled out your previous review.

BSchilperoort

Thanks for the improvements! 😄

initial pass at spherical padding, faster tests, full nan tracking

ce16f74

slevang requested a review from BSchilperoort September 6, 2024 15:51

slevang commented Sep 6, 2024

src/xarray_regrid/methods/conservative.py (resolved)

src/xarray_regrid/utils.py (resolved)

src/xarray_regrid/utils.py (outdated, resolved)

tests/test_regrid.py (resolved)

tests/test_regrid.py (resolved)

slevang added 2 commits September 6, 2024 16:33

fix netcdf bug

ba27714

revert to keeping all test slices

6446048

BSchilperoort reviewed Sep 9, 2024

tests/test_regrid.py (outdated, resolved)

BSchilperoort reviewed Sep 9, 2024

src/xarray_regrid/methods/conservative.py (outdated, resolved)

slevang added 2 commits September 13, 2024 14:17

refactor to separate coord handling functions, appease mypy

283c310

add examples to docstrings

7b44358

BSchilperoort reviewed Sep 17, 2024

src/xarray_regrid/utils.py (resolved)

src/xarray_regrid/utils.py (resolved)

tests/test_format.py (resolved)

tests/test_regrid.py (resolved)

src/xarray_regrid/utils.py (outdated, resolved)

BSchilperoort previously approved these changes Sep 17, 2024

slevang added 2 commits September 19, 2024 13:02

fix modifying coordinates

a5788bf

fix typing

52e71c5

slevang dismissed BSchilperoort’s stale review via 52e71c5 September 19, 2024 20:32

slevang requested a review from BSchilperoort September 19, 2024 21:10

BSchilperoort reviewed Sep 20, 2024

src/xarray_regrid/utils.py (resolved)

BSchilperoort reviewed Sep 20, 2024

src/xarray_regrid/methods/conservative.py (outdated, resolved)

review suggestions

1f2e999

BSchilperoort approved these changes Sep 20, 2024

BSchilperoort linked an issue Sep 20, 2024 that may be closed by this pull request

Slow test suite #30

Closed

slevang merged commit 9d71962 into main Sep 20, 2024
11 checks passed

BSchilperoort deleted the boundary-padding branch September 20, 2024 13:09

slevang mentioned this pull request Sep 24, 2024

Implement new "most common" regridder. #46

Merged

1 task
