You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As you permute the rows, the values you get for min and max can change.
There may be a simpler example. But I started with 1.25 billion records, concatenated using scan_ipc of a wildcard/glob of 91 ipc files, with all calculations done in lazy mode. So it was quite the debugging process to nail down just what was going wrong. And these numbers seem to produce aberrant results. No idea why ...
The text was updated successfully, but these errors were encountered:
python 3.10.2
polars 0.13.9
Linux Mint 20.3
Describe your bug.
The
min
andmax
expressions yield the wrong results.What are the steps to reproduce the behavior?
This is a hard bug to reproduce (with simple values), but I found this data reproduces it.
Note that the minimum of
id
is 130352258, and the maximum is 130352833.Now let's run a simple query to calculate the minimum and the maximum...
The minimum calculated by the query is correct, but the maximum is not.
Now, let's sort the data and repeat the query.
Now we get the correct results.
Now let's permute the rows slightly, and repeat the process ...
This time, the calculated maximum is correct .. but the minimum is not.
However, sorting the data once again yields the correct results.
As you permute the rows, the values you get for
min
andmax
can change.There may be a simpler example. But I started with 1.25 billion records, concatenated using
scan_ipc
of a wildcard/glob of 91 ipc files, with all calculations done in lazy mode. So it was quite the debugging process to nail down just what was going wrong. And these numbers seem to produce aberrant results. No idea why ...The text was updated successfully, but these errors were encountered: