Large performance regression (compared to 0.21) for missing query cache #8668

Wumpf · 2025-01-13T15:37:23Z

This test scene alien_cake_addict.rrd.zip now plays (!) a lot slower.

On my Mac, Rerun reports 90ms of CPU time on 0.21 and 270ms on 92949cf.
When paused, it went from 50ms to 65ms (also bad, but not as crazy).

At first glance, it seems that the cost of not hitting the query cache has gone up considerably, but more investigation is needed.
This may be temporary due to the ongoing arrow/arrow2 conversion (ship-blocking in any case!).


emilk · 2025-01-13T19:37:51Z

Since this is a scene with a huge number of objects, I suspect the added overhead of arrow1<->arrow2 conversions is to blame. The conversions are always zero-copy, but usually still involve a few small allocations and pointer-chasing, so they are by no means free.

emilk · 2025-01-14T13:53:44Z

fn row_sliced is being called almost 9k times per frame, and is around 10µs slower than it was before (per call). I'm investigating why.

row_ids and time columns have both been switched to arrow1, but initial investigation indicates this is not what is slower. It could maybe be related to tagging (ComponentDescriptor)?
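As a rough sanity check, the call count and per-call slowdown quoted above can be multiplied out to see how much frame time they would explain. This is only back-of-the-envelope arithmetic using the approximate figures from this comment; `added_frame_cost` is a hypothetical helper, not part of Rerun.

```rust
use std::time::Duration;

/// Extra CPU time per frame caused by a fixed slowdown on every call.
/// Hypothetical helper for illustration only.
fn added_frame_cost(calls_per_frame: u32, extra_per_call: Duration) -> Duration {
    extra_per_call * calls_per_frame
}

fn main() {
    // Approximate figures from the comment: ~9k calls per frame, ~10µs slower per call.
    let extra = added_frame_cost(9_000, Duration::from_micros(10));
    println!("added cost per frame: {} ms", extra.as_millis());
}
```

With these inputs the result is about 90ms per frame. If the measurements are comparable, that would account for roughly half of the 180ms gap reported at the top of the issue (90ms to 270ms).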

emilk · 2025-01-14T14:28:56Z

This is super-weird. It is this piece of code that has become slower:

https://github.com/rerun-io/rerun/blob/0.21.0/crates/store/re_chunk/src/slice.rs#L110-L118

The thing is, the code hasn’t changed:

https://github.com/rerun-io/rerun/blob/main/crates/store/re_chunk/src/slice.rs#L109-L117

Nor have the types surrounding it. Maybe the new rustc optimizes worse? I think I need to git bisect 😭

emilk · 2025-01-14T14:33:11Z

This is the PR that regressed performance:

Port PendingRow to arrow-rs #8617

But why?

Specifically this commit:

78f676f

I wonder if the arrow2 -> arrow -> arrow2 conversion adds a null-array or something… 🤔

I'm not 100% sure why this happens. It looks like roundtripping `ListArray` marks the inner datatype field as nullable, even when it wasn't originally. I'm taking a look at fixing this in `re_arrow2`, but I wanted to open this quick-fix in the meantime.

* Closes #8668
* Introduced in PR #8617
* Introduced in commit 78f676f
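To illustrate the kind of mismatch described above, here is a minimal sketch using toy types. `Field`, `DataType`, `list_of_f32` and `lossy_roundtrip` are hypothetical stand-ins written for this example; they are not the real arrow or `re_arrow2` types, and the sketch does not claim to show why the mismatch made `row_sliced` slower.

```rust
/// Toy datatype model (NOT the real arrow types).
#[derive(Clone, Debug, PartialEq)]
enum DataType {
    Float32,
    List(Box<Field>),
}

/// Toy field: a named datatype plus a nullability flag.
#[derive(Clone, Debug, PartialEq)]
struct Field {
    name: String,
    data_type: DataType,
    nullable: bool,
}

/// Build a list-of-f32 datatype whose inner field has the given nullability.
fn list_of_f32(nullable: bool) -> DataType {
    DataType::List(Box::new(Field {
        name: "item".to_owned(),
        data_type: DataType::Float32,
        nullable,
    }))
}

/// Hypothetical round-trip that always marks the inner list field as nullable,
/// mimicking the behavior suspected in the description above.
fn lossy_roundtrip(dt: &DataType) -> DataType {
    match dt {
        DataType::List(inner) => DataType::List(Box::new(Field {
            nullable: true,
            ..(**inner).clone()
        })),
        other => other.clone(),
    }
}

fn main() {
    let original = list_of_f32(false);
    let roundtripped = lossy_roundtrip(&original);
    // The data is unchanged, but the datatypes no longer compare equal.
    println!("datatypes equal after round-trip: {}", original == roundtripped);
}
```

In this toy model the round-tripped datatype differs from the original only in the inner field's `nullable` flag, yet a plain equality check already treats them as different types.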

* Hoping to fix rerun-io/rerun#8668, but no luck so far

* Part of #3741
* [x] Tested that it does not regress #8668

This makes `TransportChunk` a wrapper around an arrow `RecordBatch`.

### Future work
* Remove `TransportChunk` and replace it with an extension trait on `RecordBatch`
* Simplify the dataframe API to always return a full `RecordBatch` (adding a schema to the rows is basically free now)

Wumpf added 🚀 performance Optimization, memory use, etc 🪳 bug Something isn't working labels Jan 13, 2025

Wumpf added this to the 0.22 - ? milestone Jan 13, 2025

Wumpf changed the title ~~Large performance regression for missing query cache~~ Large performance regression (compared to 0.21) for missing query cache Jan 13, 2025

Wumpf added the 🦟 regression A thing that used to work in an earlier release label Jan 13, 2025

emilk self-assigned this Jan 14, 2025

This was referenced Jan 14, 2025

Fix performance regression due to extra arrow round-tripping #8684

Merged

Improve ListArray arrow conversions rerun-io/re_arrow2#19

Merged

Wumpf closed this as completed in #8684 Jan 14, 2025

emilk added a commit to rerun-io/re_arrow2 that referenced this issue Jan 14, 2025

Improve ListArray arrow conversions (#19)

6c16571

* Hoping to fix rerun-io/rerun#8668, but no luck so far

emilk mentioned this issue Jan 16, 2025

Port TransportChunk to arrow-rs #8700

Merged

1 task
