x86jit: Perform vector transfers instead of flushing to memory #18234

unknownbrackets · 2023-09-25T02:17:45Z

This attempts to optimally retain values in registers even when converting to and from Vec4. For example, an lv.q; vdot.t; sv.q sequence previously caused the Vec4 to spill and each component to be loaded back, then to spill again and be reloaded as a Vec4 for the store.

With this change, the values are kept in registers and reassembled if possible. This results in a 1-2% FPS improvement in LittleBigPlanet, for example. Could use more testing in other games, though. Remains to be seen if it'll manage to make any difference on arm64, but that's not done yet...

At this point LBP is ~13% faster (in FPS) than with the old jit.

-[Unknown]
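To make the description above concrete, here is a minimal sketch of the idea in plain C++. It is not the JIT's code: `Vec4Reg`, `CountingMemory`, `ExtractLane`, `InsertLane`, and `LoadDotStore` are hypothetical names, the array stands in for a native 128-bit register, and writing the dot product into lane 3 of the same quad is an assumption made only so there is something to reassemble. The point is that the lv.q; vdot.t; sv.q sequence needs one quad load and one quad store when lanes are moved between registers instead of through memory.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical model of one native 128-bit register holding four float lanes.
using Vec4Reg = std::array<float, 4>;

// Hypothetical memory model that counts quad loads and stores.
struct CountingMemory {
    std::array<float, 8> data{};
    int loads = 0;
    int stores = 0;

    Vec4Reg LoadQuad(std::size_t base) {
        assert(base + 4 <= data.size());
        ++loads;
        Vec4Reg reg{};
        for (std::size_t i = 0; i < 4; ++i)
            reg[i] = data[base + i];
        return reg;
    }

    void StoreQuad(std::size_t base, const Vec4Reg &reg) {
        assert(base + 4 <= data.size());
        ++stores;
        for (std::size_t i = 0; i < 4; ++i)
            data[base + i] = reg[i];
    }
};

// Stand-in for a shuffle/extract: read one lane without touching memory.
inline float ExtractLane(const Vec4Reg &reg, std::size_t lane) {
    assert(lane < 4);
    return reg[lane];
}

// Stand-in for a blend/insert: replace one lane and keep the other three.
inline Vec4Reg InsertLane(Vec4Reg reg, std::size_t lane, float value) {
    assert(lane < 4);
    reg[lane] = value;
    return reg;
}

// Models lv.q; vdot.t; sv.q using register transfers only.
// Assumption: the dot product of lanes 0..2 with themselves goes to lane 3.
inline void LoadDotStore(CountingMemory &mem, std::size_t base) {
    Vec4Reg quad = mem.LoadQuad(base);        // lv.q
    float dot = 0.0f;
    for (std::size_t i = 0; i < 3; ++i) {     // vdot.t on lanes 0..2
        const float lane = ExtractLane(quad, i);
        dot += lane * lane;
    }
    quad = InsertLane(quad, 3, dot);          // reassemble the Vec4
    mem.StoreQuad(base, quad);                // sv.q
}
```

In this model the only memory traffic is the load and the store the guest code asked for; the per-component spills and reloads described above have no counterpart.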

unknownbrackets · 2023-09-26T05:38:14Z

It does seem like doing this on arm64 may have finally made a dent in arm64 FPS, but still not sure since a breeze can make it vary so much.

-[Unknown]

hrydgard · 2023-09-26T07:27:10Z

Indeed, it seems plausible that keeping stuff in registers is very important on ARM64: x86 CPUs are optimized for fast-ish spilling since they have to do it so much due to the low register count, while ARM64 doesn't really have that problem so I guess isn't as much...

unknownbrackets added 4 commits September 24, 2023 16:28

irjit: Add facility for native reg transfer.

88b6442

x64jit: Initial reg transfer.

d9f6bae

x86jit: Cleanup and refactor transfer.

46e704f

x86jit: Retain old lanes when there's space.

685d2ac

unknownbrackets added the x86jit x86/x64 JIT bugs label Sep 25, 2023

unknownbrackets added this to the v1.17.0 milestone Sep 25, 2023

x86jit: Prefer BLENDPS to INSERTPS.

38e5b33

It's faster, this performs better.
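For readers unfamiliar with the two instructions, the sketch below emulates their documented lane semantics in portable C++ and shows one case where they are interchangeable. The commit message does not say which cases were changed, so the same-lane case here is an assumption, and `Lanes`, `EmulateBlendPS`, `EmulateInsertPS`, `MakeInsertImm`, and `CanUseBlend` are hypothetical names rather than anything from the JIT.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Four float lanes, standing in for an XMM register.
using Lanes = std::array<float, 4>;

// Emulates SSE4.1 BLENDPS: lane i comes from src when bit i of imm is set,
// otherwise dst's lane is kept.
inline Lanes EmulateBlendPS(Lanes dst, const Lanes &src, std::uint8_t imm) {
    for (int i = 0; i < 4; ++i) {
        if (imm & (1 << i))
            dst[i] = src[i];
    }
    return dst;
}

// Emulates register-form SSE4.1 INSERTPS: imm[7:6] picks the source lane,
// imm[5:4] picks the destination lane, and imm[3:0] zeroes lanes afterwards.
inline Lanes EmulateInsertPS(Lanes dst, const Lanes &src, std::uint8_t imm) {
    const int srcLane = (imm >> 6) & 3;
    const int dstLane = (imm >> 4) & 3;
    dst[dstLane] = src[srcLane];
    for (int i = 0; i < 4; ++i) {
        if (imm & (1 << i))
            dst[i] = 0.0f;
    }
    return dst;
}

// Hypothetical helper: INSERTPS immediate for a single lane move, no zeroing.
inline std::uint8_t MakeInsertImm(int srcLane, int dstLane) {
    assert(srcLane >= 0 && srcLane < 4 && dstLane >= 0 && dstLane < 4);
    return static_cast<std::uint8_t>((srcLane << 6) | (dstLane << 4));
}

// Hypothetical check: a single lane move can be expressed as a blend only
// when the lane keeps its index, because BLENDPS cannot move lanes around.
inline bool CanUseBlend(int srcLane, int dstLane) {
    return srcLane == dstLane;
}
```

When `CanUseBlend` holds, `EmulateBlendPS(dst, src, 1 << lane)` and `EmulateInsertPS(dst, src, MakeInsertImm(lane, lane))` produce the same lanes, so either instruction would give a correct result and the choice comes down to speed.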

hrydgard merged commit 9fffa33 into hrydgard:master Sep 26, 2023

hrydgard modified the milestones: v1.17.0, v1.16.5 Sep 26, 2023

unknownbrackets deleted the x86-ir-transfer branch September 27, 2023 04:02
