IR: Add some interpreter-only IR instructions for faster interpretation #19262

hrydgard · 2024-06-07T20:03:15Z

Mainly just experimenting so far. There are some better wins to be had, but these are easy. These Opt* instructions will only be generated in the very last pass, and only for the IR interpreter - the IR Jits can do similar things themselves when generating native code and we don't want to have them handle more instructions.

part of #19143

Additionally, moves Downcount to the start of each block, saving us from one switch-dispatch per block execution.

Also adds a new optimization pass to take out some unnecessary loads after stores, seen in some games (likely some bad compiler..).

Overall, this seems to be a 5-7% speed boost. Possibly mostly due to the Downcount change...

hrydgard added 4 commits June 7, 2024 19:32

Specialize a few arithmetic instructions for the interpreter.

da88011

Add new IR optimization pass, OptimizeLoadsAfterStores

bd0beb6

Improve disasm

d1e0384

Create an IR op for a FPRtoGPR + shift-right-8, very common

0c24629

hrydgard added the IRInterpreter Occurs with IR Interpreter but not with another CPU backend. label Jun 7, 2024

hrydgard added this to the v1.18.0 milestone Jun 7, 2024

hrydgard marked this pull request as ready for review June 7, 2024 21:11

hrydgard merged commit 27815c7 into master Jun 7, 2024
18 checks passed

hrydgard deleted the ir-specialization branch June 7, 2024 21:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IR: Add some interpreter-only IR instructions for faster interpretation #19262

IR: Add some interpreter-only IR instructions for faster interpretation #19262

hrydgard commented Jun 7, 2024 •

edited

Loading

IR: Add some interpreter-only IR instructions for faster interpretation #19262

IR: Add some interpreter-only IR instructions for faster interpretation #19262

Conversation

hrydgard commented Jun 7, 2024 • edited Loading

hrydgard commented Jun 7, 2024 •

edited

Loading