Description[arm64] Use SP-offset rather than FP-offset.
A64 loads and stores can have much larger positive than negative
immediate offsets, and since most frame slots are below fp, we can
significantly improve accesses by basing them on sp instead. Typical
example:
Before After
mov x16, #-416
str x20, [fp, x16] str x20, [jssp, #32]
Notable benchmark results include lua_binarytrees, which improves by
about 7.5% on A57 and 5% on A53. Several other asm.js benchmarks gain
2-4%.
Committed: https://crrev.com/dad635ee7ada527cf9d5a0c4317ae1a226876a38
Cr-Commit-Position: refs/heads/master@{#32111}
Patch Set 1 #Patch Set 2 : Rebase. #
Messages
Total messages: 20 (6 generated)
|