Chromium Code Reviews
DescriptionSubzero. ARM32. Show FP lowering some love.
After some time of being neglected, this CL improves FP lowering for
ARM32.
1) It emits vpush {list}, and vpop {list} when possible.
2) It stops saving alised Vfp registers multiple times (yes, sz used to
save both D and S registers even when they aliased.)
3) Introduces Vmla (fp multiply and accumulate) and Vmls (multiply and
subtract.)
(1 + 2) minimally (but positively) affected SPEC.
(3) caused a 2% geomean improvement.
BUG= https://code.google.com/p/nativeclient/issues/detail?id=4076
R=stichnot@chromium.org
Committed: https://gerrit.chromium.org/gerrit/gitweb?p=native_client/pnacl-subzero.git;a=commit;h=eb13acc6221f60642d17473b6329a82b02ae557a
Patch Set 1 #Patch Set 2 : Enables fp Multiply and accumulate. #Patch Set 3 : Git pull; Painful merge; Fixes lit; Make format; #Patch Set 4 : Some refactoring. Adds a script for generating reg tables. #Patch Set 5 : Fixes formating in the new python script. #Patch Set 6 : #
Total comments: 56
Patch Set 7 : Addresses comments. #
Total comments: 7
Patch Set 8 : Handles comments; git pull; fixes lit tests. #
Messages
Total messages: 10 (3 generated)
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||