Description
Subzero: Add support for SSE4.1 instructions.
* Add initial support for code generation with SSE4.1 instructions. The
following operations are affected:
- multiplication with v4i32
- select
- insertelement
- extractelement
* Add appropriate lit checks for SSE4.1 instructions. Run the crosstests
in both SSE2 and SSE4.1 mode.
* Introduce the -mattr flag to llvm2ice to control which instruction set
gets used.
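For context on the v4i32 multiplication item in the list above: SSE4.1 provides `pmulld`, which multiplies four 32-bit lanes directly, while SSE2 only has `pmuludq`, which multiplies lanes 0 and 2 into two 64-bit products. The sketch below is a portable scalar model of both paths, not the code from this patch. The function names (`pmuludqModel`, `pmulldModel`, `mulV4i32Sse2Model`) are hypothetical, and the SSE2 sequence shown is the commonly used emulation, which may differ from the lowering Subzero actually emits.

```cpp
#include <array>
#include <cstdint>

// Hypothetical scalar models of the vector types involved.
using V4i32 = std::array<uint32_t, 4>;
using V2i64 = std::array<uint64_t, 2>;

// Models pmuludq (SSE2): multiplies lanes 0 and 2 as unsigned 32-bit
// values and keeps the full 64-bit products.
V2i64 pmuludqModel(const V4i32 &A, const V4i32 &B) {
  return {static_cast<uint64_t>(A[0]) * B[0],
          static_cast<uint64_t>(A[2]) * B[2]};
}

// Models pmulld (SSE4.1): lane-wise multiply keeping the low 32 bits of
// each product. Unsigned arithmetic wraps modulo 2^32, which matches the
// low bits of a signed two's complement multiply.
V4i32 pmulldModel(const V4i32 &A, const V4i32 &B) {
  V4i32 R{};
  for (int I = 0; I < 4; ++I)
    R[I] = A[I] * B[I];
  return R;
}

// Assumed SSE2 emulation: two pmuludq operations plus shuffles.
V4i32 mulV4i32Sse2Model(const V4i32 &A, const V4i32 &B) {
  // Products of the even lanes (0 and 2).
  V2i64 Even = pmuludqModel(A, B);
  // Move the odd lanes (1 and 3) into even positions, as a shuffle or
  // byte shift would do, then multiply again.
  V4i32 AOdd = {A[1], 0, A[3], 0};
  V4i32 BOdd = {B[1], 0, B[3], 0};
  V2i64 Odd = pmuludqModel(AOdd, BOdd);
  // Interleave the low 32 bits of each product back into lane order.
  return {static_cast<uint32_t>(Even[0]), static_cast<uint32_t>(Odd[0]),
          static_cast<uint32_t>(Even[1]), static_cast<uint32_t>(Odd[1])};
}
```

Both models should agree on every input, which is the same property the crosstests check when run in SSE2 and SSE4.1 mode.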
BUG=none
R=jvoung@chromium.org, stichnot@chromium.org
Committed: https://gerrit.chromium.org/gerrit/gitweb?p=native_client/pnacl-subzero.git;a=commit;h=0a45051
Patch Set 1
Patch Set 2 : Blend instructions require SSE4.1
Patch Set 3 : Preserve crosstest order.
Patch Set 4 : 1) Fix compilation 2) Fuse conditions in mul lowering to avoid code duplication
Total comments: 11
Patch Set 5 : Address comments, round 1
Total comments: 7
Patch Set 6 : Use syntactically compatible asserts
Patch Set 7 : Fix an empty line that was deleted
Messages
Total messages: 10 (0 generated)