Description
Subzero: Add Non-SFI support for x86-32.
The basic model is that each translated function begins with a special "GotVar = getIP" instruction, and each ConstantRelocatable reference is changed to GotVar+ConstantRelocatable@GOTOFF (assuming GotVar is legalized into a physical register). The getIP instruction is late-lowered into:
call __Sz_getIP_<reg>
add <reg>, $_GLOBAL_OFFSET_TABLE_
mov GotVar, <reg>
Note that _GLOBAL_OFFSET_TABLE_ gets a special relocation type.
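The arithmetic behind the sequence above can be sketched as follows. This is only an illustration, assuming the "special relocation type" behaves like the standard i386 ELF R_386_GOTPC formula (GOT + A - P) and that @GOTOFF resolves like R_386_GOTOFF (S - GOT); the function names and addresses are made up for the example and are not Subzero code.

```python
# Illustrative arithmetic only; names and addresses are hypothetical.
MASK32 = 0xFFFFFFFF


def get_ip(call_addr, call_len=5):
    """Model `call __Sz_getIP_<reg>`: the helper hands back the return
    address, i.e. the address just past the 5-byte call instruction."""
    return (call_addr + call_len) & MASK32


def gotpc_immediate(got_addr, imm_addr, addend):
    """Assumed R_386_GOTPC-style value: GOT + A - P, where P is the
    address of the immediate field being patched."""
    return (got_addr + addend - imm_addr) & MASK32


def gotoff(sym_addr, got_addr):
    """Assumed R_386_GOTOFF-style value: S - GOT (addend taken as 0)."""
    return (sym_addr - got_addr) & MASK32


def resolve(call_addr, got_addr, sym_addr, imm_offset=2):
    """Walk through call / add / mov and then one GotVar+sym@GOTOFF use."""
    reg = get_ip(call_addr)              # call __Sz_getIP_<reg>
    imm_addr = reg + imm_offset          # the immediate sits inside the add
    # The addend compensates for the immediate not starting at the
    # return address, so the sum lands exactly on the GOT base.
    reg = (reg + gotpc_immediate(got_addr, imm_addr, imm_offset)) & MASK32
    got_var = reg                        # mov GotVar, <reg>
    return (got_var + gotoff(sym_addr, got_addr)) & MASK32
```

Because both relocation values are differences between addresses, the result does not depend on where the image is loaded, which is the point of the scheme: `resolve(call, got, sym)` yields `sym` for any consistent set of addresses.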
The register allocator takes GotVar uses into account, so GotVar is weighted appropriately when competing for a physical register.
If there are no uses of GotVar, the getIP instruction gets naturally dead-code eliminated. Special treatment is needed to prevent this elimination when the only GotVar uses are for (floating point) constant pool values from Phi instructions, since the Phi lowering with its GotVar legalization happens after the main round of register allocation.
The x86 mem operand now has an IsPIC field to indicate whether it has been PIC-legalized. Mem operands are sometimes legalized more than once, and this IsPIC field keeps GotVar from being added more than once.
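A minimal sketch of why such a flag makes legalization idempotent is shown here. The `MemOperand` class, its fields, and `legalize_pic` are hypothetical stand-ins for illustration and do not mirror Subzero's actual X86OperandMem interface.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class MemOperand:
    """Hypothetical model of an x86 memory operand."""
    base: Optional[str] = None
    index: Optional[str] = None
    offset: Optional[str] = None   # relocatable symbol name, if any
    is_pic: bool = False           # plays the role of the IsPIC field


def legalize_pic(mem, got_var="GotVar"):
    """Add GotVar and an @GOTOFF suffix exactly once per operand."""
    if mem.is_pic or mem.offset is None:
        return mem                 # already legalized, or nothing to fix up
    pic_offset = mem.offset + "@GOTOFF"
    if mem.base is None:
        return replace(mem, base=got_var, offset=pic_offset, is_pic=True)
    if mem.index is None:
        return replace(mem, index=got_var, offset=pic_offset, is_pic=True)
    raise ValueError("no free register slot for GotVar")
```

Without the flag, a second pass over an already-legalized operand would see a relocatable offset again and try to add GotVar a second time; with it, the second call returns the operand unchanged.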
We have to limit the aggressiveness of address mode inference, to make sure a register slot is left for the GotVar.
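The slot-counting idea can be sketched roughly as below. An x86 address has at most two register slots (base and index), so when the displacement is a relocatable that will later need GotVar, the inference may fold in at most one other register. The function name and its parameters are assumptions made for this illustration, not the actual Subzero inference code.

```python
def infer_address_mode(candidate_regs, has_relocatable, nonsfi):
    """Split candidate registers into those folded into the memory
    operand and those that must be combined by separate instructions.

    Hypothetical helper: x86 addressing offers base + index*scale + disp,
    i.e. two register slots in total.
    """
    slots = 2
    if nonsfi and has_relocatable:
        slots -= 1                 # keep one slot free for GotVar
    folded = list(candidate_regs[:slots])
    leftover = list(candidate_regs[slots:])
    return folded, leftover
```

In this model, a Non-SFI operand with a relocatable displacement folds only one of two candidate registers, leaving room for GotVar to be added during PIC legalization.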
The Subzero runtime has new asm files to implement all possible __Sz_getIP_<reg> helpers.
The szbuild.py script and its spec2k version (szbuild_spec2k.py) now support Non-SFI builds. Running spec2k depends on a pending change to the spec2k run_all.sh script.
Read-only data sections need to be named .data.rel.ro instead of .rodata because of PIC rules.
Most cross tests pass. The vector-type tests hit a problem that does not appear to be Subzero related, so most of them are disabled for now.
Still to do:
* Fix "--nonsfi --filetype=iasm". The llvm-mc assembler doesn't properly apply the _GLOBAL_OFFSET_TABLE_ relocation in iasm mode. Maybe I can find a different syntactic trick that works, or use hybrid iasm for this limited case.
BUG= https://bugs.chromium.org/p/nativeclient/issues/detail?id=4327
R=jpp@chromium.org
Committed: https://gerrit.chromium.org/gerrit/gitweb?p=native_client/pnacl-subzero.git;a=commit;h=8ff4b2819944bc4f02fb29204a1fa5ba7dea5682
Patch Set 1 : Initialize GotVar in prolog. Emit call to runtime helper for filetype=asm.
Patch Set 2 : Do PIC for mem operand legalization. Unfortunately this is subject to double legalization.
Patch Set 3 : Add FixupKind/RelocType explicitly to ConstantRelocatables
Patch Set 4 : Fix GOT legalization for MemOperands. Emit @GOTOFF suffix.
Patch Set 5 : Works as long as address mode inference isn't too aggressive
Patch Set 6 : Dial back address mode inference
Patch Set 7 : Checkpoint before redesigning this
Patch Set 8 : Take a different approach for relocations
Patch Set 9 : Basic "hello world" runs with -filetype-asm
Patch Set 10 : Add --nonsfi to the build script
Patch Set 11 : Validate X86OperandMem emission. spec2k builds.
Patch Set 12 : Fix GetIP placement bug. Update spec2k build script.
Patch Set 13 : Fix szbuild_spec2k.py --run for --sandbox and --nonsfi.
Patch Set 14 : Works for filetype=obj
Patch Set 15 : Minor cleanup
Patch Set 16 : Simplify getIP emission
Patch Set 17 : Cleanup
Patch Set 18 : Fill in part of the lit test
Total comments: 28
Patch Set 19 : Code review changes
Patch Set 20 : Fix some regressions. Remove crosstest.py --crosstest-bitcode.
Patch Set 21 : Cross tests
Patch Set 22 : Clean up some python ternary operator stuff
Patch Set 23 : Complete lit test. Improve address mode inference.
Patch Set 24 : Refactor the link commands
Total comments: 22
Patch Set 25 : Rebase
Patch Set 26 : Code review changes
Messages
Total messages: 12 (5 generated)