Add Memcpy32 bench.
This compares 32-bit copies using memcpy, autovectorization, and, when SSE2 is
available, aligned and unaligned SSE2.
Running this on my desktop (Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz), I see
all four perform essentially the same, except Clang's autovectorization looks
a little better than GCC's. memcpy is calling libc 2.19's __memcpy_sse2_unaligned.
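
For readers without the patch open, here is a rough sketch of what the four variants could look like. The function names are hypothetical and this is not the code from the patch. In particular, the aligned variant assumes "aligned" means aligning the destination to 16 bytes and then using aligned stores with unaligned loads; the bench may do it differently.

```cpp
#include <cstdint>
#include <cstring>
#if defined(__SSE2__)
#include <emmintrin.h>
#endif

// All names below are illustrative, not taken from the patch.

// Variant 1: defer to libc.
static void copy32_memcpy(uint32_t* dst, const uint32_t* src, int count) {
    std::memcpy(dst, src, sizeof(uint32_t) * count);
}

// Variant 2: a plain loop the compiler is free to autovectorize.
static void copy32_loop(uint32_t* dst, const uint32_t* src, int count) {
    while (count-- > 0) {
        *dst++ = *src++;
    }
}

#if defined(__SSE2__)
// Variant 3: unaligned 128-bit loads and stores, four values at a time.
static void copy32_sse2_unaligned(uint32_t* dst, const uint32_t* src, int count) {
    while (count >= 4) {
        __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(src));
        _mm_storeu_si128(reinterpret_cast<__m128i*>(dst), v);
        src += 4;
        dst += 4;
        count -= 4;
    }
    while (count-- > 0) {  // Scalar tail, at most three values.
        *dst++ = *src++;
    }
}

// Variant 4 (assumed approach): copy scalars until dst is 16-byte aligned,
// then use aligned stores. src may still be unaligned, so loads stay unaligned.
static void copy32_sse2_aligned(uint32_t* dst, const uint32_t* src, int count) {
    while (count > 0 && (reinterpret_cast<uintptr_t>(dst) & 0xF) != 0) {
        *dst++ = *src++;
        count--;
    }
    while (count >= 4) {
        __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(src));
        _mm_store_si128(reinterpret_cast<__m128i*>(dst), v);
        src += 4;
        dst += 4;
        count -= 4;
    }
    while (count-- > 0) {  // Scalar tail, at most three values.
        *dst++ = *src++;
    }
}
#endif
```

All four take the same (dst, src, count) arguments, so a bench can swap them behind a single function pointer and time each over the same buffers.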
BUG=skia:
Committed: http://code.google.com/p/skia/source/detail?r=14799
Patch Set 1
Patch Set 2 : const loops is dumb
Patch Set 3 : admit defeat
Total comments: 2
Patch Set 4 : alpha
Messages
Total messages: 6 (0 generated)