DescriptionAdd MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions
R=fbarchard@google.com
BUG=libyuv:634
Performance Gain (vs C vectorized)
TransposeWx8_MSA - ~2.7x
TransposeWx8_Any_MSA - ~2.1x
TransposeUVWx8_MSA - ~2.5x
TransposeUVWx8_Any_MSA - ~2.7x
Performance Gain (vs C non-vectorized)
TransposeWx8_MSA - ~4.6x
TransposeWx8_Any_MSA - ~2.9x
TransposeUVWx8_MSA - ~4.4x
TransposeUVWx8_Any_MSA - ~3.7x
Committed: https://chromium.googlesource.com/libyuv/libyuv/+/6fa5e4eb780b67fe3275a529c6c0da9ea7b58cff
Patch Set 1 #
Total comments: 9
Patch Set 2 : Changes as per review comments #
Messages
Total messages: 16 (4 generated)
|