Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(224)

Issue 2559683002: Add MSA optimized remaining scale row functions (Closed)

Created:
4 years ago by manojkumar.bhosale
Modified:
4 years ago
Reviewers:
fbarchard1
CC:
gordana.cmiljanovic_imgtec.com, raghu.gandham_imgtec.com, parag.salasakar_imgtec.com, mandar.sahastrabuddhe_imgtec.com, rob.isherwood_imgtec.com
Target Ref:
refs/heads/master
Project:
libyuv
Visibility:
Public.

Description

Add MSA optimized remaining scale row functions R=fbarchard@google.com BUG=libyuv:634 Performance Gain (vs C vectorized) ScaleRowDown2_MSA - ~22.3x ScaleRowDown2_Any_MSA - ~19.9x ScaleRowDown2Linear_MSA - ~31.2x ScaleRowDown2Linear_Any_MSA - ~29.4x ScaleRowDown2Box_MSA - ~20.1x ScaleRowDown2Box_Any_MSA - ~19.6x ScaleRowDown4_MSA - ~11.7x ScaleRowDown4_Any_MSA - ~11.2x ScaleRowDown4Box_MSA - ~15.1x ScaleRowDown4Box_Any_MSA - ~15.1x ScaleRowDown38_MSA - ~1x ScaleRowDown38_Any_MSA - ~1x ScaleRowDown38_2_Box_MSA - ~1.7x ScaleRowDown38_2_Box_Any_MSA - ~1.7x ScaleRowDown38_3_Box_MSA - ~1.7x ScaleRowDown38_3_Box_Any_MSA - ~1.7x ScaleAddRow_MSA - ~1.2x ScaleAddRow_Any_MSA - ~1.15x Performance Gain (vs C non-vectorized) ScaleRowDown2_MSA - ~22.4x ScaleRowDown2_Any_MSA - ~19.8x ScaleRowDown2Linear_MSA - ~31.6x ScaleRowDown2Linear_Any_MSA - ~29.4x ScaleRowDown2Box_MSA - ~20.1x ScaleRowDown2Box_Any_MSA - ~19.6x ScaleRowDown4_MSA - ~11.7x ScaleRowDown4_Any_MSA - ~11.2x ScaleRowDown4Box_MSA - ~15.1x ScaleRowDown4Box_Any_MSA - ~15.1x ScaleRowDown38_MSA - ~3.2x ScaleRowDown38_Any_MSA - ~3.2x ScaleRowDown38_2_Box_MSA - ~2.4x ScaleRowDown38_2_Box_Any_MSA - ~2.3x ScaleRowDown38_3_Box_MSA - ~2.9x ScaleRowDown38_3_Box_Any_MSA - ~2.8x ScaleAddRow_MSA - ~8x ScaleAddRow_Any_MSA - ~7.46x Committed: https://chromium.googlesource.com/libyuv/libyuv/+/288bfbefb5f7966b56c7293c727c41aa674e86fb

Patch Set 1 #

Total comments: 4

Patch Set 2 : Fix merge conflicts with master #

Patch Set 3 : Changes as per review comments #

Total comments: 2

Patch Set 4 : Variable renaming and rework #

Total comments: 1
Unified diffs Side-by-side diffs Delta from patch set Stats (+579 lines, -25 lines) Patch
M include/libyuv/macros_msa.h View 1 2 4 chunks +38 lines, -25 lines 1 comment Download
M include/libyuv/scale_row.h View 2 chunks +73 lines, -0 lines 0 comments Download
M source/scale.cc View 4 chunks +52 lines, -0 lines 0 comments Download
M source/scale_any.cc View 4 chunks +47 lines, -0 lines 0 comments Download
M source/scale_msa.cc View 1 2 3 2 chunks +369 lines, -0 lines 0 comments Download

Messages

Total messages: 12 (4 generated)
manojkumar.bhosale
4 years ago (2016-12-07 09:06:12 UTC) #1
manojkumar.bhosale
Updated with reviewers, cc list and performance gain numbers
4 years ago (2016-12-07 09:10:20 UTC) #4
manojkumar.bhosale
4 years ago (2016-12-15 09:02:17 UTC) #6
fbarchard1
lgtm https://codereview.chromium.org/2559683002/diff/1/source/scale_msa.cc File source/scale_msa.cc (right): https://codereview.chromium.org/2559683002/diff/1/source/scale_msa.cc#newcode234 source/scale_msa.cc:234: vec0 = __msa_hadd_u_h(src0, src0); is hadd fast? on ...
4 years ago (2016-12-16 18:54:06 UTC) #7
manojkumar.bhosale
https://codereview.chromium.org/2559683002/diff/1/source/scale_msa.cc File source/scale_msa.cc (right): https://codereview.chromium.org/2559683002/diff/1/source/scale_msa.cc#newcode234 source/scale_msa.cc:234: vec0 = __msa_hadd_u_h(src0, src0); On 2016/12/16 18:54:06, fbarchard1 wrote: ...
4 years ago (2016-12-20 09:25:56 UTC) #8
manojkumar.bhosale
4 years ago (2016-12-20 09:28:16 UTC) #9
fbarchard1
lgtm https://codereview.chromium.org/2559683002/diff/60001/include/libyuv/macros_msa.h File include/libyuv/macros_msa.h (right): https://codereview.chromium.org/2559683002/diff/60001/include/libyuv/macros_msa.h#newcode147 include/libyuv/macros_msa.h:147: #define ST_UH(...) ST_H(v8u16, __VA_ARGS__) nit (not important) va_args ...
4 years ago (2016-12-20 18:39:17 UTC) #10
manojkumar.bhosale
4 years ago (2016-12-21 08:08:31 UTC) #12
Message was sent while issue was closed.
Committed patchset #4 (id:60001) manually as
288bfbefb5f7966b56c7293c727c41aa674e86fb (presubmit successful).

Powered by Google App Engine
This is Rietveld 408576698