Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(11)

Issue 2641153003: Add MSA optimized ARGB/ABGR/BGRA/RGBA To Y/UV row functions (Closed)

Created:
3 years, 11 months ago by manojkumar.bhosale
Modified:
3 years, 10 months ago
Reviewers:
fbarchard1
CC:
gordana.cmiljanovic_imgtec.com, raghu.gandham_imgtec.com, parag.salasakar_imgtec.com, mandar.sahastrabuddhe_imgtec.com, rob.isherwood_imgtec.com
Target Ref:
refs/heads/master
Project:
libyuv
Visibility:
Public.

Description

Add MSA optimized ARGB/ABGR/BGRA/RGBA To Y/UV row functions R=fbarchard@google.com BUG=libyuv:634 Performance Gain (vs C auto-vectorized) ARGBToYJRow_MSA - ~3.2x ARGBToYJRow_Any_MSA - ~2.7x BGRAToYRow_MSA - ~3.2x BGRAToYRow_Any_MSA - ~2.7x ABGRToYRow_MSA - ~3.2x ABGRToYRow_Any_MSA - ~2.6x RGBAToYRow_MSA - ~3.1x RGBAToYRow_Any_MSA - ~2.7x ARGBToUVJRow_MSA - ~5.5x ARGBToUVJRow_Any_MSA - ~4.5x BGRAToUVRow_MSA - ~2.1x BGRAToUVRow_Any_MSA - ~2.0x ABGRToUVRow_MSA - ~2.1x ABGRToUVRow_Any_MSA - ~1.9x RGBAToUVRow_MSA - ~2.2x RGBAToUVRow_Any_MSA - ~1.9x Performance Gain (vs C non-vectorized) ARGBToYJRow_MSA - ~10.9x ARGBToYJRow_Any_MSA - ~9.2x BGRAToYRow_MSA - ~10.9x BGRAToYRow_Any_MSA - ~9.3x ABGRToYRow_MSA - ~11.0x ABGRToYRow_Any_MSA - ~9.3x RGBAToYRow_MSA - ~10.9x RGBAToYRow_Any_MSA - ~9.1x ARGBToUVJRow_MSA - ~12.4x ARGBToUVJRow_Any_MSA - ~10.5x BGRAToUVRow_MSA - ~4.7x BGRAToUVRow_Any_MSA - ~4.4x ABGRToUVRow_MSA - ~4.7x ABGRToUVRow_Any_MSA - ~4.5x RGBAToUVRow_MSA - ~4.8x RGBAToUVRow_Any_MSA - ~4.4x Committed: https://chromium.googlesource.com/libyuv/libyuv/+/54ce8f23d6d82d671ec05229812842548faf14ef

Patch Set 1 #

Total comments: 4

Patch Set 2 : Fixed loop increments #

Total comments: 4
Unified diffs Side-by-side diffs Delta from patch set Stats (+580 lines, -14 lines) Patch
M include/libyuv/row.h View 7 chunks +57 lines, -1 line 0 comments Download
M source/convert.cc View 3 chunks +48 lines, -0 lines 0 comments Download
M source/convert_from_argb.cc View 3 chunks +40 lines, -0 lines 0 comments Download
M source/planar_functions.cc View 1 chunk +8 lines, -0 lines 0 comments Download
M source/row_any.cc View 1 2 chunks +24 lines, -0 lines 0 comments Download
M source/row_msa.cc View 1 4 chunks +403 lines, -13 lines 4 comments Download

Messages

Total messages: 13 (3 generated)
manojkumar.bhosale
3 years, 11 months ago (2017-01-20 10:24:31 UTC) #1
manojkumar.bhosale
Updated with reviewers, cc list and performance gain numbers
3 years, 11 months ago (2017-01-20 10:27:12 UTC) #3
fbarchard1
https://codereview.chromium.org/2641153003/diff/1/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/1/source/row_msa.cc#newcode146 source/row_msa.cc:146: // Laods current and next row of ARGB input ...
3 years, 11 months ago (2017-01-24 00:15:47 UTC) #4
manojkumar.bhosale
https://codereview.chromium.org/2641153003/diff/1/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/1/source/row_msa.cc#newcode146 source/row_msa.cc:146: // Laods current and next row of ARGB input ...
3 years, 11 months ago (2017-01-25 08:35:42 UTC) #5
manojkumar.bhosale
Updated performance gains of following functions BGRAToUVRow_MSA BGRAToUVRow_Any_MSA ABGRToUVRow_MSA ABGRToUVRow_Any_MSA RGBAToUVRow_MSA RGBAToUVRow_Any_MSA
3 years, 11 months ago (2017-01-25 08:49:02 UTC) #7
fbarchard1
https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc#newcode2485 source/row_msa.cc:2485: src0 = (v16u8)__msa_ld_b((v16i8*)s, 0); can use READ_ARGB(s, t, vec0, ...
3 years, 10 months ago (2017-01-26 22:25:53 UTC) #8
manojkumar.bhosale
https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc#newcode2485 source/row_msa.cc:2485: src0 = (v16u8)__msa_ld_b((v16i8*)s, 0); On 2017/01/26 22:25:53, fbarchard1 wrote: ...
3 years, 10 months ago (2017-01-30 10:54:45 UTC) #9
fbarchard1
lgtm https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc#newcode2485 source/row_msa.cc:2485: src0 = (v16u8)__msa_ld_b((v16i8*)s, 0); Looks like this has ...
3 years, 10 months ago (2017-02-01 01:58:46 UTC) #10
manojkumar.bhosale
https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc File source/row_msa.cc (right): https://codereview.chromium.org/2641153003/diff/20001/source/row_msa.cc#newcode2485 source/row_msa.cc:2485: src0 = (v16u8)__msa_ld_b((v16i8*)s, 0); On 2017/02/01 01:58:46, fbarchard1 wrote: ...
3 years, 10 months ago (2017-02-01 04:48:41 UTC) #11
manojkumar.bhosale
3 years, 10 months ago (2017-02-01 04:59:06 UTC) #13
Message was sent while issue was closed.
Committed patchset #2 (id:20001) manually as
54ce8f23d6d82d671ec05229812842548faf14ef (presubmit successful).

Powered by Google App Engine
This is Rietveld 408576698