Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(355)

Issue 1513443002: Optimize BlendPlane SIMD support for odd width. (Closed)

Created:
5 years ago by fbarchard1
Modified:
5 years ago
Base URL:
https://chromium.googlesource.com/libyuv/libyuv@master
Target Ref:
refs/heads/master
Project:
libyuv
Visibility:
Public.

Description

BlendPlane any width. Benchmark out\release\libyuv_unittest --libyuv_width=1279 --libyuv_height=719 --libyuv_repeat=999 --libyuv_flags=-1 --gtest_filter=*Blend* | sortms Was I420Blend_Any (2321 ms) I420Blend_Unaligned (1684 ms) I420Blend_Opt (1675 ms) I420Blend_Invert (1653 ms) BlendPlane_Invert (1556 ms) BlendPlane_Any (1552 ms) BlendPlane_Unaligned (1548 ms) BlendPlane_Opt (1535 ms) ARGBBlend_Unaligned (659 ms) ARGBBlend_Any (596 ms) ARGBBlend_Invert (591 ms) ARGBBlend_Opt (508 ms) BlendPlaneRow_Unaligned (186 ms) BlendPlaneRow_Opt (171 ms) Now ARGBBlend_Any (621 ms) ARGBBlend_Unaligned (585 ms) ARGBBlend_Invert (564 ms) ARGBBlend_Opt (512 ms) I420Blend_Unaligned (347 ms) I420Blend_Invert (345 ms) I420Blend_Any (337 ms) I420Blend_Opt (327 ms) BlendPlane_Unaligned (187 ms) BlendPlaneRow_Unaligned (187 ms) BlendPlane_Invert (186 ms) BlendPlane_Any (186 ms) BlendPlaneRow_Opt (173 ms) BlendPlane_Opt (171 ms) which is comparable to aligned case out\release\libyuv_unittest --libyuv_width=1280 --libyuv_height=720 --libyuv_repeat=999 --libyuv_flags=-1 --gtest_filter=*Blend* | sortms ARGBBlend_Any (625 ms) ARGBBlend_Unaligned (602 ms) ARGBBlend_Invert (508 ms) ARGBBlend_Opt (506 ms) I420Blend_Any (353 ms) I420Blend_Unaligned (322 ms) I420Blend_Invert (304 ms) I420Blend_Opt (301 ms) BlendPlaneRow_Unaligned (188 ms) BlendPlane_Unaligned (186 ms) BlendPlane_Invert (185 ms) BlendPlane_Any (184 ms) BlendPlaneRow_Opt (173 ms) BlendPlane_Opt (169 ms) R=dhrosa@google.com, harryjin@google.com BUG=libyuv:527 Committed: https://chromium.googlesource.com/libyuv/libyuv/+/a2ea90567998b1ab93ce7fe3acc25922862e4c9c

Patch Set 1 #

Total comments: 2

Patch Set 2 : for i420 check halfwidth is multiple of 32 bit #

Patch Set 3 : remove extraneous prototype #

Patch Set 4 : merge avx2 32 byte at time cl #

Patch Set 5 : bump libyuv version to identify alpha blender #

Unified diffs Side-by-side diffs Delta from patch set Stats (+12 lines, -8 lines) Patch
M README.chromium View 1 2 3 4 1 chunk +1 line, -1 line 0 comments Download
M include/libyuv/version.h View 1 2 3 4 1 chunk +1 line, -1 line 0 comments Download
M source/planar_functions.cc View 1 2 3 4 chunks +4 lines, -6 lines 0 comments Download
M source/row_any.cc View 1 2 1 chunk +6 lines, -0 lines 0 comments Download

Messages

Total messages: 8 (3 generated)
fbarchard1
5 years ago (2015-12-08 19:48:46 UTC) #1
Diony Rosa
lgtm https://codereview.chromium.org/1513443002/diff/1/source/planar_functions.cc File source/planar_functions.cc (right): https://codereview.chromium.org/1513443002/diff/1/source/planar_functions.cc#newcode689 source/planar_functions.cc:689: if (IS_ALIGNED(width, 32)) { You're also using BlendPlaneRow ...
5 years ago (2015-12-08 20:06:53 UTC) #3
fbarchard
https://codereview.chromium.org/1513443002/diff/1/source/planar_functions.cc File source/planar_functions.cc (right): https://codereview.chromium.org/1513443002/diff/1/source/planar_functions.cc#newcode689 source/planar_functions.cc:689: if (IS_ALIGNED(width, 32)) { On 2015/12/08 20:06:53, Diony Rosa ...
5 years ago (2015-12-08 20:09:47 UTC) #5
Diony Rosa
lgtm
5 years ago (2015-12-09 01:31:30 UTC) #6
fbarchard1
5 years ago (2015-12-09 02:59:52 UTC) #8
Message was sent while issue was closed.
Committed patchset #5 (id:80001) manually as
a2ea90567998b1ab93ce7fe3acc25922862e4c9c (presubmit successful).

Powered by Google App Engine
This is Rietveld 408576698