Index: source/row_msa.cc
diff --git a/source/row_msa.cc b/source/row_msa.cc
index 6dd6f5f3b7a4c581ea02369f9cf1af4606a9bf98..29e913b53281436d979dbb85cb1902dee696fd27 100644
--- a/source/row_msa.cc
+++ b/source/row_msa.cc
@@ -37,6 +37,24 @@ void MirrorRow_MSA(const uint8* src, uint8* dst, int width) {
     src -= 64;
   }
 }
+
+void ARGBMirrorRow_MSA(const uint8* src, uint8* dst, int width) {
+  int count;
fbarchard1
2016/09/26 18:04:34
prefer int x for horizontal counts for consistency
+  v16u8 src0, src1, src2, src3;
+  v16u8 dst0, dst1, dst2, dst3;
+  v4i32 mask = { 3, 2, 1, 0 };
+
+  src += width * 4 - 64;
+
+  for (count = 0; count < width; count += 16) {
+    LD_UB4(src, 16, src3, src2, src1, src0);
+    VSHF_W4_UB(src0, src0, src1, src1, src2, src2, src3, src3,
+               mask, mask, mask, mask, dst0, dst1, dst2, dst3);

fbarchard1
2016/09/26 18:04:34
consider less unrolling. (measure performance)

+    ST_UB4(dst0, dst1, dst2, dst3, dst, 16);
+    dst += 64;
+    src -= 64;
+  }
+}
 #endif  // !defined(LIBYUV_DISABLE_MSA) && defined(__mips_msa)
 #ifdef __cplusplus
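
On fbarchard1's unrolling comment above: below is a minimal sketch of what a less-unrolled variant could look like for performance comparison, processing 4 ARGB pixels (16 bytes) per iteration instead of 16. The name ARGBMirrorRow_MSA_Sketch is hypothetical; the sketch assumes the single-vector LD_UB/ST_UB macros from libyuv's macros_msa.h (alongside the LD_UB4/ST_UB4 macros the patch already uses) and follows the reviewer's int x naming suggestion. This is illustrative only, not the patch's implementation.

// Hypothetical less-unrolled variant for measurement, not part of the patch.
// Assumes libyuv's macros_msa.h (LD_UB, ST_UB, v16u8, v4i32) and
// basic_types.h (uint8) are included, as in row_msa.cc.
void ARGBMirrorRow_MSA_Sketch(const uint8* src, uint8* dst, int width) {
  int x;
  v16u8 src0, dst0;
  v4i32 mask = { 3, 2, 1, 0 };  // reverse the four 32-bit pixel lanes

  src += width * 4 - 16;  // start at the last 4 pixels of the row

  for (x = 0; x < width; x += 4) {
    src0 = LD_UB(src);  // load 4 ARGB pixels
    dst0 = (v16u8)__msa_vshf_w(mask, (v4i32)src0, (v4i32)src0);
    ST_UB(dst0, dst);  // store them in reversed pixel order
    dst += 16;
    src -= 16;
  }
}

The patch's 4x-unrolled loop issues four independent load/shuffle/store chains per iteration, which can hide memory latency better; the sketch trades that for smaller code. As the reviewer notes, only measurement on the target core can settle which wins.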