Side by Side Diff: source/libvpx/vp8/common/arm/armv6/vp8_variance8x8_armv6.asm

Issue 181493009: libvpx: Pull from upstream (Closed) Base URL: svn://svn.chromium.org/chrome/trunk/deps/third_party/libvpx/
Patch Set: Created 6 years, 9 months ago
OLD NEW
1 ; 1 ;
2 ; Copyright (c) 2011 The WebM project authors. All Rights Reserved. 2 ; Copyright (c) 2011 The WebM project authors. All Rights Reserved.
3 ; 3 ;
4 ; Use of this source code is governed by a BSD-style license 4 ; Use of this source code is governed by a BSD-style license
5 ; that can be found in the LICENSE file in the root of the source 5 ; that can be found in the LICENSE file in the root of the source
6 ; tree. An additional intellectual property rights grant can be found 6 ; tree. An additional intellectual property rights grant can be found
7 ; in the file PATENTS. All contributing project authors may 7 ; in the file PATENTS. All contributing project authors may
8 ; be found in the AUTHORS file in the root of the source tree. 8 ; be found in the AUTHORS file in the root of the source tree.
9 ; 9 ;
10 10
(...skipping 33 matching lines...)
44 usub8 r9, r7, r6 ; calculate difference with reversed operands 44 usub8 r9, r7, r6 ; calculate difference with reversed operands
45 pld [r2, r3, lsl #1] 45 pld [r2, r3, lsl #1]
46 sel r8, r9, lr ; select bytes with negative difference 46 sel r8, r9, lr ; select bytes with negative difference
47 47
48 ; calculate partial sums 48 ; calculate partial sums
49 usad8 r6, r10, lr ; calculate sum of positive differences 49 usad8 r6, r10, lr ; calculate sum of positive differences
50 usad8 r7, r8, lr ; calculate sum of negative differences 50 usad8 r7, r8, lr ; calculate sum of negative differences
51 orr r8, r8, r10 ; differences of all 4 pixels 51 orr r8, r8, r10 ; differences of all 4 pixels
52 ; calculate total sum 52 ; calculate total sum
53 add r4, r4, r6 ; add positive differences to sum 53 add r4, r4, r6 ; add positive differences to sum
54 sub r4, r4, r7 ; substract negative differences from sum 54 sub r4, r4, r7 ; subtract negative differences from sum
55 55
56 ; calculate sse 56 ; calculate sse
57 uxtb16 r7, r8 ; byte (two pixels) to halfwords 57 uxtb16 r7, r8 ; byte (two pixels) to halfwords
58 uxtb16 r10, r8, ror #8 ; another two pixels to halfwords 58 uxtb16 r10, r8, ror #8 ; another two pixels to halfwords
59 smlad r5, r7, r7, r5 ; dual signed multiply, add and accumulate (1) 59 smlad r5, r7, r7, r5 ; dual signed multiply, add and accumulate (1)
60 60
61 ; 2nd 4 pixels 61 ; 2nd 4 pixels
62 ldr r6, [r0, #0x4] ; load 4 src pixels 62 ldr r6, [r0, #0x4] ; load 4 src pixels
63 ldr r7, [r2, #0x4] ; load 4 ref pixels 63 ldr r7, [r2, #0x4] ; load 4 ref pixels
64 smlad r5, r10, r10, r5 ; dual signed multiply, add and accumulate (2) 64 smlad r5, r10, r10, r5 ; dual signed multiply, add and accumulate (2)
65 65
66 usub8 r8, r6, r7 ; calculate difference 66 usub8 r8, r6, r7 ; calculate difference
67 add r0, r0, r1 ; set src_ptr to next row 67 add r0, r0, r1 ; set src_ptr to next row
68 sel r10, r8, lr ; select bytes with positive difference 68 sel r10, r8, lr ; select bytes with positive difference
69 usub8 r9, r7, r6 ; calculate difference with reversed operands 69 usub8 r9, r7, r6 ; calculate difference with reversed operands
70 add r2, r2, r3 ; set dst_ptr to next row 70 add r2, r2, r3 ; set dst_ptr to next row
71 sel r8, r9, lr ; select bytes with negative difference 71 sel r8, r9, lr ; select bytes with negative difference
72 72
73 ; calculate partial sums 73 ; calculate partial sums
74 usad8 r6, r10, lr ; calculate sum of positive differences 74 usad8 r6, r10, lr ; calculate sum of positive differences
75 usad8 r7, r8, lr ; calculate sum of negative differences 75 usad8 r7, r8, lr ; calculate sum of negative differences
76 orr r8, r8, r10 ; differences of all 4 pixels 76 orr r8, r8, r10 ; differences of all 4 pixels
77 77
78 ; calculate total sum 78 ; calculate total sum
79 add r4, r4, r6 ; add positive differences to sum 79 add r4, r4, r6 ; add positive differences to sum
80 sub r4, r4, r7 ; substract negative differences from sum 80 sub r4, r4, r7 ; subtract negative differences from sum
81 81
82 ; calculate sse 82 ; calculate sse
83 uxtb16 r7, r8 ; byte (two pixels) to halfwords 83 uxtb16 r7, r8 ; byte (two pixels) to halfwords
84 uxtb16 r10, r8, ror #8 ; another two pixels to halfwords 84 uxtb16 r10, r8, ror #8 ; another two pixels to halfwords
85 smlad r5, r7, r7, r5 ; dual signed multiply, add and accumulate (1) 85 smlad r5, r7, r7, r5 ; dual signed multiply, add and accumulate (1)
86 subs r12, r12, #1 ; next row 86 subs r12, r12, #1 ; next row
87 smlad r5, r10, r10, r5 ; dual signed multiply, add and accumulate (2) 87 smlad r5, r10, r10, r5 ; dual signed multiply, add and accumulate (2)
88 88
89 bne loop 89 bne loop
90 90
91 ; return stuff 91 ; return stuff
92 ldr r8, [sp, #32] ; get address of sse 92 ldr r8, [sp, #32] ; get address of sse
93 mul r1, r4, r4 ; sum * sum 93 mul r1, r4, r4 ; sum * sum
94 str r5, [r8] ; store sse 94 str r5, [r8] ; store sse
95 sub r0, r5, r1, ASR #6 ; return (sse - ((sum * sum) >> 6)) 95 sub r0, r5, r1, ASR #6 ; return (sse - ((sum * sum) >> 6))
96 96
97 pop {r4-r10, pc} 97 pop {r4-r10, pc}
98 98
99 ENDP 99 ENDP
100 100
101 END 101 END
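
For readers following the loop above, here is a rough C sketch of what the routine appears to compute. It is not the libvpx source. The function names, the argument order (source pointer, source stride, reference pointer, reference stride, pointer to sse) and the assumption that lr holds zero are inferred from the register comments in the visible lines; the 33 skipped lines are not shown and may differ. The first function is a plain scalar reference. The second mimics the packed steps: split the byte differences into positive and negative words (usub8 + sel), sum each word's bytes (usad8 against zero), merge them into absolute differences (orr), then widen and square-accumulate (uxtb16 + smlad).

```c
#include <stdint.h>
#include <string.h>

/* Scalar reference (hypothetical name): 8x8 variance as described by the
 * return comment in the diff, sse - ((sum * sum) >> 6). */
unsigned int variance8x8_ref(const uint8_t *src, int src_stride,
                             const uint8_t *ref, int ref_stride,
                             unsigned int *sse)
{
    int sum = 0;
    unsigned int sq = 0;
    for (int row = 0; row < 8; ++row) {
        for (int col = 0; col < 8; ++col) {
            int d = src[col] - ref[col];
            sum += d;
            sq += (unsigned int)(d * d);
        }
        src += src_stride;
        ref += ref_stride;
    }
    *sse = sq;
    return sq - ((unsigned int)(sum * sum) >> 6);
}

/* Sum of the four bytes of a word: what usad8 yields against a zero operand
 * (assumes lr == 0 in the assembly). */
static unsigned int sum_bytes(uint32_t w)
{
    return (w & 0xffu) + ((w >> 8) & 0xffu) + ((w >> 16) & 0xffu) + (w >> 24);
}

/* Sum of squares of the two halfwords of a word, as in smlad rX, rX. */
static unsigned int dual_square(uint32_t w)
{
    unsigned int lo = w & 0xffffu, hi = w >> 16;
    return lo * lo + hi * hi;
}

/* Packed model (hypothetical name): processes 4 pixels per 32-bit word,
 * two words per row, mirroring the structure of the loop in the diff. */
unsigned int variance8x8_packed(const uint8_t *src, int src_stride,
                                const uint8_t *ref, int ref_stride,
                                unsigned int *sse)
{
    int sum = 0;
    unsigned int sq = 0;
    for (int row = 0; row < 8; ++row) {
        for (int off = 0; off < 8; off += 4) {
            uint32_t s, r, pos = 0, neg = 0;
            memcpy(&s, src + off, 4);   /* load 4 src pixels */
            memcpy(&r, ref + off, 4);   /* load 4 ref pixels */
            for (int i = 0; i < 32; i += 8) {
                uint32_t a = (s >> i) & 0xffu, b = (r >> i) & 0xffu;
                if (a >= b)
                    pos |= (a - b) << i;   /* bytes with positive difference */
                else
                    neg |= (b - a) << i;   /* bytes with negative difference */
            }
            /* add positive differences, subtract negative differences */
            sum += (int)sum_bytes(pos) - (int)sum_bytes(neg);
            /* absolute differences of all 4 pixels */
            uint32_t absd = pos | neg;
            /* bytes 0 and 2, then bytes 1 and 3, widened to halfwords */
            sq += dual_square(absd & 0x00ff00ffu);
            sq += dual_square((absd >> 8) & 0x00ff00ffu);
        }
        src += src_stride;
        ref += ref_stride;
    }
    *sse = sq;
    return sq - ((unsigned int)(sum * sum) >> 6);
}
```

The shift by 6 corresponds to dividing by the 64 pixels of an 8x8 block. Because sum * sum is never negative and stays below 2^31 for 8-bit input, the arithmetic shift in the assembly and the unsigned shift used here give the same result.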