Issue 1642703003: starter procs for blending with pm4f

reed1

Description was changed from ========== starter procs for blending with pm4f BUG=skia: ========== to ========== ...

4 years, 10 months ago (2016-01-27 21:22:58 UTC) #1

reed1

Description was changed from ========== starter procs for blending with pm4f BUG=skia: GOLD_TRYBOT_URL= https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&issue=1642703003 ========== ...

4 years, 10 months ago (2016-01-27 21:23:24 UTC) #2

reed1

reed@google.com changed reviewers: + herb@google.com, mtklein@google.com

4 years, 10 months ago (2016-01-27 21:24:16 UTC) #3

reed1

just a starter set. - need to support coverage at some point - need to ...

4 years, 10 months ago (2016-01-27 21:24:16 UTC) #4

reed1

The CQ bit was checked by reed@google.com to run a CQ dry run

4 years, 10 months ago (2016-01-27 21:24:22 UTC) #5

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1642703003/1 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1642703003/1

4 years, 10 months ago (2016-01-27 21:24:29 UTC) #6

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 10 months ago (2016-01-27 21:25:19 UTC) #7

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: Build-Mac10.9-Clang-Arm7-Debug-iOS-Trybot on client.skia.compile (JOB_FAILED, http://build.chromium.org/p/client.skia.compile/builders/Build-Mac10.9-Clang-Arm7-Debug-iOS-Trybot/builds/1072) Build-Ubuntu-GCC-Mips-Debug-Android-Trybot on ...

4 years, 10 months ago (2016-01-27 21:25:19 UTC) #8

reed1

The CQ bit was checked by reed@google.com to run a CQ dry run

4 years, 10 months ago (2016-01-27 21:57:19 UTC) #9

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1642703003/20001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1642703003/20001

4 years, 10 months ago (2016-01-27 21:57:27 UTC) #10

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 10 months ago (2016-01-27 22:08:08 UTC) #11

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

4 years, 10 months ago (2016-01-27 22:08:09 UTC) #12

reed1

Description was changed from ========== starter procs for blending with pm4f curr/maxrss loops min median ...

4 years, 10 months ago (2016-01-28 20:40:23 UTC) #13

Description was changed from

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	91µs	91.2µs	193µs	563µs	89%	▁█▆▄▁▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   8/8   MB	3	151µs	151µs	152µs	154µs	1%	█▁▁▁▁▁▁▁▄▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   8/8   MB	2	289µs	290µs	315µs	504µs	21%	▂▁▁█▁▁▁▁▁▁	nonrendering	xfer4f_srcover_N_alpha_linear
   8/8   MB	1	496µs	496µs	496µs	497µs	0%	▁▂▂▂▂▂█▂▂▂	nonrendering	xfer4f_srcover_N_alpha_srgb
   8/8   MB	24	13.3µs	13.3µs	13.3µs	13.5µs	0%	▁█▁▁▁▁▁▁▁▁	nonrendering	xfer4f_srcover_1_opaque_linear
   8/8   MB	25	12.6µs	12.6µs	12.6µs	12.6µs	0%	█▄▃▂▁▂▁▂▁▃	nonrendering	xfer4f_srcover_1_opaque_srgb
   8/8   MB	2	175µs	175µs	176µs	182µs	1%	▁▁▁█▁▁▁▁▁▁	nonrendering	xfer4f_srcover_1_alpha_linear
   8/8   MB	1	479µs	479µs	479µs	479µs	0%	█▆▂▂▂▂▂▂▁▂	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...
==========

to

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	87.1µs	91µs	89.8µs	92µs	2%	▇▇▇▇█▇▅▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   9/9   MB	2	196µs	196µs	215µs	383µs	27%	▁▁▁▁█▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   9/9   MB	1	313µs	313µs	313µs	313µs	0%	▁▄▅▅▅▂████	nonrendering	xfer4f_srcover_N_alpha_linear
   9/9   MB	1	580µs	580µs	582µs	602µs	1%	▁▁▁▁▁▁▂▁▁█	nonrendering	xfer4f_srcover_N_alpha_srgb
   9/9   MB	23	13.1µs	13.1µs	13.1µs	13.1µs	0%	▆▄▄█▂▂▂▁▂▁	nonrendering	xfer4f_srcover_1_opaque_linear
   9/9   MB	23	13.2µs	13.2µs	13.2µs	13.2µs	0%	█▄▂▁▃▁▂▂▂▂	nonrendering	xfer4f_srcover_1_opaque_srgb
   9/9   MB	2	178µs	183µs	183µs	185µs	1%	▇▇▇█▇▇▇▇▇▁	nonrendering	xfer4f_srcover_1_alpha_linear
   9/9   MB	1	517µs	517µs	517µs	517µs	0%	▇█▄▃▄▁▂▁▂▄	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...
==========

reed1

The CQ bit was checked by reed@google.com to run a CQ dry run

4 years, 10 months ago (2016-01-28 20:40:30 UTC) #14

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1642703003/40001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1642703003/40001

4 years, 10 months ago (2016-01-28 20:40:36 UTC) #15

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 10 months ago (2016-01-28 20:43:10 UTC) #16

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: Build-Win-MSVC-x86_64-Debug-Trybot on client.skia.compile (JOB_FAILED, http://build.chromium.org/p/client.skia.compile/builders/Build-Win-MSVC-x86_64-Debug-Trybot/builds/5757)

4 years, 10 months ago (2016-01-28 20:43:11 UTC) #17

reed1

The CQ bit was checked by reed@google.com to run a CQ dry run

4 years, 10 months ago (2016-01-28 20:49:42 UTC) #18

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1642703003/60001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1642703003/60001

4 years, 10 months ago (2016-01-28 20:49:46 UTC) #19

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 10 months ago (2016-01-28 21:16:38 UTC) #21

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

4 years, 10 months ago (2016-01-28 21:16:39 UTC) #22

reed1

Description was changed from ========== starter procs for blending with pm4f curr/maxrss loops min median ...

4 years, 10 months ago (2016-01-29 13:22:11 UTC) #23

Description was changed from

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	87.1µs	91µs	89.8µs	92µs	2%	▇▇▇▇█▇▅▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   9/9   MB	2	196µs	196µs	215µs	383µs	27%	▁▁▁▁█▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   9/9   MB	1	313µs	313µs	313µs	313µs	0%	▁▄▅▅▅▂████	nonrendering	xfer4f_srcover_N_alpha_linear
   9/9   MB	1	580µs	580µs	582µs	602µs	1%	▁▁▁▁▁▁▂▁▁█	nonrendering	xfer4f_srcover_N_alpha_srgb
   9/9   MB	23	13.1µs	13.1µs	13.1µs	13.1µs	0%	▆▄▄█▂▂▂▁▂▁	nonrendering	xfer4f_srcover_1_opaque_linear
   9/9   MB	23	13.2µs	13.2µs	13.2µs	13.2µs	0%	█▄▂▁▃▁▂▂▂▂	nonrendering	xfer4f_srcover_1_opaque_srgb
   9/9   MB	2	178µs	183µs	183µs	185µs	1%	▇▇▇█▇▇▇▇▇▁	nonrendering	xfer4f_srcover_1_alpha_linear
   9/9   MB	1	517µs	517µs	517µs	517µs	0%	▇█▄▃▄▁▂▁▂▄	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...
==========

to

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	87.1µs	91µs	89.8µs	92µs	2%	▇▇▇▇█▇▅▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   9/9   MB	2	196µs	196µs	215µs	383µs	27%	▁▁▁▁█▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   9/9   MB	1	313µs	313µs	313µs	313µs	0%	▁▄▅▅▅▂████	nonrendering	xfer4f_srcover_N_alpha_linear
   9/9   MB	1	580µs	580µs	582µs	602µs	1%	▁▁▁▁▁▁▂▁▁█	nonrendering	xfer4f_srcover_N_alpha_srgb
   9/9   MB	23	13.1µs	13.1µs	13.1µs	13.1µs	0%	▆▄▄█▂▂▂▁▂▁	nonrendering	xfer4f_srcover_1_opaque_linear
   9/9   MB	23	13.2µs	13.2µs	13.2µs	13.2µs	0%	█▄▂▁▃▁▂▂▂▂	nonrendering	xfer4f_srcover_1_opaque_srgb
   9/9   MB	2	178µs	183µs	183µs	185µs	1%	▇▇▇█▇▇▇▇▇▁	nonrendering	xfer4f_srcover_1_alpha_linear
   9/9   MB	1	517µs	517µs	517µs	517µs	0%	▇█▄▃▄▁▂▁▂▄	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...

TBR=
landing now so these incremental types/functions can be used to collaborate with
herb's work. nothing is active at this point
==========

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1642703003/60001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1642703003/60001

4 years, 10 months ago (2016-01-29 13:22:25 UTC) #25

commit-bot: I haz the power

Description was changed from ========== starter procs for blending with pm4f curr/maxrss loops min median ...

4 years, 10 months ago (2016-01-29 13:23:01 UTC) #26

Message was sent while issue was closed.

Description was changed from

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	87.1µs	91µs	89.8µs	92µs	2%	▇▇▇▇█▇▅▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   9/9   MB	2	196µs	196µs	215µs	383µs	27%	▁▁▁▁█▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   9/9   MB	1	313µs	313µs	313µs	313µs	0%	▁▄▅▅▅▂████	nonrendering	xfer4f_srcover_N_alpha_linear
   9/9   MB	1	580µs	580µs	582µs	602µs	1%	▁▁▁▁▁▁▂▁▁█	nonrendering	xfer4f_srcover_N_alpha_srgb
   9/9   MB	23	13.1µs	13.1µs	13.1µs	13.1µs	0%	▆▄▄█▂▂▂▁▂▁	nonrendering	xfer4f_srcover_1_opaque_linear
   9/9   MB	23	13.2µs	13.2µs	13.2µs	13.2µs	0%	█▄▂▁▃▁▂▂▂▂	nonrendering	xfer4f_srcover_1_opaque_srgb
   9/9   MB	2	178µs	183µs	183µs	185µs	1%	▇▇▇█▇▇▇▇▇▁	nonrendering	xfer4f_srcover_1_alpha_linear
   9/9   MB	1	517µs	517µs	517µs	517µs	0%	▇█▄▃▄▁▂▁▂▄	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...

TBR=
landing now so these incremental types/functions can be used to collaborate with
herb's work. nothing is active at this point
==========

to

==========
starter procs for blending with pm4f

curr/maxrss	loops	min	median	mean	max	stddev	samples   	config	bench
   8/8   MB	4	87.1µs	91µs	89.8µs	92µs	2%	▇▇▇▇█▇▅▁▁▁	nonrendering	xfer4f_srcover_N_opaque_linear
   9/9   MB	2	196µs	196µs	215µs	383µs	27%	▁▁▁▁█▁▁▁▁▁	nonrendering	xfer4f_srcover_N_opaque_srgb
   9/9   MB	1	313µs	313µs	313µs	313µs	0%	▁▄▅▅▅▂████	nonrendering	xfer4f_srcover_N_alpha_linear
   9/9   MB	1	580µs	580µs	582µs	602µs	1%	▁▁▁▁▁▁▂▁▁█	nonrendering	xfer4f_srcover_N_alpha_srgb
   9/9   MB	23	13.1µs	13.1µs	13.1µs	13.1µs	0%	▆▄▄█▂▂▂▁▂▁	nonrendering	xfer4f_srcover_1_opaque_linear
   9/9   MB	23	13.2µs	13.2µs	13.2µs	13.2µs	0%	█▄▂▁▃▁▂▂▂▂	nonrendering	xfer4f_srcover_1_opaque_srgb
   9/9   MB	2	178µs	183µs	183µs	185µs	1%	▇▇▇█▇▇▇▇▇▁	nonrendering	xfer4f_srcover_1_alpha_linear
   9/9   MB	1	517µs	517µs	517µs	517µs	0%	▇█▄▃▄▁▂▁▂▄	nonrendering	xfer4f_srcover_1_alpha_srgb

BUG=skia:
GOLD_TRYBOT_URL=
https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&is...

TBR=
landing now so these incremental types/functions can be used to collaborate with
herb's work. nothing is active at this point

Committed:
https://skia.googlesource.com/skia/+/fbc1e296b2e98dc76de533a2bb45d9ccc8c2498f
==========

commit-bot: I haz the power

Committed patchset #4 (id:60001) as https://skia.googlesource.com/skia/+/fbc1e296b2e98dc76de533a2bb45d9ccc8c2498f

4 years, 10 months ago (2016-01-29 13:23:02 UTC) #27

mtklein

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp File bench/Xfer4fBench.cpp (right): https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp#newcode26 bench/Xfer4fBench.cpp:26: c.fVec[0] = 1; c.fVec[1] = 1; c.fVec[2] = 1; ...

4 years, 10 months ago (2016-01-29 15:23:00 UTC) #28

mtklein

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkXfer4f.cpp File src/core/SkXfer4f.cpp (right): https://codereview.chromium.org/1642703003/diff/60001/src/core/SkXfer4f.cpp#newcode55 src/core/SkXfer4f.cpp:55: s4 = s4 * Sk4f(255); I think if we ...

4 years, 10 months ago (2016-01-29 15:41:31 UTC) #29

reed1

all fixes to comments rolling into https://codereview.chromium.org/1634273002/ except that SkXfer4f.cpp is totally gone https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp File ...

4 years, 10 months ago (2016-01-29 22:35:27 UTC) #30

Message was sent while issue was closed.

all fixes to comments rolling into 

https://codereview.chromium.org/1634273002/

except that SkXfer4f.cpp is totally gone

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp
File bench/Xfer4fBench.cpp (right):

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp#n...
bench/Xfer4fBench.cpp:26: c.fVec[0] = 1; c.fVec[1] = 1; c.fVec[2] = 1; c.fVec[3] = 1;
On 2016/01/29 15:23:00, mtklein wrote:
> This can also be
>    SkPM4f c = {{ 1,1,1,1 }};

Done.

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp#n...
bench/Xfer4fBench.cpp:39: for (int i = 0; i < loops; ++i) {
On 2016/01/29 15:23:00, mtklein wrote:
> Generally, you can also write 
>   for (int i = 0; i < INNER_LOOPS*loops; ++i) {
>      ...
>   }

Ah, so you never pass 0 for loops?

https://codereview.chromium.org/1642703003/diff/60001/gm/xfer4f.cpp
File gm/xfer4f.cpp (right):

https://codereview.chromium.org/1642703003/diff/60001/gm/xfer4f.cpp#newcode30
gm/xfer4f.cpp:30: const SkPM4f src = SkPM4f::FromPMColor(SkPreMultiplyColor(c));
On 2016/01/29 15:23:00, mtklein wrote:
> Just had a thought that we might want to generally do this in the opposite
> order (-> float, then premul), so that the premultiply step does not lose
> precision.

Totally agree, was just being lazy since I already have a helper function to
premul the SkColor. Will change.
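
A minimal sketch of the precision point, not code from this patch: `PM4f`, `premul_then_float`, and `float_then_premul` are hypothetical names used only for illustration. Premultiplying in 8 bits rounds each product to a whole byte before the conversion, so small color-times-alpha values can collapse to zero; converting to float first keeps them.

```cpp
#include <cstdint>

// Hypothetical 4-float premultiplied color; a stand-in, not Skia's SkPM4f.
struct PM4f { float r, g, b, a; };

// Order A: premultiply in 8 bits, then widen to float.
// Each product is rounded to a whole byte before the conversion.
inline PM4f premul_then_float(uint8_t a, uint8_t r, uint8_t g, uint8_t b) {
    auto mul8 = [](uint8_t c, uint8_t alpha) {
        return static_cast<uint8_t>((c * alpha + 127) / 255);  // rounded c*alpha/255
    };
    return { mul8(r, a) / 255.0f, mul8(g, a) / 255.0f, mul8(b, a) / 255.0f, a / 255.0f };
}

// Order B: widen to float first, then premultiply in float.
// No intermediate byte rounding happens.
inline PM4f float_then_premul(uint8_t a, uint8_t r, uint8_t g, uint8_t b) {
    const float fa = a / 255.0f;
    return { (r / 255.0f) * fa, (g / 255.0f) * fa, (b / 255.0f) * fa, fa };
}
```

With alpha = 1 and red = 100, order A rounds the red product down to byte 0, while order B keeps a small nonzero value.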

https://codereview.chromium.org/1642703003/diff/60001/include/core/SkColor.h
File include/core/SkColor.h (right):

https://codereview.chromium.org/1642703003/diff/60001/include/core/SkColor.h#...
include/core/SkColor.h:175: float a() const { return fVec[3]; }
On 2016/01/29 15:23:00, mtklein wrote:
> Ahem, fVec[A];

Doh

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkColor.cpp
File src/core/SkColor.cpp (right):

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkColor.cpp#ne...
src/core/SkColor.cpp:145: bool SkPM4f::isUnit() const {
On 2016/01/29 15:23:00, mtklein wrote:
> If this is only used inside asserts, we may want to power this down to
> 
> #ifdef SK_DEBUG
>     void assertIsUnit() const;
> #else
>    void assertIsUnit() const {}
> #endif
> 
> allTrue() is the best way to do this sort of thing, but it's not exactly fast
> (especially on ARM).  Best to not use in hot code.

Gotcha. No idea if we'll need it for real, so for now I'll just make it a
debug-assert.
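
For illustration, the debug-only shape suggested above might look like the following. This is a sketch under assumptions: `DEMO_DEBUG` stands in for `SK_DEBUG`, `PM4f` is a hypothetical stand-in for `SkPM4f`, and "unit" is assumed to mean every component lies in [0, 1]. A plain scalar loop replaces `allTrue()` here.

```cpp
#include <cassert>

// Hypothetical switch standing in for SK_DEBUG: on whenever assert() is active.
#ifndef NDEBUG
    #define DEMO_DEBUG 1
#endif

// Hypothetical stand-in for SkPM4f.
struct PM4f {
    float fVec[4];

#ifdef DEMO_DEBUG
    // Debug builds: check that every component is in [0, 1] (assumed meaning of "unit").
    void assertIsUnit() const {
        for (float c : fVec) {
            assert(c >= 0.0f && c <= 1.0f);
        }
    }
#else
    // Release builds: an empty inline body, so hot code pays nothing.
    void assertIsUnit() const {}
#endif
};

// Example caller: validates in debug builds, then reads alpha (index 3).
inline float alpha_of(const PM4f& c) {
    c.assertIsUnit();
    return c.fVec[3];
}
```

Because the release variant is an empty inline function, callers can leave the `assertIsUnit()` call in place without wrapping each call site in its own `#ifdef`.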

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkPM4fPriv.h
File src/core/SkPM4fPriv.h (right):

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkPM4fPriv.h#n...
src/core/SkPM4fPriv.h:30: static inline Sk4f s2l(const Sk4f& s4) {
On 2016/01/29 15:23:00, mtklein wrote:
> Had to think about what these were for a second.  Might wanna expand some of
> these names out: sRGB_to_linear(), linear_to_sRGB().

Done.

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkPM4fPriv.h#n...
src/core/SkPM4fPriv.h:58: static Sk4f unit_to_l255_round(const SkPM4f& pm4) {
On 2016/01/29 15:23:00, mtklein wrote:
> I think this might be clearer if we start first with completely orthogonal
> helper functions, e.g.
> 
>     Sk4f to_4f(uint32_t);
>     uint32_t to_4b(Sk4f f255);
> 
>     Sk4f scale_255_to_1(Sk4f f255);
>     Sk4f scale_1_to_255(Sk4f f1);    
> 
>     Sk4f to_sRGB(Sk4f f1);
>     Sk4f to_linear(Sk4f f1);
> 
> then build the compound operations (sRGB 8888 -> linear unit float) out of
> those exclusively, either as more static helper functions after a big
> slashline or as little local lambdas.
>    

Agree, this is definitely a worktable of experimental helpers. Will work to
refine/reduce over time.
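
A scalar sketch of that layering, to make the idea concrete. Everything here is an assumption for illustration: `F4` is a `std::array` stand-in for `Sk4f`, the byte order in `to_4f` is arbitrary, and the sRGB curve is approximated by gamma 2.0 (square and square root) on the three color channels rather than the exact transfer function.

```cpp
#include <array>
#include <cmath>
#include <cstdint>

using F4 = std::array<float, 4>;  // hypothetical stand-in for Sk4f

// --- Orthogonal helpers: each does exactly one thing. ---

// Unpack four bytes (lowest byte first) into floats in [0, 255].
inline F4 to_4f(uint32_t p) {
    return { float(p & 0xFF), float((p >> 8) & 0xFF), float((p >> 16) & 0xFF), float(p >> 24) };
}

// Round floats in [0, 255] to bytes and pack them (lowest byte first).
inline uint32_t to_4b(const F4& f255) {
    uint32_t p = 0;
    for (int i = 0; i < 4; ++i) {
        p |= static_cast<uint32_t>(std::lround(f255[i])) << (8 * i);
    }
    return p;
}

inline F4 scale_255_to_1(F4 f255) { for (float& c : f255) c /= 255.0f; return f255; }
inline F4 scale_1_to_255(F4 f1)   { for (float& c : f1)   c *= 255.0f; return f1; }

// Gamma 2.0 approximation (an assumption); alpha at index 3 is left untouched.
inline F4 to_linear(F4 f1) { for (int i = 0; i < 3; ++i) f1[i] = f1[i] * f1[i];     return f1; }
inline F4 to_sRGB(F4 f1)   { for (int i = 0; i < 3; ++i) f1[i] = std::sqrt(f1[i]); return f1; }

// ////////////////////////////////////////////////////////////////////////
// Compound operations, built exclusively from the helpers above.

inline F4 sRGB_8888_to_linear_unit(uint32_t p) {
    return to_linear(scale_255_to_1(to_4f(p)));
}

inline uint32_t linear_unit_to_sRGB_8888(const F4& f1) {
    return to_4b(scale_1_to_255(to_sRGB(f1)));
}
```

Each compound reads as a pipeline, so a name like "sRGB 8888 -> linear unit float" maps directly onto the sequence of calls.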

https://codereview.chromium.org/1642703003/diff/60001/src/core/SkPM4fPriv.h#n...
src/core/SkPM4fPriv.h:94: static inline void SkPM4f_s32_src_mode(SkPMColor dst[], const SkPM4f src[], int count) {
On 2016/01/29 15:23:00, mtklein wrote:
> I think we're going to have to come up with a new abbreviation for sRGB 8888,
> or not abbreviate it.  I keep reading this as "uses 4 floats... 32-bit
> source... src mode".  "s32" screams 32-bit source to me.
> 
> Let's just write out _sRGB_ and _linear_ ?
> 
> We could probably name these 
>    SkPM4f_src_sRGB
>    SkPM4f_src_linear
>    SkPM4f_srcover_sRGB
>    SkPM4f_srcover_linear
> without losing any information.

Done.

mtklein

4 years, 10 months ago (2016-01-30 01:18:19 UTC) #31

Message was sent while issue was closed.

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp
File bench/Xfer4fBench.cpp (right):

https://codereview.chromium.org/1642703003/diff/60001/bench/Xfer4fBench.cpp#n...
bench/Xfer4fBench.cpp:39: for (int i = 0; i < loops; ++i) {
On 2016/01/29 22:35:27, reed1 wrote:
> On 2016/01/29 15:23:00, mtklein wrote:
> > Generally, you can also write 
> >   for (int i = 0; i < INNER_LOOPS*loops; ++i) {
> >      ...
> >   }
> 
> Ah, so you never pass 0 for loops?

Um, it's true that nanobench happens to never pass 0 for loops, but I can't help
but ask... what do you think goes wrong if loops is 0?

That's still just

for (int i = 0; i < 0; ++i) {
    ...
}
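
To spell that out with a small sketch (the `INNER_LOOPS` value and the function names are hypothetical, not taken from the bench): the nested form and the flattened form run the body the same number of times for any non-negative `loops`, including 0, where both run it zero times.

```cpp
// Hypothetical inner repeat count, chosen only for illustration.
constexpr int INNER_LOOPS = 1000;

// Nested form: an outer loop over `loops`, an inner loop over INNER_LOOPS.
inline int run_nested(int loops) {
    int iterations = 0;
    for (int i = 0; i < loops; ++i) {
        for (int j = 0; j < INNER_LOOPS; ++j) {
            ++iterations;  // stands in for the benchmarked work
        }
    }
    return iterations;
}

// Flattened form: a single loop over INNER_LOOPS * loops.
inline int run_flat(int loops) {
    int iterations = 0;
    for (int i = 0; i < INNER_LOOPS * loops; ++i) {
        ++iterations;  // stands in for the benchmarked work
    }
    return iterations;
}
```

With `loops == 0` the flattened bound is `INNER_LOOPS * 0 == 0`, so the condition `i < 0` fails immediately and the body never executes.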

Issue 1642703003: starter procs for blending with pm4f (Closed)

Patch Set 1 #

Patch Set 2 : #

Patch Set 3 : #

Patch Set 4 : int to float #
