Issue 172683002: Separates faster and slower bench alerts; sorts by delta.

Issue 172683002: Separates faster and slower bench alerts; sorts by delta. (Closed)

Created:
6 years, 10 months ago by benchen

Modified:
6 years, 10 months ago

Reviewers:
epoger

CC:
skia-review_googlegroups.com, skiabot_google.com

Base URL:
https://skia.googlesource.com/skia.git@master

Visibility:
Public.

More Reviews

Description

Separates faster and slower bench alerts; sorts by delta. Separate faster and slower bench alerts; sort by delta. BUG=skia:2193 NOTRY=true Committed: http://code.google.com/p/skia/source/detail?r=13512

Patch Set 1 #

Total comments: 12

Patch Set 2 : improvement #

Total comments: 12

Patch Set 3 : another round... #

Created: 6 years, 10 months ago

Download [raw] [tar.bz2]

	Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+75 lines, -24 lines)			Patch
M	bench/check_bench_regressions.py	View	1 2	4 chunks	+60 lines, -15 lines	0 comments	Download
M	tools/tests/benchalerts/Perf-Android-Nexus7-Tegra3-Arm7-Release/expectations.txt	View		1 chunk	+3 lines, -2 lines	0 comments	Download
M	tools/tests/benchalerts/Perf-Android-Nexus7-Tegra3-Arm7-Release/output-expected/stderr	View	1	1 chunk	+12 lines, -7 lines	0 comments	Download

Messages

Total messages: 16 (0 generated)

Expand Messages | Collapse Messages

epoger

Looks good on the whole, some suggestions below. https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regressions.py File bench/check_bench_regressions.py (right): https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regressions.py#newcode140 bench/check_bench_regressions.py:140: float(elements[-3])) ...

6 years, 10 months ago (2014-02-19 18:51:08 UTC) #2

benchen

Thanks for the suggestions! PTAL. https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regressions.py File bench/check_bench_regressions.py (right): https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regressions.py#newcode140 bench/check_bench_regressions.py:140: float(elements[-3])) Done using the ...

6 years, 10 months ago (2014-02-19 21:08:08 UTC) #3

Thanks for the suggestions! PTAL.

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
File bench/check_bench_regressions.py (right):

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
bench/check_bench_regressions.py:140: float(elements[-3]))
Done using the constant index approach, which is more lightweight.

On 2014/02/19 18:51:08, epoger wrote:
> Not new to this CL... but these are strange magic indexes into the "elements"
> array.  I suspect a more maintainable and Pythony approach would be something
> like a named tuple... see
> http://docs.python.org/2/library/collections.html#collections.namedtuple
> 
> Or, at least, constant index values defined at the top of the module.
> 
> Anyway, not essential for this CL, but I think it would be a good TODO to add
in
> here.

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
bench/check_bench_regressions.py:143: """Check if there are benches in the given
revising out of range.
On 2014/02/19 18:51:08, epoger wrote:
> TODO: document function arguments, return value, and possible exceptions
raised
> in the docstring
> 
> See http://google-styleguide.googlecode.com/svn/trunk/pyguide.html#Comments

Done.

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
bench/check_bench_regressions.py:147: exceptions = {}
On 2014/02/19 18:51:08, epoger wrote:
> Please document the structure of this dictionary.  I think it is: exception
> message strings, keyed by off_ratio (ratio by which actual result varies from
> expectation)

Done.

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
bench/check_bench_regressions.py:158: exception = 'Bench %s off range [%s, %s]
(%s vs %s, %s%%).' % (
On 2014/02/19 18:51:08, epoger wrote:
> off -> out of

Done.

https://codereview.chromium.org/172683002/diff/1/bench/check_bench_regression...
bench/check_bench_regressions.py:167: if ratio < 1 and 'Faster:' not in outputs:
On 2014/02/19 18:51:08, epoger wrote:
> I think it might be cleaner to build two expectations dicts
> (slower_than_expected, faster_than_expected) in the above loop, and then
output
> them both here.  Then you could do nice things like:
> 
> 0 bench values faster than expected
> 18 bench values slower than expected (worst was xx% faster):
>   blah
>   blah
> 
> or
> 
> 4 bench values faster than expected (best was xx% faster):
>   blah
>   blah
> 18 bench values slower than expected (worst was xx% faster):
>   blah
>   blah

Done.

https://codereview.chromium.org/172683002/diff/1/tools/tests/benchalerts/Perf...
File
tools/tests/benchalerts/Perf-Android-Nexus7-Tegra3-Arm7-Release/output-expected/stderr
(right):

https://codereview.chromium.org/172683002/diff/1/tools/tests/benchalerts/Perf...
tools/tests/benchalerts/Perf-Android-Nexus7-Tegra3-Arm7-Release/output-expected/stderr:12:
Bench desk_amazon.skp_record_,Perf-Android-Nexus7-Tegra3-Arm7-Release-25th off
range [-1.0, 1.2] (1.213 vs 1.1, 10.2727272727%).
Yes, the batch script algorithm could make the lower bound smaller than zero. I
think it's good to leave them as is since they reflect the actual calculated
ranges (not truncated by 0), so we still know if they would be too wide.
On 2014/02/19 18:51:08, epoger wrote:
> we allow negative results?  (-1.0 as lower bound?)

epoger

Cool, getting down to the nits. https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regressions.py File bench/check_bench_regressions.py (right): https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regressions.py#newcode149 bench/check_bench_regressions.py:149: """Check if there ...

6 years, 10 months ago (2014-02-19 21:27:51 UTC) #4

benchen

Thanks for the careful review! PTAL. https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regressions.py File bench/check_bench_regressions.py (right): https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regressions.py#newcode149 bench/check_bench_regressions.py:149: """Check if there ...

6 years, 10 months ago (2014-02-19 21:47:53 UTC) #5

Thanks for the careful review! PTAL.

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
File bench/check_bench_regressions.py (right):

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:149: """Check if there are benches in the given
revising out of range.
On 2014/02/19 21:27:51, epoger wrote:
> revising -> revision ?
> 
> Actually, revision disappeared from the arg list anyway.  So I guess this
should
> read:
> 
> Check if any bench results are outside of expected range.

Done.

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:158: bench representation algorithm (default to
"25th").
Agreed. Done.
On 2014/02/19 21:27:51, epoger wrote:
> I don't see anything about defaulting to "25th" in this function.  Maybe just
> remove "(default to "25th")" from this arg description?

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:164: Exception containing bench data that are
out of range, if nonempty.
On 2014/02/19 21:27:51, epoger wrote:
> nonempty -> any

Done.

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:170: # to a list of correspondig exception
messages.
Done. How did you catch this one?
On 2014/02/19 21:27:51, epoger wrote:
> correspondig -> corresponding

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:186: exceptions[0].setdefault(off_ratio,
[]).append(exception)
Done using constants.
On 2014/02/19 21:27:51, epoger wrote:
> Rather than magic values 0 and 1, please use defined constants or a named
tuple.
>  Or just create two separate dictionary variables (slower_than_expected,
> faster_than_expected).

https://codereview.chromium.org/172683002/diff/70001/bench/check_bench_regres...
bench/check_bench_regressions.py:197: header = '%s Slower benche(s) (Sorted):' %
len(li)
Agreed. Done.
On 2014/02/19 21:27:51, epoger wrote:
> I think this would be clearer:
> 
> '%s benches got slower (sorted by %% difference):'
> 
> Part of this is: I think it's easier to read if you just leave out the "(s)"
> business to account for singular-vs-plural.  I think that's distracting.

epoger

LGTM! Thanks for making this improvement so quickly, Ben!

6 years, 10 months ago (2014-02-19 21:50:30 UTC) #6

commit-bot: I haz the power