Issue 2746363003: Compute inter-percentile range statistics in Histogram.

Issue 2746363003: Compute inter-percentile range statistics in Histogram. (Closed)

Created:
3 years, 9 months ago by benjhayden

Modified:
3 years, 7 months ago

Reviewers:
eakuefner, Liquan (Max) Gu, tdresser

CC:
catapult-reviews_chromium.org, dproy, tracing-review_chromium.org

Target Ref:
refs/heads/master

Project:
catapult

Visibility:
Public.

More Reviews

Description

Compute inter-percentile range statistics in Histogram. Currently, the only statistic that measures noise is std, but this is not strictly applicable to non-normal distributions, such as most of our metrics. Inter-quartile range is another measure of noise that is more broadly applicable. This CL generalizes IQR to inter-percentile ranges: histogram.customizeSummaryOptions({iprs: [ tr.b.Range.fromExplicitRange(0.25, 0.75), tr.b.Range.fromExplicitRange(0.1, 0.9), ]}); BUG=catapult:#3319 Review-Url: https://codereview.chromium.org/2746363003 Committed: https://chromium.googlesource.com/external/github.com/catapult-project/catapult/+/4628b575b1d7baa7e729288f3397c0b75926881e

Patch Set 1 #

Patch Set 2 : rebase #

Total comments: 1

Patch Set 3 : rebase #

Total comments: 5

Created: 3 years, 7 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+105 lines, -24 lines)			Patch
	M	tracing/tracing/value/histogram.html	View	1 2	11 chunks	+56 lines, -13 lines	5 comments	Download
	M	tracing/tracing/value/histogram_test.html	View	1 2	4 chunks	+49 lines, -11 lines	0 comments	Download

Messages

Total messages: 26 (9 generated)

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages

eakuefner

lgtm; i think it's ultimately up to tim what we end up adding here, so ...

3 years, 9 months ago (2017-03-17 20:14:21 UTC) #3

tdresser

Shall we wait for Deep to gather some data for us to look at before ...

3 years, 9 months ago (2017-03-17 21:18:41 UTC) #4

benjhayden

On 2017/03/17 at 21:18:41, tdresser wrote: > Shall we wait for Deep to gather some ...

3 years, 9 months ago (2017-03-17 21:52:43 UTC) #5

Liquan (Max) Gu

I have posted a report on the comparison of STD, MAD, AAD, IQR here: https://docs.google.com/document/d/1cu0h94BiL9BoBMSvAjNCZk1g4kixJvBTyOrUkUY3TwM/edit# ...

3 years, 8 months ago (2017-04-24 19:58:43 UTC) #7

benjhayden

On 2017/04/24 at 19:58:43, maxlg wrote: > I have posted a report on the comparison ...

3 years, 7 months ago (2017-05-03 17:42:24 UTC) #8

benjhayden

On 2017/05/03 at 17:42:24, benjhayden_OOO wrote: > On 2017/04/24 at 19:58:43, maxlg wrote: > > ...

3 years, 7 months ago (2017-05-03 17:50:31 UTC) #9

Liquan (Max) Gu

On 2017/05/03 17:50:31, benjhayden_OOO wrote: > On 2017/05/03 at 17:42:24, benjhayden_OOO wrote: > > On ...

3 years, 7 months ago (2017-05-03 17:55:56 UTC) #10

tdresser

On 2017/05/03 17:55:56, Liquan (Max) Gu wrote: > On 2017/05/03 17:50:31, benjhayden_OOO wrote: > > ...

3 years, 7 months ago (2017-05-09 15:33:55 UTC) #11

On 2017/05/03 17:55:56, Liquan (Max) Gu wrote:
> On 2017/05/03 17:50:31, benjhayden_OOO wrote:
> > On 2017/05/03 at 17:42:24, benjhayden_OOO wrote:
> > > On 2017/04/24 at 19:58:43, maxlg wrote:
> > > > I have posted a report on the comparison of STD, MAD, AAD, IQR here:
> >
>
https://docs.google.com/document/d/1cu0h94BiL9BoBMSvAjNCZk1g4kixJvBTyOrUkUY3T...
> > > > 
> > > >
> >
>
https://codereview.chromium.org/2746363003/diff/20001/tracing/tracing/value/h...
> > > > File tracing/tracing/value/histogram.html (right):
> > > > 
> > > >
> >
>
https://codereview.chromium.org/2746363003/diff/20001/tracing/tracing/value/h...
> > > > tracing/tracing/value/histogram.html:574: }
> > > > When this.numValues === 0, it will return a scalar whose {value: NaN}
and
> > fails the test "assert.strictEqual(0,
> > hist.getStatisticScalar('ipr_000_100').value);"
> > > > 
> > > > Do we want to return scalar whose {value: 0} or {value: NaN} or scalar
===
> > undefined when the histogram is empty?
> > > 
> > > I think we try to limit use of NaN because it's often a source of bugs,
> > despite the number of isNaN checks already in this directory.
> > > The primary use-case is the dashboard. If the scalar is undefined, then
the
> > dashboard might show gaps in the charts. OTOH, these statistics are
naturally
> > zero when there's only 1 or 2 samples, so if we also use zero when the
> histogram
> > is empty, then that might cause ambiguity.
> > > I'll ask Ethan what these statistics should look like in the dashboard
when
> > the histogram is empty.
> > > Either way, the Histogram class and the Statistics static class should
> exhibit
> > the same behavior.
> > 
> > Offline discussion suggests using undefined for all statistic scalars that
> > cannot be computed when a histogram is empty. "count" should be 0, but all
> other
> > statistics should be undefined.
> 
> I agree with it.

So we've done some investigation here, and think IQR is the right thing to land.

Let's go ahead and land this.

benjhayden

On 2017/05/09 at 15:33:55, tdresser wrote: > On 2017/05/03 17:55:56, Liquan (Max) Gu wrote: > ...

3 years, 7 months ago (2017-05-09 16:09:25 UTC) #12

On 2017/05/09 at 15:33:55, tdresser wrote:
> On 2017/05/03 17:55:56, Liquan (Max) Gu wrote:
> > On 2017/05/03 17:50:31, benjhayden_OOO wrote:
> > > On 2017/05/03 at 17:42:24, benjhayden_OOO wrote:
> > > > On 2017/04/24 at 19:58:43, maxlg wrote:
> > > > > I have posted a report on the comparison of STD, MAD, AAD, IQR here:
> > >
> >
https://docs.google.com/document/d/1cu0h94BiL9BoBMSvAjNCZk1g4kixJvBTyOrUkUY3T...
> > > > > 
> > > > >
> > >
> >
https://codereview.chromium.org/2746363003/diff/20001/tracing/tracing/value/h...
> > > > > File tracing/tracing/value/histogram.html (right):
> > > > > 
> > > > >
> > >
> >
https://codereview.chromium.org/2746363003/diff/20001/tracing/tracing/value/h...
> > > > > tracing/tracing/value/histogram.html:574: }
> > > > > When this.numValues === 0, it will return a scalar whose {value: NaN}
and
> > > fails the test "assert.strictEqual(0,
> > > hist.getStatisticScalar('ipr_000_100').value);"
> > > > > 
> > > > > Do we want to return scalar whose {value: 0} or {value: NaN} or scalar
===
> > > undefined when the histogram is empty?
> > > > 
> > > > I think we try to limit use of NaN because it's often a source of bugs,
> > > despite the number of isNaN checks already in this directory.
> > > > The primary use-case is the dashboard. If the scalar is undefined, then
the
> > > dashboard might show gaps in the charts. OTOH, these statistics are
naturally
> > > zero when there's only 1 or 2 samples, so if we also use zero when the
> > histogram
> > > is empty, then that might cause ambiguity.
> > > > I'll ask Ethan what these statistics should look like in the dashboard
when
> > > the histogram is empty.
> > > > Either way, the Histogram class and the Statistics static class should
> > exhibit
> > > the same behavior.
> > > 
> > > Offline discussion suggests using undefined for all statistic scalars that
> > > cannot be computed when a histogram is empty. "count" should be 0, but all
> > other
> > > statistics should be undefined.
> > 
> > I agree with it.
> 
> So we've done some investigation here, and think IQR is the right thing to
land.
> 
> Let's go ahead and land this.

Great! I'll rebase and clean this up and say PTAL to restart reviews.

tdresser

lgtm with nit. https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html File tracing/tracing/value/histogram.html (right): https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html#newcode133 tracing/tracing/value/histogram.html:133: ['std', true], Should std and avg ...

3 years, 7 months ago (2017-05-11 21:03:12 UTC) #15

benjhayden

https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html File tracing/tracing/value/histogram.html (right): https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html#newcode133 tracing/tracing/value/histogram.html:133: ['std', true], On 2017/05/11 at 21:03:12, tdresser wrote: > ...

3 years, 7 months ago (2017-05-11 21:24:15 UTC) #16

tdresser

https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html File tracing/tracing/value/histogram.html (right): https://codereview.chromium.org/2746363003/diff/40001/tracing/tracing/value/histogram.html#newcode779 tracing/tracing/value/histogram.html:779: if (stat === 'percentile' || stat === 'iprs') { ...

3 years, 7 months ago (2017-05-12 12:31:17 UTC) #17

commit-bot: I haz the power

CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2746363003/40001

3 years, 7 months ago (2017-05-12 20:12:55 UTC) #23

commit-bot: I haz the power

Description was changed from ========== Compute inter-percentile range statistics in Histogram. Currently, the only statistic ...

3 years, 7 months ago (2017-05-12 20:14:12 UTC) #25

commit-bot: I haz the power

3 years, 7 months ago (2017-05-12 20:14:13 UTC) #26

Message was sent while issue was closed.

Committed patchset #3 (id:40001) as
https://chromium.googlesource.com/external/github.com/catapult-project/catapu...

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages