Issue 2271793003: Add a new benchmark for cpu power measurements on steady state sites.

erikchen

Description was changed from ========== Add a new benchmark for cpu power measurements on steady ...

4 years, 4 months ago (2016-08-24 00:54:09 UTC) #1

erikchen

erikchen@chromium.org changed reviewers: + charliea@chromium.org, nednguyen@google.com

4 years, 4 months ago (2016-08-24 00:54:51 UTC) #2

nednguyen

nednguyen@google.com changed reviewers: + rnephew@chromium.org

4 years, 4 months ago (2016-08-24 19:51:43 UTC) #4

nednguyen

https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py File tools/perf/page_sets/steady_state.py (right): https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py#newcode8 tools/perf/page_sets/steady_state.py:8: class SteadyStateStorySet(story.StorySet): Why create new story set & benchmark ...

4 years, 4 months ago (2016-08-24 19:51:44 UTC) #5

erikchen

https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py File tools/perf/page_sets/steady_state.py (right): https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py#newcode8 tools/perf/page_sets/steady_state.py:8: class SteadyStateStorySet(story.StorySet): On 2016/08/24 19:51:43, nednguyen wrote: > Why ...

4 years, 4 months ago (2016-08-24 22:13:32 UTC) #6

nednguyen

On 2016/08/24 22:13:32, erikchen wrote: > https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py > File tools/perf/page_sets/steady_state.py (right): > > https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py#newcode8 > ...

4 years, 4 months ago (2016-08-25 00:27:16 UTC) #7

nednguyen

Description was changed from ========== Add a new benchmark for cpu power measurements on steady ...

4 years, 4 months ago (2016-08-25 00:27:29 UTC) #8

nednguyen

nednguyen@google.com changed reviewers: + petrcermak@chromium.org

4 years, 4 months ago (2016-08-25 00:27:29 UTC) #9

erikchen

On 2016/08/25 00:27:16, nednguyen wrote: > On 2016/08/24 22:13:32, erikchen wrote: > > > https://codereview.chromium.org/2271793003/diff/1/tools/perf/page_sets/steady_state.py ...

4 years, 4 months ago (2016-08-25 00:29:38 UTC) #10

nednguyen

nednguyen@google.com changed reviewers: + cbruni@chromium.org, fmeawad@chromium.org, kouhei@chromium.org - rnephew@chromium.org

4 years, 4 months ago (2016-08-25 00:40:07 UTC) #11

nednguyen

If you're looking into create a bunch of realistic pages that just wait x seconds ...

4 years, 4 months ago (2016-08-25 00:40:08 UTC) #12

erikchen

On 2016/08/25 00:40:08, nednguyen wrote: > If you're looking into create a bunch of realistic ...

4 years, 4 months ago (2016-08-25 00:45:35 UTC) #13

nednguyen

On 2016/08/25 00:45:35, erikchen wrote: > On 2016/08/25 00:40:08, nednguyen wrote: > > If you're ...

4 years, 4 months ago (2016-08-25 00:51:43 UTC) #14

erikchen

> I think this correlates well with whether the page is idle after load. I ...

4 years, 4 months ago (2016-08-25 01:01:51 UTC) #15

nednguyen

On 2016/08/25 01:01:51, erikchen wrote: > > I think this correlates well with whether the ...

4 years, 4 months ago (2016-08-25 01:28:40 UTC) #16

On 2016/08/25 01:01:51, erikchen wrote:
> > I think this correlates well with whether the page is idle after load. I
would
> > expect 
> > pages with high idle CPU usage to have high TimeToInteractive metrics values
> as
> > well.
> 
> I do not have this expectation. From everything we've seen so far, most pages
> with high idle CPU usage are due to too many Timer firings, excessive
relayouts,
> etc. See the "Potential Areas for Investigation" section under
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...

Wont't these tasks run on the main threads? If the main threads is quiet, we
consider the page to be interactive. (see more on
https://github.com/tdresser/time-to-interactive/blob/master/README.md)

To be clear, my main question here is:
"if a page exhibit interesting power problems around loading period, would that
page also be interesting for loading & v8 to track their performance?"

If we believe the nature of these pages are different, I am fine with not
sharing the pages but we should discuss first. 

> 
> > The main cost of maintenance is triaging/fixing bugs when Chrome crashes
when
> it
> > to load these pages. For an ideas of how often that happens,
> > you can look at the number of blocking bugs in
> > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> 
> I started going through the blocking bugs in
> https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm failing to
see
> any that are caused by Chrome crashes associated with loading pages from WPR.
> Could you point out some specific examples?
> 
> There are several possibilities that I see:
> 
> 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this problem
> before). This is a serious issue, and I could see an argument about wanting to
> have fewer WPR-ed pages to avoid this issue. Although the right solution here
is
> to rely less on WPR.
> 
> 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If we're
> catching real Chrome crashes that are making it past the Chrome CQ, we should
> find a set of WPR pages to add to the CQ. I'm sure everyone would want to see
us
> catch more crashes before they make it live.

Yes. We already enable running all the system health stories on CQ and catch a
bunch of Chrome crashes with those. Examples:
https://bugs.chromium.org/p/chromium/issues/detail?id=624474
https://bugs.chromium.org/p/chromium/issues/detail?id=624587
https://bugs.chromium.org/p/chromium/issues/detail?id=624701
https://bugs.chromium.org/p/chromium/issues/detail?id=612135
https://bugs.chromium.org/p/chromium/issues/detail?id=637230
...

I would want to enable more tests but stip@ from infra team is having some
concern with telemetry_perf_unittests currently is taking most of resources in
android_chromium_rel_ng. +__+ 
I will advocate for more tests anyway, but we need to adjust the number of tests
we enable with android infra's capacity.

kouhei (in TOK)

> +Kouhei, Fadi & Camillo: what do you folks think? I'm supportive of erikchen@'s idea ...

4 years, 4 months ago (2016-08-25 01:48:10 UTC) #17

erikchen

On 2016/08/25 01:28:40, nednguyen wrote: > On 2016/08/25 01:01:51, erikchen wrote: > > > I ...

4 years, 3 months ago (2016-08-25 15:02:03 UTC) #18

On 2016/08/25 01:28:40, nednguyen wrote:
> On 2016/08/25 01:01:51, erikchen wrote:
> > > I think this correlates well with whether the page is idle after load. I
> would
> > > expect 
> > > pages with high idle CPU usage to have high TimeToInteractive metrics
values
> > as
> > > well.
> > 
> > I do not have this expectation. From everything we've seen so far, most
pages
> > with high idle CPU usage are due to too many Timer firings, excessive
> relayouts,
> > etc. See the "Potential Areas for Investigation" section under
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> 
> Wont't these tasks run on the main threads? If the main threads is quiet, we
> consider the page to be interactive. (see more on
> https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> 
> To be clear, my main question here is:
> "if a page exhibit interesting power problems around loading period, would
that
> page also be interesting for loading & v8 to track their performance?"
> 
> If we believe the nature of these pages are different, I am fine with not
> sharing the pages but we should discuss first. 
> 
> 
> > 
> > > The main cost of maintenance is triaging/fixing bugs when Chrome crashes
> when
> > it
> > > to load these pages. For an ideas of how often that happens,
> > > you can look at the number of blocking bugs in
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > 
> > I started going through the blocking bugs in
> > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm failing to
> see
> > any that are caused by Chrome crashes associated with loading pages from
WPR.
> > Could you point out some specific examples?
> > 
> > There are several possibilities that I see:
> > 
> > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this problem
> > before). This is a serious issue, and I could see an argument about wanting
to
> > have fewer WPR-ed pages to avoid this issue. Although the right solution
here
> is
> > to rely less on WPR.
> > 
> > 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If we're
> > catching real Chrome crashes that are making it past the Chrome CQ, we
should
> > find a set of WPR pages to add to the CQ. I'm sure everyone would want to
see
> us
> > catch more crashes before they make it live.
> 
> Yes. We already enable running all the system health stories on CQ and catch a
> bunch of Chrome crashes with those. Examples:
> https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> ...
> 
> I would want to enable more tests but stip@ from infra team is having some
> concern with telemetry_perf_unittests currently is taking most of resources in
> android_chromium_rel_ng. +__+ 
> I will advocate for more tests anyway, but we need to adjust the number of
tests
> we enable with android infra's capacity.
It's a good thing these tests will only be running on Mac for now.

I went through the 5 bugs you posted - 3 of them were chrome crashes, and 2 were
webpage load timeouts (guessing WPR failure?)

nednguyen

On 2016/08/25 15:02:03, erikchen wrote: > On 2016/08/25 01:28:40, nednguyen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:19:49 UTC) #19

On 2016/08/25 15:02:03, erikchen wrote:
> On 2016/08/25 01:28:40, nednguyen wrote:
> > On 2016/08/25 01:01:51, erikchen wrote:
> > > > I think this correlates well with whether the page is idle after load. I
> > would
> > > > expect 
> > > > pages with high idle CPU usage to have high TimeToInteractive metrics
> values
> > > as
> > > > well.
> > > 
> > > I do not have this expectation. From everything we've seen so far, most
> pages
> > > with high idle CPU usage are due to too many Timer firings, excessive
> > relayouts,
> > > etc. See the "Potential Areas for Investigation" section under
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > 
> > Wont't these tasks run on the main threads? If the main threads is quiet, we
> > consider the page to be interactive. (see more on
> > https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > 
> > To be clear, my main question here is:
> > "if a page exhibit interesting power problems around loading period, would
> that
> > page also be interesting for loading & v8 to track their performance?"
> > 
> > If we believe the nature of these pages are different, I am fine with not
> > sharing the pages but we should discuss first. 
> > 
> > 
> > > 
> > > > The main cost of maintenance is triaging/fixing bugs when Chrome crashes
> > when
> > > it
> > > > to load these pages. For an ideas of how often that happens,
> > > > you can look at the number of blocking bugs in
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > 
> > > I started going through the blocking bugs in
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm failing
to
> > see
> > > any that are caused by Chrome crashes associated with loading pages from
> WPR.
> > > Could you point out some specific examples?
> > > 
> > > There are several possibilities that I see:
> > > 
> > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this problem
> > > before). This is a serious issue, and I could see an argument about
wanting
> to
> > > have fewer WPR-ed pages to avoid this issue. Although the right solution
> here
> > is
> > > to rely less on WPR.
> > > 
> > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If we're
> > > catching real Chrome crashes that are making it past the Chrome CQ, we
> should
> > > find a set of WPR pages to add to the CQ. I'm sure everyone would want to
> see
> > us
> > > catch more crashes before they make it live.
> > 
> > Yes. We already enable running all the system health stories on CQ and catch
a
> > bunch of Chrome crashes with those. Examples:
> > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > ...
> > 
> > I would want to enable more tests but stip@ from infra team is having some
> > concern with telemetry_perf_unittests currently is taking most of resources
in
> > android_chromium_rel_ng. +__+ 
> > I will advocate for more tests anyway, but we need to adjust the number of
> tests
> > we enable with android infra's capacity.
> It's a good thing these tests will only be running on Mac for now.
> 
> I went through the 5 bugs you posted - 3 of them were chrome crashes, and 2
were
> webpage load timeouts (guessing WPR failure?)

Ok. Let's name these stories: busy_idle_loading_stories.py?

erikchen

On 2016/08/25 15:19:49, nednguyen wrote: > On 2016/08/25 15:02:03, erikchen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:22:07 UTC) #20

On 2016/08/25 15:19:49, nednguyen wrote:
> On 2016/08/25 15:02:03, erikchen wrote:
> > On 2016/08/25 01:28:40, nednguyen wrote:
> > > On 2016/08/25 01:01:51, erikchen wrote:
> > > > > I think this correlates well with whether the page is idle after load.
I
> > > would
> > > > > expect 
> > > > > pages with high idle CPU usage to have high TimeToInteractive metrics
> > values
> > > > as
> > > > > well.
> > > > 
> > > > I do not have this expectation. From everything we've seen so far, most
> > pages
> > > > with high idle CPU usage are due to too many Timer firings, excessive
> > > relayouts,
> > > > etc. See the "Potential Areas for Investigation" section under
> > > >
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > > 
> > > Wont't these tasks run on the main threads? If the main threads is quiet,
we
> > > consider the page to be interactive. (see more on
> > > https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > > 
> > > To be clear, my main question here is:
> > > "if a page exhibit interesting power problems around loading period, would
> > that
> > > page also be interesting for loading & v8 to track their performance?"
> > > 
> > > If we believe the nature of these pages are different, I am fine with not
> > > sharing the pages but we should discuss first. 
> > > 
> > > 
> > > > 
> > > > > The main cost of maintenance is triaging/fixing bugs when Chrome
crashes
> > > when
> > > > it
> > > > > to load these pages. For an ideas of how often that happens,
> > > > > you can look at the number of blocking bugs in
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > > 
> > > > I started going through the blocking bugs in
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm
failing
> to
> > > see
> > > > any that are caused by Chrome crashes associated with loading pages from
> > WPR.
> > > > Could you point out some specific examples?
> > > > 
> > > > There are several possibilities that I see:
> > > > 
> > > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this
problem
> > > > before). This is a serious issue, and I could see an argument about
> wanting
> > to
> > > > have fewer WPR-ed pages to avoid this issue. Although the right solution
> > here
> > > is
> > > > to rely less on WPR.
> > > > 
> > > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If
we're
> > > > catching real Chrome crashes that are making it past the Chrome CQ, we
> > should
> > > > find a set of WPR pages to add to the CQ. I'm sure everyone would want
to
> > see
> > > us
> > > > catch more crashes before they make it live.
> > > 
> > > Yes. We already enable running all the system health stories on CQ and
catch
> a
> > > bunch of Chrome crashes with those. Examples:
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > > ...
> > > 
> > > I would want to enable more tests but stip@ from infra team is having some
> > > concern with telemetry_perf_unittests currently is taking most of
resources
> in
> > > android_chromium_rel_ng. +__+ 
> > > I will advocate for more tests anyway, but we need to adjust the number of
> > tests
> > > we enable with android infra's capacity.
> > It's a good thing these tests will only be running on Mac for now.
> > 
> > I went through the 5 bugs you posted - 3 of them were chrome crashes, and 2
> were
> > webpage load timeouts (guessing WPR failure?)
> 
> Ok. Let's name these stories: busy_idle_loading_stories.py?

There are many names I am okay with but this is not one of them. Busy and Idle
have the exact opposite meaning. And the whole point is that there is no loading
involved at all. If you don't like the term "steady state", we can go with
"idle_stories.py". Note that the fact that Chrome has high CPU usage on these
sites is almost certainly a chrome bug, rather than a feature of the pages
themselves. Once these bugs are fixed, then they will just be "idle pages"

nednguyen

On 2016/08/25 15:22:07, erikchen wrote: > On 2016/08/25 15:19:49, nednguyen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:25:05 UTC) #21

On 2016/08/25 15:22:07, erikchen wrote:
> On 2016/08/25 15:19:49, nednguyen wrote:
> > On 2016/08/25 15:02:03, erikchen wrote:
> > > On 2016/08/25 01:28:40, nednguyen wrote:
> > > > On 2016/08/25 01:01:51, erikchen wrote:
> > > > > > I think this correlates well with whether the page is idle after
load.
> I
> > > > would
> > > > > > expect 
> > > > > > pages with high idle CPU usage to have high TimeToInteractive
metrics
> > > values
> > > > > as
> > > > > > well.
> > > > > 
> > > > > I do not have this expectation. From everything we've seen so far,
most
> > > pages
> > > > > with high idle CPU usage are due to too many Timer firings, excessive
> > > > relayouts,
> > > > > etc. See the "Potential Areas for Investigation" section under
> > > > >
> > > >
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > > > 
> > > > Wont't these tasks run on the main threads? If the main threads is
quiet,
> we
> > > > consider the page to be interactive. (see more on
> > > > https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > > > 
> > > > To be clear, my main question here is:
> > > > "if a page exhibit interesting power problems around loading period,
would
> > > that
> > > > page also be interesting for loading & v8 to track their performance?"
> > > > 
> > > > If we believe the nature of these pages are different, I am fine with
not
> > > > sharing the pages but we should discuss first. 
> > > > 
> > > > 
> > > > > 
> > > > > > The main cost of maintenance is triaging/fixing bugs when Chrome
> crashes
> > > > when
> > > > > it
> > > > > > to load these pages. For an ideas of how often that happens,
> > > > > > you can look at the number of blocking bugs in
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > > > 
> > > > > I started going through the blocking bugs in
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm
> failing
> > to
> > > > see
> > > > > any that are caused by Chrome crashes associated with loading pages
from
> > > WPR.
> > > > > Could you point out some specific examples?
> > > > > 
> > > > > There are several possibilities that I see:
> > > > > 
> > > > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this
> problem
> > > > > before). This is a serious issue, and I could see an argument about
> > wanting
> > > to
> > > > > have fewer WPR-ed pages to avoid this issue. Although the right
solution
> > > here
> > > > is
> > > > > to rely less on WPR.
> > > > > 
> > > > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If
> we're
> > > > > catching real Chrome crashes that are making it past the Chrome CQ, we
> > > should
> > > > > find a set of WPR pages to add to the CQ. I'm sure everyone would want
> to
> > > see
> > > > us
> > > > > catch more crashes before they make it live.
> > > > 
> > > > Yes. We already enable running all the system health stories on CQ and
> catch
> > a
> > > > bunch of Chrome crashes with those. Examples:
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > > > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > > > ...
> > > > 
> > > > I would want to enable more tests but stip@ from infra team is having
some
> > > > concern with telemetry_perf_unittests currently is taking most of
> resources
> > in
> > > > android_chromium_rel_ng. +__+ 
> > > > I will advocate for more tests anyway, but we need to adjust the number
of
> > > tests
> > > > we enable with android infra's capacity.
> > > It's a good thing these tests will only be running on Mac for now.
> > > 
> > > I went through the 5 bugs you posted - 3 of them were chrome crashes, and
2
> > were
> > > webpage load timeouts (guessing WPR failure?)
> > 
> > Ok. Let's name these stories: busy_idle_loading_stories.py?
> 
> There are many names I am okay with but this is not one of them. Busy and Idle
> have the exact opposite meaning. And the whole point is that there is no
loading
> involved at all. If you don't like the term "steady state", we can go with
> "idle_stories.py". Note that the fact that Chrome has high CPU usage on these
> sites is almost certainly a chrome bug, rather than a feature of the pages
> themselves. Once these bugs are fixed, then they will just be "idle pages"

sgtm.

erikchen

On 2016/08/25 15:25:05, nednguyen wrote: > On 2016/08/25 15:22:07, erikchen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:28:30 UTC) #22

On 2016/08/25 15:25:05, nednguyen wrote:
> On 2016/08/25 15:22:07, erikchen wrote:
> > On 2016/08/25 15:19:49, nednguyen wrote:
> > > On 2016/08/25 15:02:03, erikchen wrote:
> > > > On 2016/08/25 01:28:40, nednguyen wrote:
> > > > > On 2016/08/25 01:01:51, erikchen wrote:
> > > > > > > I think this correlates well with whether the page is idle after
> load.
> > I
> > > > > would
> > > > > > > expect 
> > > > > > > pages with high idle CPU usage to have high TimeToInteractive
> metrics
> > > > values
> > > > > > as
> > > > > > > well.
> > > > > > 
> > > > > > I do not have this expectation. From everything we've seen so far,
> most
> > > > pages
> > > > > > with high idle CPU usage are due to too many Timer firings,
excessive
> > > > > relayouts,
> > > > > > etc. See the "Potential Areas for Investigation" section under
> > > > > >
> > > > >
> > > >
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > > > > 
> > > > > Wont't these tasks run on the main threads? If the main threads is
> quiet,
> > we
> > > > > consider the page to be interactive. (see more on
> > > > > https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > > > > 
> > > > > To be clear, my main question here is:
> > > > > "if a page exhibit interesting power problems around loading period,
> would
> > > > that
> > > > > page also be interesting for loading & v8 to track their performance?"
> > > > > 
> > > > > If we believe the nature of these pages are different, I am fine with
> not
> > > > > sharing the pages but we should discuss first. 
> > > > > 
> > > > > 
> > > > > > 
> > > > > > > The main cost of maintenance is triaging/fixing bugs when Chrome
> > crashes
> > > > > when
> > > > > > it
> > > > > > > to load these pages. For an ideas of how often that happens,
> > > > > > > you can look at the number of blocking bugs in
> > > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > > > > 
> > > > > > I started going through the blocking bugs in
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm
> > failing
> > > to
> > > > > see
> > > > > > any that are caused by Chrome crashes associated with loading pages
> from
> > > > WPR.
> > > > > > Could you point out some specific examples?
> > > > > > 
> > > > > > There are several possibilities that I see:
> > > > > > 
> > > > > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this
> > problem
> > > > > > before). This is a serious issue, and I could see an argument about
> > > wanting
> > > > to
> > > > > > have fewer WPR-ed pages to avoid this issue. Although the right
> solution
> > > > here
> > > > > is
> > > > > > to rely less on WPR.
> > > > > > 
> > > > > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge! If
> > we're
> > > > > > catching real Chrome crashes that are making it past the Chrome CQ,
we
> > > > should
> > > > > > find a set of WPR pages to add to the CQ. I'm sure everyone would
want
> > to
> > > > see
> > > > > us
> > > > > > catch more crashes before they make it live.
> > > > > 
> > > > > Yes. We already enable running all the system health stories on CQ and
> > catch
> > > a
> > > > > bunch of Chrome crashes with those. Examples:
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > > > > ...
> > > > > 
> > > > > I would want to enable more tests but stip@ from infra team is having
> some
> > > > > concern with telemetry_perf_unittests currently is taking most of
> > resources
> > > in
> > > > > android_chromium_rel_ng. +__+ 
> > > > > I will advocate for more tests anyway, but we need to adjust the
number
> of
> > > > tests
> > > > > we enable with android infra's capacity.
> > > > It's a good thing these tests will only be running on Mac for now.
> > > > 
> > > > I went through the 5 bugs you posted - 3 of them were chrome crashes,
and
> 2
> > > were
> > > > webpage load timeouts (guessing WPR failure?)
> > > 
> > > Ok. Let's name these stories: busy_idle_loading_stories.py?
> > 
> > There are many names I am okay with but this is not one of them. Busy and
Idle
> > have the exact opposite meaning. And the whole point is that there is no
> loading
> > involved at all. If you don't like the term "steady state", we can go with
> > "idle_stories.py". Note that the fact that Chrome has high CPU usage on
these
> > sites is almost certainly a chrome bug, rather than a feature of the pages
> > themselves. Once these bugs are fixed, then they will just be "idle pages"
> 
> sgtm.

sgtm to the current name or idle_stories.py?

erikchen

On 2016/08/25 15:28:30, erikchen wrote: > On 2016/08/25 15:25:05, nednguyen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:29:40 UTC) #23

On 2016/08/25 15:28:30, erikchen wrote:
> On 2016/08/25 15:25:05, nednguyen wrote:
> > On 2016/08/25 15:22:07, erikchen wrote:
> > > On 2016/08/25 15:19:49, nednguyen wrote:
> > > > On 2016/08/25 15:02:03, erikchen wrote:
> > > > > On 2016/08/25 01:28:40, nednguyen wrote:
> > > > > > On 2016/08/25 01:01:51, erikchen wrote:
> > > > > > > > I think this correlates well with whether the page is idle after
> > load.
> > > I
> > > > > > would
> > > > > > > > expect 
> > > > > > > > pages with high idle CPU usage to have high TimeToInteractive
> > metrics
> > > > > values
> > > > > > > as
> > > > > > > > well.
> > > > > > > 
> > > > > > > I do not have this expectation. From everything we've seen so far,
> > most
> > > > > pages
> > > > > > > with high idle CPU usage are due to too many Timer firings,
> excessive
> > > > > > relayouts,
> > > > > > > etc. See the "Potential Areas for Investigation" section under
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > > > > > 
> > > > > > Wont't these tasks run on the main threads? If the main threads is
> > quiet,
> > > we
> > > > > > consider the page to be interactive. (see more on
> > > > > >
https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > > > > > 
> > > > > > To be clear, my main question here is:
> > > > > > "if a page exhibit interesting power problems around loading period,
> > would
> > > > > that
> > > > > > page also be interesting for loading & v8 to track their
performance?"
> > > > > > 
> > > > > > If we believe the nature of these pages are different, I am fine
with
> > not
> > > > > > sharing the pages but we should discuss first. 
> > > > > > 
> > > > > > 
> > > > > > > 
> > > > > > > > The main cost of maintenance is triaging/fixing bugs when Chrome
> > > crashes
> > > > > > when
> > > > > > > it
> > > > > > > > to load these pages. For an ideas of how often that happens,
> > > > > > > > you can look at the number of blocking bugs in
> > > > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > > > > > 
> > > > > > > I started going through the blocking bugs in
> > > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm
> > > failing
> > > > to
> > > > > > see
> > > > > > > any that are caused by Chrome crashes associated with loading
pages
> > from
> > > > > WPR.
> > > > > > > Could you point out some specific examples?
> > > > > > > 
> > > > > > > There are several possibilities that I see:
> > > > > > > 
> > > > > > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this
> > > problem
> > > > > > > before). This is a serious issue, and I could see an argument
about
> > > > wanting
> > > > > to
> > > > > > > have fewer WPR-ed pages to avoid this issue. Although the right
> > solution
> > > > > here
> > > > > > is
> > > > > > > to rely less on WPR.
> > > > > > > 
> > > > > > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge!
If
> > > we're
> > > > > > > catching real Chrome crashes that are making it past the Chrome
CQ,
> we
> > > > > should
> > > > > > > find a set of WPR pages to add to the CQ. I'm sure everyone would
> want
> > > to
> > > > > see
> > > > > > us
> > > > > > > catch more crashes before they make it live.
> > > > > > 
> > > > > > Yes. We already enable running all the system health stories on CQ
and
> > > catch
> > > > a
> > > > > > bunch of Chrome crashes with those. Examples:
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > > > > > ...
> > > > > > 
> > > > > > I would want to enable more tests but stip@ from infra team is
having
> > some
> > > > > > concern with telemetry_perf_unittests currently is taking most of
> > > resources
> > > > in
> > > > > > android_chromium_rel_ng. +__+ 
> > > > > > I will advocate for more tests anyway, but we need to adjust the
> number
> > of
> > > > > tests
> > > > > > we enable with android infra's capacity.
> > > > > It's a good thing these tests will only be running on Mac for now.
> > > > > 
> > > > > I went through the 5 bugs you posted - 3 of them were chrome crashes,
> and
> > 2
> > > > were
> > > > > webpage load timeouts (guessing WPR failure?)
> > > > 
> > > > Ok. Let's name these stories: busy_idle_loading_stories.py?
> > > 
> > > There are many names I am okay with but this is not one of them. Busy and
> Idle
> > > have the exact opposite meaning. And the whole point is that there is no
> > loading
> > > involved at all. If you don't like the term "steady state", we can go with
> > > "idle_stories.py". Note that the fact that Chrome has high CPU usage on
> these
> > > sites is almost certainly a chrome bug, rather than a feature of the pages
> > > themselves. Once these bugs are fixed, then they will just be "idle pages"
> > 
> > sgtm.
> 
> sgtm to the current name or idle_stories.py?

as per discussion, going with idle_after_loading_stories.py

nednguyen

On 2016/08/25 15:28:30, erikchen wrote: > On 2016/08/25 15:25:05, nednguyen wrote: > > On 2016/08/25 ...

4 years, 3 months ago (2016-08-25 15:29:49 UTC) #24

On 2016/08/25 15:28:30, erikchen wrote:
> On 2016/08/25 15:25:05, nednguyen wrote:
> > On 2016/08/25 15:22:07, erikchen wrote:
> > > On 2016/08/25 15:19:49, nednguyen wrote:
> > > > On 2016/08/25 15:02:03, erikchen wrote:
> > > > > On 2016/08/25 01:28:40, nednguyen wrote:
> > > > > > On 2016/08/25 01:01:51, erikchen wrote:
> > > > > > > > I think this correlates well with whether the page is idle after
> > load.
> > > I
> > > > > > would
> > > > > > > > expect 
> > > > > > > > pages with high idle CPU usage to have high TimeToInteractive
> > metrics
> > > > > values
> > > > > > > as
> > > > > > > > well.
> > > > > > > 
> > > > > > > I do not have this expectation. From everything we've seen so far,
> > most
> > > > > pages
> > > > > > > with high idle CPU usage are due to too many Timer firings,
> excessive
> > > > > > relayouts,
> > > > > > > etc. See the "Potential Areas for Investigation" section under
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>
https://docs.google.com/document/d/1QlX7BJDjDmHH2BuysnwYA5QqqGOW24bj6OpQzqWHF...
> > > > > > 
> > > > > > Wont't these tasks run on the main threads? If the main threads is
> > quiet,
> > > we
> > > > > > consider the page to be interactive. (see more on
> > > > > >
https://github.com/tdresser/time-to-interactive/blob/master/README.md)
> > > > > > 
> > > > > > To be clear, my main question here is:
> > > > > > "if a page exhibit interesting power problems around loading period,
> > would
> > > > > that
> > > > > > page also be interesting for loading & v8 to track their
performance?"
> > > > > > 
> > > > > > If we believe the nature of these pages are different, I am fine
with
> > not
> > > > > > sharing the pages but we should discuss first. 
> > > > > > 
> > > > > > 
> > > > > > > 
> > > > > > > > The main cost of maintenance is triaging/fixing bugs when Chrome
> > > crashes
> > > > > > when
> > > > > > > it
> > > > > > > > to load these pages. For an ideas of how often that happens,
> > > > > > > > you can look at the number of blocking bugs in
> > > > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726
> > > > > > > 
> > > > > > > I started going through the blocking bugs in
> > > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=589726. I'm
> > > failing
> > > > to
> > > > > > see
> > > > > > > any that are caused by Chrome crashes associated with loading
pages
> > from
> > > > > WPR.
> > > > > > > Could you point out some specific examples?
> > > > > > > 
> > > > > > > There are several possibilities that I see:
> > > > > > > 
> > > > > > > 1) Pages fail to load/quiesce/whatever under WPR. (I've seen this
> > > problem
> > > > > > > before). This is a serious issue, and I could see an argument
about
> > > > wanting
> > > > > to
> > > > > > > have fewer WPR-ed pages to avoid this issue. Although the right
> > solution
> > > > > here
> > > > > > is
> > > > > > > to rely less on WPR.
> > > > > > > 
> > > > > > > 2) Chrome crashes under WPR-ed pages (your claim). This is huge!
If
> > > we're
> > > > > > > catching real Chrome crashes that are making it past the Chrome
CQ,
> we
> > > > > should
> > > > > > > find a set of WPR pages to add to the CQ. I'm sure everyone would
> want
> > > to
> > > > > see
> > > > > > us
> > > > > > > catch more crashes before they make it live.
> > > > > > 
> > > > > > Yes. We already enable running all the system health stories on CQ
and
> > > catch
> > > > a
> > > > > > bunch of Chrome crashes with those. Examples:
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624474
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624587
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=624701
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=612135
> > > > > > https://bugs.chromium.org/p/chromium/issues/detail?id=637230
> > > > > > ...
> > > > > > 
> > > > > > I would want to enable more tests but stip@ from infra team is
having
> > some
> > > > > > concern with telemetry_perf_unittests currently is taking most of
> > > resources
> > > > in
> > > > > > android_chromium_rel_ng. +__+ 
> > > > > > I will advocate for more tests anyway, but we need to adjust the
> number
> > of
> > > > > tests
> > > > > > we enable with android infra's capacity.
> > > > > It's a good thing these tests will only be running on Mac for now.
> > > > > 
> > > > > I went through the 5 bugs you posted - 3 of them were chrome crashes,
> and
> > 2
> > > > were
> > > > > webpage load timeouts (guessing WPR failure?)
> > > > 
> > > > Ok. Let's name these stories: busy_idle_loading_stories.py?
> > > 
> > > There are many names I am okay with but this is not one of them. Busy and
> Idle
> > > have the exact opposite meaning. And the whole point is that there is no
> > loading
> > > involved at all. If you don't like the term "steady state", we can go with
> > > "idle_stories.py". Note that the fact that Chrome has high CPU usage on
> these
> > > sites is almost certainly a chrome bug, rather than a feature of the pages
> > > themselves. Once these bugs are fixed, then they will just be "idle pages"
> > 
> > sgtm.
> 
> sgtm to the current name or idle_stories.py?

Chatted offline, we both agree on idle_after_loading_stories.py

nednguyen

https://codereview.chromium.org/2271793003/diff/40001/tools/perf/page_sets/idle_after_loading_stories.py File tools/perf/page_sets/idle_after_loading_stories.py (right): https://codereview.chromium.org/2271793003/diff/40001/tools/perf/page_sets/idle_after_loading_stories.py#newcode17 tools/perf/page_sets/idle_after_loading_stories.py:17: 'http://www.labradortraininghq.com/labrador-training/how-to-crate-train' nits: can you add comment or bug link ...

4 years, 3 months ago (2016-08-27 00:10:38 UTC) #27

erikchen

https://codereview.chromium.org/2271793003/diff/40001/tools/perf/page_sets/idle_after_loading_stories.py File tools/perf/page_sets/idle_after_loading_stories.py (right): https://codereview.chromium.org/2271793003/diff/40001/tools/perf/page_sets/idle_after_loading_stories.py#newcode17 tools/perf/page_sets/idle_after_loading_stories.py:17: 'http://www.labradortraininghq.com/labrador-training/how-to-crate-train' On 2016/08/27 00:10:38, nednguyen (ooo til 8-29) wrote: ...

4 years, 3 months ago (2016-08-27 00:20:22 UTC) #28

erikchen

The patchset sent to the CQ was uploaded after l-g-t-m from nednguyen@google.com Link to the ...

4 years, 3 months ago (2016-08-27 00:20:27 UTC) #30