Issue 23717016: Add cpu_stats for the browser

	Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+74 lines, -0 lines)			Patch
M	tools/telemetry/telemetry/core/browser.py	View	1 2 3 4 5 6	1 chunk	+29 lines, -0 lines	0 comments	Download
M	tools/telemetry/telemetry/core/platform/linux_platform_backend.py	View	1 2 3 4 5	1 chunk	+8 lines, -0 lines	0 comments	Download
M	tools/telemetry/telemetry/core/platform/platform_backend.py	View	1 2 3 4 5	1 chunk	+6 lines, -0 lines	0 comments	Download
M	tools/telemetry/telemetry/core/platform/proc_util.py	View	1 2 3 4 5	2 chunks	+31 lines, -0 lines	0 comments	Download

Messages

Total messages: 25 (0 generated)

Expand Messages | Collapse Messages

edmundyan

How's this? One issue is that the browser process is not affected by the javascript ...

7 years, 3 months ago (2013-09-03 17:12:32 UTC) #1

nduca

looks solid, lgtm on my part. seems like the test cases are gonna fail on ...

7 years, 3 months ago (2013-09-03 22:53:15 UTC) #3

edmundyan

On 2013/09/03 22:53:15, nduca wrote: > looks solid, lgtm on my part. seems like the ...

7 years, 3 months ago (2013-09-04 00:23:08 UTC) #4

tonyg

https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry/core/platform/linux_platform_backend.py File tools/telemetry/telemetry/core/platform/linux_platform_backend.py (right): https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry/core/platform/linux_platform_backend.py#newcode35 tools/telemetry/telemetry/core/platform/linux_platform_backend.py:35: cpu_load = p.get_cpu_percent(1) I don't think this is what ...

7 years, 3 months ago (2013-09-05 00:11:41 UTC) #5

edmundyan

Would making cpu_load a metric make more sense then? It seems to follow better with ...

7 years, 3 months ago (2013-09-05 01:05:44 UTC) #6

nduca

tonyg, i'm not sure i agree with you on the api thing. I'm pretty sure ...

7 years, 3 months ago (2013-09-05 01:24:26 UTC) #7

tonyg

On 2013/09/05 01:24:26, nduca wrote: > tonyg, i'm not sure i agree with you on ...

7 years, 3 months ago (2013-09-05 04:37:30 UTC) #8

tonyg

On 2013/09/05 01:05:44, edmundyan wrote: > Would making cpu_load a metric make more sense then? ...

7 years, 3 months ago (2013-09-05 04:43:40 UTC) #9

On 2013/09/05 01:05:44, edmundyan wrote:
> Would making cpu_load a metric make more sense then?  It seems to follow
better
> with the start/stop idea you have.
> 
>
https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry...
> File tools/telemetry/telemetry/core/platform/linux_platform_backend.py
(right):
> 
>
https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry...
> tools/telemetry/telemetry/core/platform/linux_platform_backend.py:35: cpu_load
=
> p.get_cpu_percent(1)
> On 2013/09/05 00:11:41, tonyg wrote:
> > I don't think this is what we want. This will block for 1 second while it
> > measures CPU usage.
> > 
> > Also, we currently don't have psutil available on non-linux platforms and
even
> > if we installed psutil on the other platforms, this will not work on android
> > because there Telemetry runs on the host machine instead of the target
device.
> > 
> > Instead, what I think we want is an API that allows us to start the CPU
> sampling
> > period without blocking, then query again later and get the average CPU
usage
> > over the sampling period.
> > 
> > One way to grab the data we need on linux, android and cros is to scrape the
> CPU
> > time counters from /proc/pid/stat (see GetMemoryStats), and I think there
are
> > equivalent ways of doing that on mac and windows (similar to their
> > GetMemoryStats methods).
> 
> Would making cpu_load a metric make more sense then?  It seems to follow
better
> with the start/stop idea you have.

Yes, this should be a Metric. But platform needs to take care of the OS specific
bits.

The way this works is that /proc/<pid>/stat (or win/mac equivalent) keeps track
of the number of cycles that have been used by that pid. We read the values once
at the beginning along with a timestamp. Then we read them again at the end with
a timestamp. The utilization percent is then something like (end_cycles -
start_cycles) / (end_timestamp - start_timestamp) * CYCLES_PER_SECOND.

So the best way for this to work is for the platform method to return the
current cycle's used by the process and the timestamp. Then the Metric can do
the math upon stopping.

edmundyan

On 2013/09/05 04:43:40, tonyg wrote: > On 2013/09/05 01:05:44, edmundyan wrote: > > Would making ...

7 years, 3 months ago (2013-09-05 04:59:45 UTC) #10

On 2013/09/05 04:43:40, tonyg wrote:
> On 2013/09/05 01:05:44, edmundyan wrote:
> > Would making cpu_load a metric make more sense then?  It seems to follow
> better
> > with the start/stop idea you have.
> > 
> >
>
https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry...
> > File tools/telemetry/telemetry/core/platform/linux_platform_backend.py
> (right):
> > 
> >
>
https://codereview.chromium.org/23717016/diff/16001/tools/telemetry/telemetry...
> > tools/telemetry/telemetry/core/platform/linux_platform_backend.py:35:
cpu_load
> =
> > p.get_cpu_percent(1)
> > On 2013/09/05 00:11:41, tonyg wrote:
> > > I don't think this is what we want. This will block for 1 second while it
> > > measures CPU usage.
> > > 
> > > Also, we currently don't have psutil available on non-linux platforms and
> even
> > > if we installed psutil on the other platforms, this will not work on
android
> > > because there Telemetry runs on the host machine instead of the target
> device.
> > > 
> > > Instead, what I think we want is an API that allows us to start the CPU
> > sampling
> > > period without blocking, then query again later and get the average CPU
> usage
> > > over the sampling period.
> > > 
> > > One way to grab the data we need on linux, android and cros is to scrape
the
> > CPU
> > > time counters from /proc/pid/stat (see GetMemoryStats), and I think there
> are
> > > equivalent ways of doing that on mac and windows (similar to their
> > > GetMemoryStats methods).
> > 
> > Would making cpu_load a metric make more sense then?  It seems to follow
> better
> > with the start/stop idea you have.
> 
> Yes, this should be a Metric. But platform needs to take care of the OS
specific
> bits.
> 
> The way this works is that /proc/<pid>/stat (or win/mac equivalent) keeps
track
> of the number of cycles that have been used by that pid. We read the values
once
> at the beginning along with a timestamp. Then we read them again at the end
with
> a timestamp. The utilization percent is then something like (end_cycles -
> start_cycles) / (end_timestamp - start_timestamp) * CYCLES_PER_SECOND.
> 
> So the best way for this to work is for the platform method to return the
> current cycle's used by the process and the timestamp. Then the Metric can do
> the math upon stopping.

Cool, that's exactly what I had implemented (I think)!  Here's a linux version. 
Will add other platforms if this is what you want.  'CpuPercent' is a bit
confusing now, as it calculates the avg CPU usage throughout the whole pid's
runtime (which for the unittest includes a bunch of the startup time).  It still
passes though.

tonyg

This is great, exactly what I had in mind. A couple of last nits and ...

7 years, 3 months ago (2013-09-05 15:20:13 UTC) #11

edmundyan

https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry/core/platform/proc_util.py File tools/telemetry/telemetry/core/platform/proc_util.py (right): https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry/core/platform/proc_util.py#newcode42 tools/telemetry/telemetry/core/platform/proc_util.py:42: hertz = float(os.sysconf(os.sysconf_names['SC_CLK_TCK'])) or 100 On 2013/09/05 15:20:14, tonyg ...

7 years, 3 months ago (2013-09-05 18:11:18 UTC) #12

edmundyan

On 2013/09/05 18:11:18, edmundyan wrote: > https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry/core/platform/proc_util.py > File tools/telemetry/telemetry/core/platform/proc_util.py (right): > > https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry/core/platform/proc_util.py#newcode42 > ...

7 years, 3 months ago (2013-09-06 01:54:01 UTC) #13

On 2013/09/05 18:11:18, edmundyan wrote:
>
https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry...
> File tools/telemetry/telemetry/core/platform/proc_util.py (right):
> 
>
https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry...
> tools/telemetry/telemetry/core/platform/proc_util.py:42: hertz =
> float(os.sysconf(os.sysconf_names['SC_CLK_TCK'])) or 100
> On 2013/09/05 15:20:14, tonyg wrote:
> > Is there a way we can get the clock tick from proc instead of os? As this is
> > written, this method could be shared on android except for this line
(because
> it
> > will return the host's frequency).
> From some googling, not easily.  Atleast there is no direct way to access it
> from /proc.  The only way I found is to take a jiffy sample before/after and
> calculate the average jiffies/second.  This can be done using:
> 
> date +"%s.%N" && grep '^jiffies' /proc/timer_list
> 
> You would think /proc/uptime would work, but I'm not getting 100 from it.
> 
> For android/mac (and actually all systems) we can use ps instead.
> ps -p <pid> -o cputime,etime
> 
> And just convert "[dd-]hh:mm:ss" to seconds.  It might be cleaner having all
> non-windows platforms do it this way.
> 
>
https://codereview.chromium.org/23717016/diff/30001/tools/telemetry/telemetry...
> tools/telemetry/telemetry/core/platform/proc_util.py:50: return {'CpuPercent':
> 100 * cpu_process_time_secs / total_time_secs,
> On 2013/09/05 15:20:14, tonyg wrote:
> > I don't think we should include this line because we'll never actually want
to
> > know the CPU percent over the entire process lifetime. We'll only want to
know
> > it over the course of the duration we are measuring.
> > 
> > The second two fields give us exactly the info needed to calculate that.
> Shall we remove the browser unittest then?
Actually, ignore that first paragraph. There IS a way to use /proc for
everything.  We can just skip converting back to seconds and store everything as
jiffies since we only care about %CPU in the end.

I ran some crude tests, and they oddly output consistently different results. 
After running high_cpu.html for 5 seconds a few times, the avg cpu load over
that span were:

Jiffies%:
Renderer: 8.745
Browser: 2.19

Seconds%:
Renderer: 9.2032
Browser: 1.62

They both give a good ballpark, but not sure which one is more correct.  Which
one do you like better?  I guess if we are using jiffies-version for android, we
should stay with that on cros/linux.

The 'ps -p <pid> -o cputime,etime' option I mention above is actually not very
optimal as the lowest denomination is seconds, so we won't get good precision
unless we run the process quite long.  Not sure how else we will do it on OSX
though.

tl;dr Tony: Added new patch.  Lmk which method you like better.

tonyg

If only jiffies works on android, then let's go with that everywhere. After that change, ...

7 years, 3 months ago (2013-09-06 02:24:24 UTC) #14

edmundyan

On 2013/09/06 02:24:24, tonyg wrote: > If only jiffies works on android, then let's go ...

7 years, 3 months ago (2013-09-06 17:39:16 UTC) #15

tonyg

lgtm https://codereview.chromium.org/23717016/diff/58001/tools/telemetry/telemetry/core/platform/proc_util.py File tools/telemetry/telemetry/core/platform/proc_util.py (right): https://codereview.chromium.org/23717016/diff/58001/tools/telemetry/telemetry/core/platform/proc_util.py#newcode34 tools/telemetry/telemetry/core/platform/proc_util.py:34: return 0 Should this raise?

7 years, 3 months ago (2013-09-06 17:42:15 UTC) #16

edmundyan

Sorry, actually found a bug. If we grab cpu_stats when the # of Renderer processes ...

7 years, 3 months ago (2013-09-06 21:24:50 UTC) #18

tonyg

https://codereview.chromium.org/23717016/diff/65001/tools/telemetry/telemetry/core/browser.py File tools/telemetry/telemetry/core/browser.py (right): https://codereview.chromium.org/23717016/diff/65001/tools/telemetry/telemetry/core/browser.py#newcode189 tools/telemetry/telemetry/core/browser.py:189: for process_type in ['Browser', 'Renderer']: Why restrict to Browser/Renderer?

7 years, 3 months ago (2013-09-06 21:35:48 UTC) #19

edmundyan

7 years, 3 months ago (2013-09-06 22:21:36 UTC) #20

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/edmundyan@chromium.org/23717016/74001

7 years, 3 months ago (2013-09-06 22:28:19 UTC) #22

commit-bot: I haz the power

Commit queue rejected this change because the description was changed between the time the change ...

7 years, 3 months ago (2013-09-07 04:20:32 UTC) #23

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/edmundyan@chromium.org/23717016/74001

7 years, 3 months ago (2013-09-07 04:42:08 UTC) #24

Message was sent while issue was closed.

Change committed as 221897

Expand Messages | Collapse Messages

Issue 23717016: Add cpu_stats for the browser (Closed)

Description

Patch Set 1 #

Patch Set 2 : . #

Patch Set 3 : Using /proc to get current CPU counters #

Patch Set 4 : Get jiffies from timer_list #

Patch Set 5 : Finalize jiffies. Remove unittests #

Patch Set 6 : Only store the timestamp once instead of summing it #

Patch Set 7 : . #

Messages