Issue 2451553006: [Common] Move WaitFor from telemetry to common/py_utils

rnephew (Reviews Here)

Patchset #1 (id:1) has been deleted

4 years, 1 month ago (2016-10-25 21:53:19 UTC) #1

rnephew (Reviews Here)

rnephew@chromium.org changed reviewers: + charliea@chromium.org, nednguyen@google.com

4 years, 1 month ago (2016-10-25 21:53:56 UTC) #2

rnephew (Reviews Here)

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/py_utils_unittest.py File common/py_utils/py_utils/py_utils_unittest.py (right): https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/py_utils_unittest.py#newcode37 common/py_utils/py_utils/py_utils_unittest.py:37: def _ReturnCounterBasedValue(self): Anyone have a suggestion on a better ...

4 years, 1 month ago (2016-10-25 21:53:57 UTC) #3

charliea (OOO until 10-5)

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py File common/py_utils/py_utils/__init__.py (right): https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py#newcode101 common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1 Should these be constants? https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py#newcode101 common/py_utils/py_utils/__init__.py:101: ...

4 years, 1 month ago (2016-10-26 15:19:33 UTC) #4

rnephew (Reviews Here)

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py File common/py_utils/py_utils/__init__.py (right): https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py#newcode101 common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1 On 2016/10/26 15:19:32, charliea wrote: > ...

4 years, 1 month ago (2016-10-26 16:26:05 UTC) #5

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
File common/py_utils/py_utils/__init__.py (right):

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1
On 2016/10/26 15:19:32, charliea wrote:
> Should these be constants?

Done.

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1
On 2016/10/26 15:19:32, charliea wrote:
> It's probably worth attaching units to these (MIN_POLL_INTERVAL_IN_SECONDS) or
> something like that
> 
> see:
>
https://cs.corp.google.com/search/?q=p:github+f:%5Ecatapult-project/catapult/...

Done.

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:105: def GetConditionString():
On 2016/10/26 15:19:32, charliea wrote:
> I'm confused: what's going on here?

I just c/p the old code from telemetry and tested that it worked as expected.. I
dont fully know what this code is for.

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:115: while True:
On 2016/10/26 15:19:32, charliea wrote:
> Maybe this could be more simply expressed as:
> 
> condition_met = False
> while elapsed_time < timeout:
>   if condition():
>     return
> 
>   # Sleep until next poll...
> 
> raise TimeoutException(...)
> 
> I'm generally against infinite loops (where possible), because:
> 
> 1) I think putting the termination condition in the normal place makes it more
> obvious what that termination condition is
> 
> 2) I think that it's easier to accidentally create actual infinite loops with
> them

Done.

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
File common/py_utils/py_utils/py_utils_unittest.py (right):

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/py_utils_unittest.py:37: def
_ReturnCounterBasedValue(self):
On 2016/10/26 15:19:32, charliea wrote:
> On 2016/10/25 21:53:56, rnephew (Reviews Here) wrote:
> > Anyone have a suggestion on a better implementation of a condition that will
> > start out false and eventually return true?
> 
> I think this one is okay, but I might put the counter variable down in the
test
> and make this function local to that test. Having the counter  be a member of
> the class feels like we're leaking implementation details unnecessarily

Done, but to make the variable live in the test function, it needs to be passed
as either a list or dict because of strange python scoping issues.

https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
common/py_utils/py_utils/py_utils_unittest.py:47:
py_utils.WaitFor(self._ReturnFalse, 1)
On 2016/10/26 15:19:32, charliea wrote:
> I wonder if isn't clearer to just have:
> 
> py_utils.WaitFor(lambda: False, 1)

I actually am adding some lambda based tests. Based on  GetConditionString I
think the code I'm porting over handles lambdas differently.

nednguyen

On 2016/10/26 16:26:05, rnephew (Reviews Here) wrote: > https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_utils/__init__.py > File common/py_utils/py_utils/__init__.py (right): > > ...

4 years, 1 month ago (2016-10-26 16:39:38 UTC) #6

On 2016/10/26 16:26:05, rnephew (Reviews Here) wrote:
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> File common/py_utils/py_utils/__init__.py (right):
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1
> On 2016/10/26 15:19:32, charliea wrote:
> > Should these be constants?
> 
> Done.
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/__init__.py:101: min_poll_interval = 0.1
> On 2016/10/26 15:19:32, charliea wrote:
> > It's probably worth attaching units to these (MIN_POLL_INTERVAL_IN_SECONDS)
or
> > something like that
> > 
> > see:
> >
>
https://cs.corp.google.com/search/?q=p:github+f:%5Ecatapult-project/catapult/...
> 
> Done.
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/__init__.py:105: def GetConditionString():
> On 2016/10/26 15:19:32, charliea wrote:
> > I'm confused: what's going on here?
> 
> I just c/p the old code from telemetry and tested that it worked as expected..
I
> dont fully know what this code is for.
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/__init__.py:115: while True:
> On 2016/10/26 15:19:32, charliea wrote:
> > Maybe this could be more simply expressed as:
> > 
> > condition_met = False
> > while elapsed_time < timeout:
> >   if condition():
> >     return
> > 
> >   # Sleep until next poll...
> > 
> > raise TimeoutException(...)
> > 
> > I'm generally against infinite loops (where possible), because:
> > 
> > 1) I think putting the termination condition in the normal place makes it
more
> > obvious what that termination condition is
> > 
> > 2) I think that it's easier to accidentally create actual infinite loops
with
> > them
> 
> Done.
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> File common/py_utils/py_utils/py_utils_unittest.py (right):
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/py_utils_unittest.py:37: def
> _ReturnCounterBasedValue(self):
> On 2016/10/26 15:19:32, charliea wrote:
> > On 2016/10/25 21:53:56, rnephew (Reviews Here) wrote:
> > > Anyone have a suggestion on a better implementation of a condition that
will
> > > start out false and eventually return true?
> > 
> > I think this one is okay, but I might put the counter variable down in the
> test
> > and make this function local to that test. Having the counter  be a member
of
> > the class feels like we're leaking implementation details unnecessarily
> 
> Done, but to make the variable live in the test function, it needs to be
passed
> as either a list or dict because of strange python scoping issues.
> 
>
https://codereview.chromium.org/2451553006/diff/20001/common/py_utils/py_util...
> common/py_utils/py_utils/py_utils_unittest.py:47:
> py_utils.WaitFor(self._ReturnFalse, 1)
> On 2016/10/26 15:19:32, charliea wrote:
> > I wonder if isn't clearer to just have:
> > 
> > py_utils.WaitFor(lambda: False, 1)
> 
> I actually am adding some lambda based tests. Based on  GetConditionString I
> think the code I'm porting over handles lambdas differently.

Can you also update the code in Telemetry to use this one?

charliea (OOO until 10-5)

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_utils/__init__.py File common/py_utils/py_utils/__init__.py (right): https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_utils/__init__.py#newcode97 common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout): It seems weird that we're both ...

4 years, 1 month ago (2016-10-26 17:12:57 UTC) #7

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
File common/py_utils/py_utils/__init__.py (right):

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout):
It seems weird that we're both waiting for |condition| to return True and also
returning the value of |condition|. Can't we assume that if this function
doesn't throw an exception that the return of |condition| is True? The
condition() function seems inherently unfit to pass back data to the caller,
because it can't pass back all values (because some value needs to be reserved
for stopping the wait).

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout):
Suggestion: a more common way to solve the problem of wanting to do something
less as time progresses is to use something called "exponential backoff"
(https://en.wikipedia.org/wiki/Exponential_backoff). 

The basic idea is that you multiply the poll delay by some factor after every
failure. So with an initial wait of 1 second and an exponential factor of 2, the
first failure would result in a wait of 1 * 2^0 seconds (1 second), the second
failure would result in a wait of 1 * 2^1 seconds (2 seconds), the third failure
would result in a wait of 1 * 2^2 seconds (4 seconds), etc. until you hit the
timeout.

Generally, exponential backoff algorithms allow you to specify an initial wait
time, an exponential factor, and a max wait time. I think that all of those
would be good things to expose here, because it's going to be hard to anticipate
how callers would want to use this code.

The advantage of this is that it's customizable depending on whether you're
trying to wait for something that's expected to happen on the order of
microseconds or on the order of hours, whereas the current implementation
doesn't really have that flexibility.

It's worth looking quickly at how the "retrying" apache library handles it here,
too: https://pypi.python.org/pypi/retrying. We probably don't need anything that
complex here, but it's good to at least see.

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:127: poll_interval = min(max(elapsed_time /
10., MIN_POLL_INTERVAL_IN_SECONDS),
Maybe use Telemetry's existing clamp function for this?
https://cs.corp.google.com/github/catapult-project/catapult/telemetry/telemet...

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
File common/py_utils/py_utils/py_utils_unittest.py (right):

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/py_utils_unittest.py:45: # Use list to pass to inner
function because of strange python scoping.
Maybe clarify "because of strange Python scoping" to something like "in order to
allow a lambda function to modify a variable from the outer scope."?

rnephew (Reviews Here)

The CQ bit was checked by rnephew@chromium.org to run a CQ dry run

4 years, 1 month ago (2016-10-26 17:28:11 UTC) #8

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2451553006/80001

4 years, 1 month ago (2016-10-26 17:28:17 UTC) #9

rnephew (Reviews Here)

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_utils/__init__.py File common/py_utils/py_utils/__init__.py (right): https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_utils/__init__.py#newcode97 common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout): On 2016/10/26 17:12:56, charliea wrote: > ...

4 years, 1 month ago (2016-10-26 17:28:42 UTC) #10

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
File common/py_utils/py_utils/__init__.py (right):

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout):
On 2016/10/26 17:12:56, charliea wrote:
> Suggestion: a more common way to solve the problem of wanting to do something
> less as time progresses is to use something called "exponential backoff"
> (https://en.wikipedia.org/wiki/Exponential_backoff). 
> 
> The basic idea is that you multiply the poll delay by some factor after every
> failure. So with an initial wait of 1 second and an exponential factor of 2,
the
> first failure would result in a wait of 1 * 2^0 seconds (1 second), the second
> failure would result in a wait of 1 * 2^1 seconds (2 seconds), the third
failure
> would result in a wait of 1 * 2^2 seconds (4 seconds), etc. until you hit the
> timeout.
> 
> Generally, exponential backoff algorithms allow you to specify an initial wait
> time, an exponential factor, and a max wait time. I think that all of those
> would be good things to expose here, because it's going to be hard to
anticipate
> how callers would want to use this code.
> 
> The advantage of this is that it's customizable depending on whether you're
> trying to wait for something that's expected to happen on the order of
> microseconds or on the order of hours, whereas the current implementation
> doesn't really have that flexibility.
> 
> It's worth looking quickly at how the "retrying" apache library handles it
here,
> too: https://pypi.python.org/pypi/retrying. We probably don't need anything
that
> complex here, but it's good to at least see.

Ned, do you know why the current version was implemented this way instead of
will exponential backoff?

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:97: def WaitFor(condition, timeout):
oobe and ios_browser_backend rely on this return.

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/__init__.py:127: poll_interval = min(max(elapsed_time /
10., MIN_POLL_INTERVAL_IN_SECONDS),
On 2016/10/26 17:12:56, charliea wrote:
> Maybe use Telemetry's existing clamp function for this?
>
https://cs.corp.google.com/github/catapult-project/catapult/telemetry/telemet...

This part of the repo can't rely on telemetry.

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
File common/py_utils/py_utils/py_utils_unittest.py (right):

https://codereview.chromium.org/2451553006/diff/40001/common/py_utils/py_util...
common/py_utils/py_utils/py_utils_unittest.py:45: # Use list to pass to inner
function because of strange python scoping.
On 2016/10/26 17:12:56, charliea wrote:
> Maybe clarify "because of strange Python scoping" to something like "in order
to
> allow a lambda function to modify a variable from the outer scope."?

Done.

rnephew (Reviews Here)

Patchset 3 moves telemetry to use the version in py_utils, running tryjob on that to ...

4 years, 1 month ago (2016-10-26 17:29:19 UTC) #11

nednguyen

On 2016/10/26 17:29:19, rnephew (Reviews Here) wrote: > Patchset 3 moves telemetry to use the ...

4 years, 1 month ago (2016-10-26 17:38:43 UTC) #12

nednguyen

Description was changed from ========== [Common] Move WaitFor from telemetry to common/py_utils BUG=catapult:#2955 ========== to ...

4 years, 1 month ago (2016-10-26 17:39:04 UTC) #13

nednguyen

nednguyen@google.com changed reviewers: + dtu@chromium.org

4 years, 1 month ago (2016-10-26 17:39:04 UTC) #14

nednguyen

On 2016/10/26 17:38:43, nednguyen wrote: > On 2016/10/26 17:29:19, rnephew (Reviews Here) wrote: > > ...

4 years, 1 month ago (2016-10-26 17:39:32 UTC) #15

nednguyen

https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/py_utils_unittest.py File common/py_utils/py_utils/py_utils_unittest.py (right): https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/py_utils_unittest.py#newcode29 common/py_utils/py_utils/py_utils_unittest.py:29: @mock.patch('time.sleep', mock.Mock) Instead of use mock, can you just ...

4 years, 1 month ago (2016-10-26 17:42:12 UTC) #16

rnephew (Reviews Here)

> I think we should only one step at a time. The steps are: > ...

4 years, 1 month ago (2016-10-26 17:54:22 UTC) #17

rnephew (Reviews Here)

https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/py_utils_unittest.py File common/py_utils/py_utils/py_utils_unittest.py (right): https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/py_utils_unittest.py#newcode29 common/py_utils/py_utils/py_utils_unittest.py:29: @mock.patch('time.sleep', mock.Mock) On 2016/10/26 17:42:12, nednguyen wrote: > Instead ...

4 years, 1 month ago (2016-10-26 17:54:29 UTC) #18

nednguyen

On 2016/10/26 17:54:22, rnephew (Reviews Here) wrote: > > I think we should only one ...

4 years, 1 month ago (2016-10-26 17:56:50 UTC) #19

nednguyen

https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/__init__.py File common/py_utils/py_utils/__init__.py (right): https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_utils/__init__.py#newcode124 common/py_utils/py_utils/__init__.py:124: logging.info('Continuing to wait %ds for %s. Elapsed: %ds.', timeout, ...

4 years, 1 month ago (2016-10-26 18:00:07 UTC) #20

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 1 month ago (2016-10-26 18:01:48 UTC) #21

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: Catapult Linux Tryserver on master.tryserver.client.catapult (JOB_FAILED, https://build.chromium.org/p/tryserver.client.catapult/builders/Catapult%20Linux%20Tryserver/builds/5445)

4 years, 1 month ago (2016-10-26 18:01:49 UTC) #22

rnephew (Reviews Here)

I undid the changes, see comment on the one file left about why its there. ...

4 years, 1 month ago (2016-10-26 18:12:48 UTC) #23

I undid the changes, see comment on the one file left about why its there.

https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_util...
File common/py_utils/py_utils/py_utils_unittest.py (right):

https://codereview.chromium.org/2451553006/diff/80001/common/py_utils/py_util...
common/py_utils/py_utils/py_utils_unittest.py:29: @mock.patch('time.sleep',
mock.Mock)
On 2016/10/26 18:00:07, nednguyen wrote:
> On 2016/10/26 17:54:29, rnephew (Reviews Here) wrote:
> > On 2016/10/26 17:42:12, nednguyen wrote:
> > > Instead of use mock, can you just use a testing strategy similar to
> > >
> >
>
https://github.com/catapult-project/catapult/blob/master/telemetry/telemetry/...
> > > ?
> > > 
> > > I don't like mock very much because they usually draw people to test the
> > > implementation instead of test the interface.
> > 
> > This is only to mock out the sleep method in time, so that when its called
it
> > doens't _actually_ sleep for that time.
> 
> If these tests don't invoke the real time.sleep(), that means we don't
actually
> test the time.time() comparison logic?

The time.time() calls would still yield real results, only the sleep() method is
mocked. The logic is still tested, just not much time will have elapsed because
of time.sleep() time not actually elapsing. The mocking in the negative case is
not required, since it will always have to wait until the end, but I will lower
the ammount of time it is waiting to speed the tests up.

https://codereview.chromium.org/2451553006/diff/100001/telemetry/telemetry/in...
File telemetry/telemetry/internal/results/html_output_formatter.py (right):

https://codereview.chromium.org/2451553006/diff/100001/telemetry/telemetry/in...
telemetry/telemetry/internal/results/html_output_formatter.py:15: from
telemetry.internal.results import html2_output_formatter
Uh, to undo my old changes I did a git checkout origin/master -- <file names>

I assume these changes originate from that command. I did not make these.

nednguyen

lgtm I am fine with either implement the wait as-is or using exponential back-off https://codereview.chromium.org/2451553006/diff/120001/common/py_utils/py_utils/__init__.py ...

4 years, 1 month ago (2016-10-26 18:59:40 UTC) #24

rnephew (Reviews Here)

Killed the logging, I'm going to land it with the current timing implementation, and when ...

4 years, 1 month ago (2016-10-26 19:02:38 UTC) #25

rnephew (Reviews Here)

The CQ bit was checked by rnephew@chromium.org

4 years, 1 month ago (2016-10-26 19:02:52 UTC) #26

rnephew (Reviews Here)

The patchset sent to the CQ was uploaded after l-g-t-m from nednguyen@google.com Link to the ...

4 years, 1 month ago (2016-10-26 19:02:52 UTC) #27

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2451553006/160001

4 years, 1 month ago (2016-10-26 19:03:01 UTC) #28

commit-bot: I haz the power

Description was changed from ========== [Common] Move WaitFor from telemetry to common/py_utils BUG=catapult:#2955 ========== to ...

4 years, 1 month ago (2016-10-26 19:30:35 UTC) #29

commit-bot: I haz the power

4 years, 1 month ago (2016-10-26 19:30:36 UTC) #30

Message was sent while issue was closed.

Committed patchset #8 (id:160001) as
https://chromium.googlesource.com/external/github.com/catapult-project/catapu...

Issue 2451553006: [Common] Move WaitFor from telemetry to common/py_utils (Closed)

Description

Patch Set 1 : [Common] Move WaitFor from telemetry to common/py_utils #

Patch Set 2 : [Common] Move WaitFor from telemetry to common/py_utils #

Patch Set 3 : Make telemetry calls use new version #

Patch Set 4 : Charlies comments #

Patch Set 5 : undo telemetry changes #

Patch Set 6 : change times in unittetss #

Patch Set 7 : rebase #

Patch Set 8 : nix logging #

Messages