Issue 2308283002: Allow seeding the random layout test order and write out seed.

qyearsley

The CQ bit was checked by qyearsley@chromium.org to run a CQ dry run

4 years, 3 months ago (2016-09-05 00:09:24 UTC) #1

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2308283002/1

4 years, 3 months ago (2016-09-05 00:09:34 UTC) #2

qyearsley

qyearsley@chromium.org changed reviewers: + dpranke@chromium.org, tansell@chromium.org

4 years, 3 months ago (2016-09-05 00:10:56 UTC) #3

qyearsley

qyearsley@chromium.org changed reviewers: + ojan@chromium.org

4 years, 3 months ago (2016-09-05 00:11:28 UTC) #4

qyearsley

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode281 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:281: self._filesystem.write_text_file(path, contents) We could also potentially: - Not write ...

4 years, 3 months ago (2016-09-05 00:15:28 UTC) #5

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 3 months ago (2016-09-05 01:17:14 UTC) #6

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

4 years, 3 months ago (2016-09-05 01:17:14 UTC) #7

Dirk Pranke

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode281 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:281: self._filesystem.write_text_file(path, contents) On 2016/09/05 00:15:28, qyearsley wrote: > We ...

4 years, 3 months ago (2016-09-06 01:19:57 UTC) #8

mithro

Thanks for picking up this CL Quinten. Definitely happy for someone else to get it ...

4 years, 3 months ago (2016-09-06 02:02:25 UTC) #9

Thanks for picking up this CL Quinten. 

Definitely happy for someone else to get it in sooner than I can. I was meaning
to get back to it, but it just hadn't reached the top of my TODO list yet.

Tim 'mithro' Ansell

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py
(right):

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:281:
self._filesystem.write_text_file(path, contents)
On 2016/09/06 01:19:57, Dirk Pranke wrote:
> On 2016/09/05 00:15:28, qyearsley wrote:
> > We could also potentially:
> >  - Not write the seed to the results directory; to reproduce a test order
> would
> > then involve getting the seed that's logged when the test was run.
> >  - Also write the test order. This might sometimes be helpful, but may be
> > unnecessary.
> 
> I would include the seed as a field in the results.json file, along with all
of
> the other records of the test run. I don't think we should write a separate
file
> just for this.
> 
> I would probably also log the seed that was used at the beginning of the run,
as
> part of the printer.print_config() function called at run_webkit_tests.py:544,
> rather than logging it here.

I'm happy with Dirk's suggestions. 

The aim is to be able to find the random seed uses for a given run so it can be
reproduced. This includes both when run on the bots and when run locally.

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
(right):

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:388:
help="Seed to use for random test order. Only applicable in combination with
--order=random."),
For the help you should add what the default value? There are two reasonable
options - current time or a fix value.

ojan

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) I think we should do a ...

4 years, 3 months ago (2016-09-06 16:53:29 UTC) #10

qyearsley

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) On 2016/09/06 at 16:53:29, ojan wrote: ...

4 years, 3 months ago (2016-09-06 17:22:48 UTC) #11

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py
(right):

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262:
self._random_seed = int(time.time())
On 2016/09/06 at 16:53:29, ojan wrote:
> I think we should do a fixed value. Using current time is almost always a
mistake because it's then hard to reproduce flakiness you encounter.

If the seed is output with the results, it should then be possible to reproduce
flakiness that is encountered by passing --order=random --seed=<seed from prior
run>.

One possible advantage of a random seed based on current time by default is that
more time-dependent flakiness would be revealed over the course of multiple test
runs. Although, the order should also change over time as the number of tests
changes, so an order-dependent flaky test shouldn't be able to "hide" for too
long.

Given that, does it still seem like a fixed seed would be a good idea?

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:281:
self._filesystem.write_text_file(path, contents)
On 2016/09/06 at 02:02:25, mithro wrote:
> On 2016/09/06 01:19:57, Dirk Pranke wrote:
> > On 2016/09/05 00:15:28, qyearsley wrote:
> > > We could also potentially:
> > >  - Not write the seed to the results directory; to reproduce a test order
> > would
> > > then involve getting the seed that's logged when the test was run.
> > >  - Also write the test order. This might sometimes be helpful, but may be
> > > unnecessary.
> > 
> > I would include the seed as a field in the results.json file, along with all
of
> > the other records of the test run. I don't think we should write a separate
file
> > just for this.
> > 
> > I would probably also log the seed that was used at the beginning of the
run, as
> > part of the printer.print_config() function called at
run_webkit_tests.py:544,
> > rather than logging it here.
> 
> I'm happy with Dirk's suggestions. 
> 
> The aim is to be able to find the random seed uses for a given run so it can
be reproduced. This includes both when run on the bots and when run locally.

Aye, SGTM

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
(right):

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:388:
help="Seed to use for random test order. Only applicable in combination with
--order=random."),
On 2016/09/06 at 02:02:25, mithro wrote:
> For the help you should add what the default value? There are two reasonable
options - current time or a fix value.

Agreed - will add this.

jeffcarp

jeffcarp@chromium.org changed reviewers: + jeffcarp@chromium.org

4 years, 3 months ago (2016-09-06 17:22:50 UTC) #12

jeffcarp

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) On 2016/09/06 at 16:53:29, ojan wrote: ...

4 years, 3 months ago (2016-09-06 17:22:51 UTC) #13

ojan

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) On 2016/09/06 at 17:22:48, qyearsley wrote: ...

4 years, 3 months ago (2016-09-06 21:27:11 UTC) #14

jeffcarp

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) On 2016/09/06 at 21:27:11, ojan wrote: ...

4 years, 3 months ago (2016-09-06 22:57:30 UTC) #15

ojan

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py (right): https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py#newcode262 third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262: self._random_seed = int(time.time()) On 2016/09/06 at 22:57:30, jeffcarp wrote: ...

4 years, 3 months ago (2016-09-06 23:07:43 UTC) #16

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py
(right):

https://codereview.chromium.org/2308283002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/manager.py:262:
self._random_seed = int(time.time())
On 2016/09/06 at 22:57:30, jeffcarp wrote:
> On 2016/09/06 at 21:27:11, ojan wrote:
> > On 2016/09/06 at 17:22:48, qyearsley wrote:
> > > On 2016/09/06 at 16:53:29, ojan wrote:
> > > > I think we should do a fixed value. Using current time is almost always
a mistake because it's then hard to reproduce flakiness you encounter.
> > > 
> > > If the seed is output with the results, it should then be possible to
reproduce flakiness that is encountered by passing --order=random --seed=<seed
from prior run>.
> > > 
> > > One possible advantage of a random seed based on current time by default
is that more time-dependent flakiness would be revealed over the course of
multiple test runs. Although, the order should also change over time as the
number of tests changes, so an order-dependent flaky test shouldn't be able to
"hide" for too long.
> > > 
> > > Given that, does it still seem like a fixed seed would be a good idea?
> > 
> > Think of this from the perspective of a sheriff trying to figure out why a
failure happened. It will be hard to reason about current time. Admittedly, a
fixed seed is hard to reason about as well, but at least that only changes when
the list of test to run changes.
> 
> If this would make it hard to tell whether a failing test is flaky or
order-dependent, should we investigate other methods to make sure tests are
order-independent? Or what if we started with a small set of tests that are
known to be order-independent, run those in random order, and slowly move more
tests into that bucket?

I think this is a good idea. You probably want it to be a blacklist so that when
you're done moving things you can get rid of the concept of order-dependent
tests. Concretely, we could have an OrderDependentTests file that lets you list
directories or individual tests and the order they should run in. Then you can
remove one directory at a time from that list.

Theoretically, we could even run the order-independent tests on swarming while
we're working on emptying the OrderDependentTest file. I don't know how
complicated that would be to do. If it's easy, I think it'd be a good idea. If
complicated, it might not be worth it.

qyearsley

In the latest patch now, I've now changed it so that: - The seed is ...

4 years, 3 months ago (2016-09-07 18:07:12 UTC) #17

qyearsley

The CQ bit was checked by qyearsley@chromium.org to run a CQ dry run

4 years, 3 months ago (2016-09-07 20:47:35 UTC) #18

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2308283002/40001

4 years, 3 months ago (2016-09-07 20:48:30 UTC) #19

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 3 months ago (2016-09-07 21:59:42 UTC) #20

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

4 years, 3 months ago (2016-09-07 21:59:43 UTC) #21

qyearsley

Ping - does the latest version of this CL look OK to commit?

4 years, 3 months ago (2016-09-09 15:54:51 UTC) #22

Dirk Pranke

lgtm, if you also change it so that we write the seed into the results.json ...

4 years, 3 months ago (2016-09-09 19:50:23 UTC) #23

qyearsley

On 2016/09/09 at 19:50:23, dpranke wrote: > lgtm, if you also change it so that ...

4 years, 3 months ago (2016-09-09 21:26:39 UTC) #24

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2308283002/60001

4 years, 3 months ago (2016-09-12 16:19:30 UTC) #27

commit-bot: I haz the power

Description was changed from ========== Support specifying a seed for pseudo-random test order. This is ...

4 years, 3 months ago (2016-09-12 17:42:46 UTC) #29

commit-bot: I haz the power

4 years, 3 months ago (2016-09-12 17:42:47 UTC) #30

Message was sent while issue was closed.

Patchset 4 (id:??) landed as
https://crrev.com/fa5c50adb9b096701b9c0e1e9eee458665484bc5
Cr-Commit-Position: refs/heads/master@{#417968}

Issue 2308283002: Allow seeding the random layout test order and write out seed. (Closed)

Description

Patch Set 1 #

Patch Set 2 : Use fixed seed by default; move logging of random seed to Printer; remove writing of file. #

Patch Set 3 : Remove incomplete docstring change #

Patch Set 4 : Add random order seed to results file. #

Messages