Issue 426123002: Implement auto-retry of failed tests for telemetry{_perf}_unittests.

Issue 426123002: Implement auto-retry of failed tests for telemetry{_perf}_unittests. (Closed)

Created:
6 years, 4 months ago by Dirk Pranke

Modified:
6 years, 4 months ago

Reviewers:
dtu, tonyg

CC:
chromium-reviews, telemetry+watch_chromium.org, ghost stip (do not use), dtu, Paweł Hajdan Jr., jam

Base URL:
svn://svn.chromium.org/chrome/trunk/src

Project:
chromium

Visibility:
Public.

More Reviews

Description

Implement auto-retry of failed tests for telemetry{_perf}_unittests. This implements the same basic logic we use in GTest-based tests to retry tests that fail during a run. By default, we will not retry anything (this matches what gtests do when run not in botmode). If --retry-limit is passed on the command line, we will retry each failing test up to that number of times. So, to enable retries on the bots by default they will have to pass this arg. R=dtu@chromium.org, tonyg@chromium.org BUG=398027 Committed: https://src.chromium.org/viewvc/chrome?view=rev&revision=288265

Patch Set 1 : initial patch for review #

Total comments: 11

Patch Set 2 : merge forward to r287841 #

Patch Set 3 : update w/ review feedback #

Created: 6 years, 4 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+79 lines, -31 lines)			Patch
	M	tools/telemetry/telemetry/unittest/json_results.py	View	1 2	3 chunks	+48 lines, -23 lines	0 comments	Download
	M	tools/telemetry/telemetry/unittest/run_tests.py	View	1 2	2 chunks	+31 lines, -8 lines	0 comments	Download

Messages

Total messages: 24 (0 generated)

Expand Messages | Collapse Messages

Dirk Pranke

Paweł, John, Ojan, can you review this and confirm that the logic looks right? Dave ...

6 years, 4 months ago (2014-07-29 23:13:47 UTC) #1

tonyg

lgtm, but please wait for Dave to take a pass too. https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/json_results.py File tools/telemetry/telemetry/unittest/json_results.py (right): ...

6 years, 4 months ago (2014-07-30 00:08:30 UTC) #2

Dirk Pranke

https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/json_results.py File tools/telemetry/telemetry/unittest/json_results.py (right): https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/json_results.py#newcode93 tools/telemetry/telemetry/unittest/json_results.py:93: assert(actuals) On 2014/07/30 00:08:30, tonyg wrote: > Nit: > ...

6 years, 4 months ago (2014-07-30 00:13:29 UTC) #3

tonyg

On 2014/07/30 00:13:29, Dirk Pranke wrote: > https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/json_results.py > File tools/telemetry/telemetry/unittest/json_results.py (right): > > https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/json_results.py#newcode93 ...

6 years, 4 months ago (2014-07-30 01:08:06 UTC) #4

dtu

https://chromiumcodereview.appspot.com/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py File tools/telemetry/telemetry/unittest/run_tests.py (right): https://chromiumcodereview.appspot.com/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py#newcode158 tools/telemetry/telemetry/unittest/run_tests.py:158: failed_tests = json_results.FailedTestNames(result) None of these calls are json-specific, ...

6 years, 4 months ago (2014-07-30 01:31:19 UTC) #5

Dirk Pranke

6 years, 4 months ago (2014-07-30 03:49:39 UTC) #6

Paweł Hajdan Jr.

I defer to other reviewers, I'm not familiar with this code.

6 years, 4 months ago (2014-07-30 11:01:19 UTC) #7

jam

just one comment about the default retries. i'm not qualified to review this as I'm ...

6 years, 4 months ago (2014-07-30 20:44:54 UTC) #8

Dirk Pranke

https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py File tools/telemetry/telemetry/unittest/run_tests.py (right): https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py#newcode129 tools/telemetry/telemetry/unittest/run_tests.py:129: parser.add_option('--retry-limit', type='int', default=0, On 2014/07/30 20:44:54, jam wrote: > ...

6 years, 4 months ago (2014-07-30 21:40:33 UTC) #9

jam

6 years, 4 months ago (2014-07-30 21:42:29 UTC) #10

ojan

On 2014/07/30 at 21:42:29, jam wrote: > https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py > File tools/telemetry/telemetry/unittest/run_tests.py (right): > > https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetry/unittest/run_tests.py#newcode129 ...

6 years, 4 months ago (2014-07-30 23:14:33 UTC) #11

Dirk Pranke

On 2014/07/30 23:14:33, ojan-only-code-yellow-reviews wrote: > On 2014/07/30 at 21:42:29, jam wrote: > > > ...

6 years, 4 months ago (2014-07-30 23:31:08 UTC) #12

jam

On 2014/07/30 23:31:08, Dirk Pranke wrote: > On 2014/07/30 23:14:33, ojan-only-code-yellow-reviews wrote: > > On ...

6 years, 4 months ago (2014-07-31 16:55:09 UTC) #13

Dirk Pranke

On 2014/07/31 16:55:09, jam wrote: > On 2014/07/30 23:31:08, Dirk Pranke wrote: > > On ...

6 years, 4 months ago (2014-07-31 16:59:41 UTC) #14

On 2014/07/31 16:55:09, jam wrote:
> On 2014/07/30 23:31:08, Dirk Pranke wrote:
> > On 2014/07/30 23:14:33, ojan-only-code-yellow-reviews wrote:
> > > On 2014/07/30 at 21:42:29, jam wrote:
> > > >
> > >
> >
>
https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetr...
> > > > File tools/telemetry/telemetry/unittest/run_tests.py (right):
> > > > 
> > > >
> > >
> >
>
https://codereview.chromium.org/426123002/diff/20001/tools/telemetry/telemetr...
> > > > tools/telemetry/telemetry/unittest/run_tests.py:129:
> > > parser.add_option('--retry-limit', type='int', default=0,
> > > > On 2014/07/30 21:40:33, Dirk Pranke wrote:
> > > > > On 2014/07/30 20:44:54, jam wrote:
> > > > > > why not make this 3 to match our other test gtest-based test
> harnesses?
> > > i.e.
> > > > > we
> > > > > > don't need to specify extra flags in recipes to enable test retries.
> > > > > 
> > > > > Good question. As I understand it, the gtest-based tests don't
actually
> > > retry
> > > > > tests by default; you have to specify the --test-launcher-bot-mode
flag
> to
> > > get
> > > > > the retries (or use --test-launcher-retry-limit to specify a number
> > > explicitly).
> > > > > 
> > > > 
> > > > ah, I was not aware of that. nvm then
> > > 
> > > Are you saying tests don't retry when people run them locally?
> > 
> > Correct, not without a flag telling them to do so.
> 
> Ojan is correct to point out that this is a bit confusing. Why don't we make
> this the default for telemetry, and we can switch gtest to also do the same?
> 
> I think the reason we didn't do that in the gtest launcher initially is to
make
> debugging easier. But now the test launcher detects when it's in a debugger
and
> runs in-process anyways, so this isn't needed anymore.

run-webkit-tests actually retries by default unless you specify tests to run
explicitly; the idea is that if you are focused on a particular test, you'd
probably rather see the failure fast rather than wait for retries.

That said, I have no strong opinions on this one way or another.

How about we leave this as-is for now, and change both in a separate patch?

Dirk Pranke

Oh, I would note that we should probably get the input of the people that ...

6 years, 4 months ago (2014-07-31 17:07:35 UTC) #15

tonyg

On 2014/07/31 17:07:35, Dirk Pranke wrote: > Oh, I would note that we should probably ...

6 years, 4 months ago (2014-07-31 17:13:05 UTC) #16

jam

On 2014/07/31 17:13:05, tonyg wrote: > On 2014/07/31 17:07:35, Dirk Pranke wrote: > > Oh, ...

6 years, 4 months ago (2014-08-06 21:17:53 UTC) #17

Dirk Pranke

On 2014/08/06 21:17:53, jam wrote: > i hope my comments aren't holding this up? :) ...

6 years, 4 months ago (2014-08-06 22:27:54 UTC) #18

Dirk Pranke

Okay, I've updated w/ the review feedback. Dave, can you take another look?

6 years, 4 months ago (2014-08-07 01:30:54 UTC) #19

Dirk Pranke

On 2014/08/07 01:30:54, Dirk Pranke wrote: > Okay, I've updated w/ the review feedback. Dave, ...

6 years, 4 months ago (2014-08-07 03:38:07 UTC) #20

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/dpranke@chromium.org/426123002/60001

6 years, 4 months ago (2014-08-07 22:22:04 UTC) #23

Message was sent while issue was closed.

Change committed as 288265

Expand Messages | Collapse Messages