|
|
Description

Adding logic to restart the browser if there is an exception
in the setUp of GpuIntegrationTest
Note: I am not certain there is a good way to test the setUp
functionality of GpuIntegrationTest in a unittest. Given that
we are using Fakes, I have added a hack to simulate throwing
an error in the setUp method, but it is not how it would behave
in practice. Any suggestions for a better way to unittest this
are appreciated.
Dependent on https://codereview.chromium.org/2148283003 landing in telemetry first
BUG=628022
CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel
Committed: https://crrev.com/202a09250ad4141be8a80cd63881998d8e821f09
Cr-Commit-Position: refs/heads/master@{#407022}
Patch Set 1
Patch Set 2: Adding removed logging statement
Total comments: 9
Patch Set 3: Updating where the crash happens for the unittest
Total comments: 6
Patch Set 4: Adding exception logging on failure
Total comments: 6
Patch Set 5: Raising error after restarting the browser
Patch Set 6: Syncing client
Messages
Total messages: 52 (15 generated)
Description was changed from

==========
Adding logic to restart the browser if there is an exception
in the setUp of GpuIntegrationTest

Note: I am not certain there is a good way to test the setUp
functionality of GpuIntegrationTest in a unittest. Given that
we are using Fakes, I have added a hack to simulate throwing
an error in the setUp method, but it is not how it would behave
in practice. Any suggestions for a better way to unittest this
are appreciated.

BUG=628022
==========

to

==========
Adding logic to restart the browser if there is an exception
in the setUp of GpuIntegrationTest

Note: I am not certain there is a good way to test the setUp
functionality of GpuIntegrationTest in a unittest. Given that
we are using Fakes, I have added a hack to simulate throwing
an error in the setUp method, but it is not how it would behave
in practice. Any suggestions for a better way to unittest this
are appreciated.

BUG=628022
CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel
==========
Description was changed from

==========
Adding logic to restart the browser if there is an exception
in the setUp of GpuIntegrationTest

Note: I am not certain there is a good way to test the setUp
functionality of GpuIntegrationTest in a unittest. Given that
we are using Fakes, I have added a hack to simulate throwing
an error in the setUp method, but it is not how it would behave
in practice. Any suggestions for a better way to unittest this
are appreciated.

BUG=628022
CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel
==========

to

==========
Adding logic to restart the browser if there is an exception
in the setUp of GpuIntegrationTest

Note: I am not certain there is a good way to test the setUp
functionality of GpuIntegrationTest in a unittest. Given that
we are using Fakes, I have added a hack to simulate throwing
an error in the setUp method, but it is not how it would behave
in practice. Any suggestions for a better way to unittest this
are appreciated.

Dependent on https://codereview.chromium.org/2148283003 landing in telemetry first

BUG=628022
CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel
==========
eyaich@google.com changed reviewers: + kbr@chromium.org, nednguyen@google.com
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup')
We should still add "raise" after this so the exception bubbles up & causes a test failure.
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test_unittest.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test_unittest.py:39: self.tab.Navigate('chrome://crash') Per comments in https://codereview.chromium.org/2148283003 , what if this were rewritten to conditionally navigate to chrome://inducebrowsercrashforrealz , and only then call super(...).setUp()? Would that more closely match the real behavior seen?
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup')
On 2016/07/14 19:04:12, nednguyen wrote:
> We still should add "raise" after this so the exception bubbles up & cause a
> test failure

Is failing the test a good idea? setUp() is called before the test method, so there won't be any opportunity to mark a test flaky to retry it. (_RunGpuTest is what provides for flaky retries.)

We should definitely at least print the exception to see what's going wrong. There's no good reason for browser startup to fail.

Would it be a good idea to just get rid of this method entirely in GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) right before the call to self.RunActualGpuTest?
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup') On 2016/07/14 23:06:17, Ken Russell wrote: > On 2016/07/14 19:04:12, nednguyen wrote: > > We still should add "raise" after this so the exception bubbles up & cause a > > test failure > > Is failing the test a good idea? setUp() is called before the test method, so > there won't be any opportunity to mark a test flaky to retry it. (_RunGpuTest is > what provides for flaky retries) > > We should definitely at least print the exception to see what's going wrong. > There's no good reason for browser startup to fail. > > Would it be a good idea to just get rid of this method entirely in > GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) right > before the call to self.RunActualGpuTest ? As you mention the problem of marking test flaky & retry, probably this is the best way to go.
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup') On 2016/07/14 23:10:01, nednguyen wrote: > On 2016/07/14 23:06:17, Ken Russell wrote: > > On 2016/07/14 19:04:12, nednguyen wrote: > > > We still should add "raise" after this so the exception bubbles up & cause a > > > test failure > > > > Is failing the test a good idea? setUp() is called before the test method, so > > there won't be any opportunity to mark a test flaky to retry it. (_RunGpuTest > is > > what provides for flaky retries) > > > > We should definitely at least print the exception to see what's going wrong. > > There's no good reason for browser startup to fail. > > > > Would it be a good idea to just get rid of this method entirely in > > GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) right > > before the call to self.RunActualGpuTest ? > As you mention the problem of marking test flaky & retry, probably this is the > best way to go. Though people can always subclass GpuIntegrationTest & override setUp method. Maybe we should add a unittest that enforce no overriding setUp & tearDown
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup') On 2016/07/14 23:16:47, nednguyen wrote: > On 2016/07/14 23:10:01, nednguyen wrote: > > On 2016/07/14 23:06:17, Ken Russell wrote: > > > On 2016/07/14 19:04:12, nednguyen wrote: > > > > We still should add "raise" after this so the exception bubbles up & cause > a > > > > test failure > > > > > > Is failing the test a good idea? setUp() is called before the test method, > so > > > there won't be any opportunity to mark a test flaky to retry it. > (_RunGpuTest > > is > > > what provides for flaky retries) > > > > > > We should definitely at least print the exception to see what's going wrong. > > > There's no good reason for browser startup to fail. > > > > > > Would it be a good idea to just get rid of this method entirely in > > > GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) right > > > before the call to self.RunActualGpuTest ? > > As you mention the problem of marking test flaky & retry, probably this is the > > best way to go. > > Though people can always subclass GpuIntegrationTest & override setUp method. > Maybe we should add a unittest that enforce no overriding setUp & tearDown OK. Let's start by adding: if not self.tab: self.tab = self.browser.tabs[0] in GpuIntegrationTest._RunGpuTest, and removing setUp here. That way any exception that's raised is handled by the normal mechanisms, and will be printed out, and the browser restarted if it happens. A big question is why the renderer is crashing, and that change will allow diagnosis. Regarding such a unittest: that would be great, though I don't know how to write it. Any advice you can offer would be great.
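The lazy tab fetch proposed above can be sketched as follows. FakeTabs/FakeBrowser and the `_RunGpuTest` signature here are illustrative stand-ins for the real telemetry objects, not the actual API:

```python
# Sketch of the lazy tab fetch proposed above. FakeTabs/FakeBrowser are
# illustrative stand-ins for the real telemetry browser objects.
class FakeTabs(object):
  def __getitem__(self, index):
    return 'tab%d' % index


class FakeBrowser(object):
  tabs = FakeTabs()


class GpuIntegrationTest(object):
  def __init__(self):
    self.browser = FakeBrowser()
    self.tab = None

  def _RunGpuTest(self, run_actual_gpu_test):
    # Fetch the tab only if we don't already have one (i.e., it's None),
    # so any failure here flows through the normal retry/reporting path.
    if not self.tab:
      self.tab = self.browser.tabs[0]
    return run_actual_gpu_test(self.tab)
```

Because the fetch happens inside `_RunGpuTest`, an exception raised while grabbing the tab would be handled by the same mechanism that handles test-body failures, which is the point of the suggestion.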
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup') On 2016/07/15 01:10:37, Ken Russell wrote: > On 2016/07/14 23:16:47, nednguyen wrote: > > On 2016/07/14 23:10:01, nednguyen wrote: > > > On 2016/07/14 23:06:17, Ken Russell wrote: > > > > On 2016/07/14 19:04:12, nednguyen wrote: > > > > > We still should add "raise" after this so the exception bubbles up & > cause > > a > > > > > test failure > > > > > > > > Is failing the test a good idea? setUp() is called before the test method, > > so > > > > there won't be any opportunity to mark a test flaky to retry it. > > (_RunGpuTest > > > is > > > > what provides for flaky retries) > > > > > > > > We should definitely at least print the exception to see what's going > wrong. > > > > There's no good reason for browser startup to fail. > > > > > > > > Would it be a good idea to just get rid of this method entirely in > > > > GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) > right > > > > before the call to self.RunActualGpuTest ? > > > As you mention the problem of marking test flaky & retry, probably this is > the > > > best way to go. > > > > Though people can always subclass GpuIntegrationTest & override setUp method. > > Maybe we should add a unittest that enforce no overriding setUp & tearDown > > OK. Let's start by adding: > > if not self.tab: > self.tab = self.browser.tabs[0] > > in GpuIntegrationTest._RunGpuTest, and removing setUp here. That way any > exception that's raised is handled by the normal mechanisms, and will be printed > out, and the browser restarted if it happens. A big question is why the renderer > is crashing, and that change will allow diagnosis. 
> Regarding such a unittest: that would be great, though I don't know how to
> write it. Any advice you can offer would be great.

We can use something similar to
https://cs.chromium.org/chromium/src/third_party/catapult/telemetry/telemetry...

Basically, use telemetry.core.discover to dynamically discover all the subclasses of SeriallyExecutedBrowserTestCase & assert:
1) they are subclasses of GpuIntegrationTestCase
2) they don't override setUp & tearDown
... other things
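A minimal sketch of that enforcement idea: the real test would use telemetry.core.discover to find the test classes; plain `__subclasses__()` stands in here, and the subclass names are made up:

```python
# Sketch of the "no overriding setUp/tearDown" check discussed above. The
# real test would use telemetry.core.discover to find the test classes;
# plain __subclasses__() stands in here, and the subclass name is made up.
class GpuIntegrationTest(object):
  def setUp(self):
    pass

  def tearDown(self):
    pass


class WebGLConformanceTest(GpuIntegrationTest):  # hypothetical subclass
  pass


def FindSetUpTearDownOverrides():
  """Returns names of subclasses that override setUp or tearDown."""
  offenders = []
  for cls in GpuIntegrationTest.__subclasses__():
    # A class overrides a method iff its own __dict__ defines it;
    # inherited methods don't appear in the subclass's __dict__.
    if 'setUp' in cls.__dict__ or 'tearDown' in cls.__dict__:
      offenders.append(cls.__name__)
  return offenders
```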
https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:134: self._RestartBrowser('failure in setup') On 2016/07/15 01:39:55, nednguyen wrote: > On 2016/07/15 01:10:37, Ken Russell wrote: > > On 2016/07/14 23:16:47, nednguyen wrote: > > > On 2016/07/14 23:10:01, nednguyen wrote: > > > > On 2016/07/14 23:06:17, Ken Russell wrote: > > > > > On 2016/07/14 19:04:12, nednguyen wrote: > > > > > > We still should add "raise" after this so the exception bubbles up & > > cause > > > a > > > > > > test failure > > > > > > > > > > Is failing the test a good idea? setUp() is called before the test > method, > > > so > > > > > there won't be any opportunity to mark a test flaky to retry it. > > > (_RunGpuTest > > > > is > > > > > what provides for flaky retries) > > > > > > > > > > We should definitely at least print the exception to see what's going > > wrong. > > > > > There's no good reason for browser startup to fail. > > > > > > > > > > Would it be a good idea to just get rid of this method entirely in > > > > > GpuIntegrationTest, and fetch self.tab if necessary (i.e., it's None) > > right > > > > > before the call to self.RunActualGpuTest ? > > > > As you mention the problem of marking test flaky & retry, probably this is > > the > > > > best way to go. > > > > > > Though people can always subclass GpuIntegrationTest & override setUp > method. > > > Maybe we should add a unittest that enforce no overriding setUp & tearDown > > > > OK. Let's start by adding: > > > > if not self.tab: > > self.tab = self.browser.tabs[0] > > > > in GpuIntegrationTest._RunGpuTest, and removing setUp here. That way any > > exception that's raised is handled by the normal mechanisms, and will be > printed > > out, and the browser restarted if it happens. 
> > A big question is why the renderer is crashing, and that change will allow
> > diagnosis.
> >
> > Regarding such a unittest: that would be great, though I don't know how to
> > write it. Any advice you can offer would be great.
>
> We can use what's similar to
> https://cs.chromium.org/chromium/src/third_party/catapult/telemetry/telemetry...
>
> Basically use telemetry.core.discover to dynamically discover all the
> subclasses of SeriallyExecutedBrowserTestCase & assert
> 1) they are subclasses of GpuIntegrationTestCase
> 2) they don't override setUp & tearDown
> ... other things

So I see one problem with this workflow.

The original errors we were diagnosing were when test n crashed the browser or the tab, and then tests n+1 to num_tests were failing because we try to set the tab to the 0th tab of the browser, which is now in a crashed state. Therefore, test n is really where the error happened, and all of the subsequent tests failed because of it.

Therefore, we are catching the error in the setUp method for the browser being down so that the n+1 test has a chance to run and pass or fail on its own instead of failing because a previous test failed.

If we move this logic into the _RunGpuTest method, this information will be lost. Say for example test n+1 is expected to fail: it now fails on self.tab = self.browser.tabs[0] in the try/catch in _RunGpuTest, but actually "succeeds" because the expectation was for it to fail; we just didn't realize it failed for the wrong reason.

I think this logic should stay in setUp since it is actually the setup that is failing: we can't access the browser because it went down previously, so let's try to restart it and maybe log a warning.

https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test_unittest.py (right):

https://codereview.chromium.org/2151983002/diff/20001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test_unittest.py:39: self.tab.Navigate('chrome://crash')
On 2016/07/14 21:38:22, Ken Russell wrote:
> Per comments in https://codereview.chromium.org/2148283003 , what if this were
> rewritten to conditionally navigate to chrome://inducebrowsercrashforrealz,
> and only then call super(...).setUp()? Would that more closely match the real
> behavior seen?

Done.
> So I see one problem with this workflow.
>
> The original errors we were diagnosing were when test n crashed the browser or
> the tab and then tests n+1 to num_tests were failing because we try and set the
> tab to the 0th tab of the browser which is now in a crashed state. Therefore,
> test n is really where the error happened, and all of the subsequent tests
> failed because of it.
>
> Therefore, we are catching the error in the setup method for the browser being
> down so that the n+1 test has a chance to run and pass or fail on its own
> instead of failing because a previous test failed.
>
> If we move this logic into the _RunGpuTest method this information will be lost.
> Say for example test n+1 is expected to fail, it now fails on self.tab =
> self.browser.tabs[0] in the try/catch in RunGpuTest, but actually succeeds
> because the expectation was for it to fail, we just didn't realize it was the
> wrong reason for failing.
>
> I think this logic should stay in the setup since it is actually the setup that
> is failing, we can't access the browser because it is previously down, so lets
> try and restart it and maybe log a warning.

Good points. Agreed, with warning and diagnostics added. LGTM

https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:131: except Exception:
Please call exception_formatter.PrintFormattedException here so that we can start to diagnose what's going wrong when this happens. Also please log something about a failure happening during browser startup. Thanks.

https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test_unittest.py (right):

https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test_unittest.py:38: self.tab.Navigate('chrome://inducebrowsercrashforrealz')
Based on another thread, it sounded like I was wrong, and it's the renderer rather than the browser which crashed. So could you change this back to chrome://crash? Will that still provoke the failure? Thanks.
https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:131: except Exception:
On 2016/07/15 18:05:24, Ken Russell wrote:
> Please call exception_formatter.PrintFormattedException here so that we can
> start to diagnose what's going wrong when this happens. Also please log
> something about a failure happening during browser startup. Thanks.

Recently I found out about Python's log.exception(msg), which renders the exception_formatter module in telemetry unnecessary :-P
https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:131: except Exception: On 2016/07/15 18:11:15, nednguyen wrote: > On 2016/07/15 18:05:24, Ken Russell wrote: > > Please call exception_formatter.PrintFormattedException here so that we can > > start to diagnose what's going wrong when this happens. Also please log > > something about a failure happening during browser startup. Thanks. > > Recently I found out about pythons' log.exception(msg) that renders > exception_formatter module in telemetry unnecessary :-P Aha. Very good, we should file a cleanup bug. :)
https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:131: except Exception: On 2016/07/16 01:09:54, Ken Russell wrote: > On 2016/07/15 18:11:15, nednguyen wrote: > > On 2016/07/15 18:05:24, Ken Russell wrote: > > > Please call exception_formatter.PrintFormattedException here so that we can > > > start to diagnose what's going wrong when this happens. Also please log > > > something about a failure happening during browser startup. Thanks. > > > > Recently I found out about pythons' log.exception(msg) that renders > > exception_formatter module in telemetry unnecessary :-P > > Aha. Very good, we should file a cleanup bug. :) Done. https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test_unittest.py (right): https://codereview.chromium.org/2151983002/diff/40001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test_unittest.py:38: self.tab.Navigate('chrome://inducebrowsercrashforrealz') On 2016/07/15 18:05:24, Ken Russell wrote: > Based on another thread, it sounded like I was wrong, and it's the renderer > rather than the browser which crashed. So could you change this back to > chrome://crash? Will that still provoke the failure? Thanks. Done.
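For reference, a self-contained illustration of the log.exception behavior mentioned above; the logger name and the simulated failure message are arbitrary:

```python
# Demonstrates logging's exception(): called from an except block, it logs
# the message at ERROR level and appends the current traceback, which is
# what exception_formatter.PrintFormattedException was being used for.
import io
import logging


def DescribeSetupFailure():
  """Returns the logged text for a simulated setup failure."""
  stream = io.StringIO()
  logger = logging.getLogger('setup-failure-demo')  # arbitrary name
  logger.addHandler(logging.StreamHandler(stream))
  try:
    raise RuntimeError('browser failed to start')  # stand-in failure
  except Exception:
    # Must be called from an exception handler; the active traceback is
    # picked up automatically and appended to the message.
    logger.exception('Exception raised during test setup')
  return stream.getvalue()
```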
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup')
I still think we should reraise after this, otherwise this will not surface the failure.
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup')
On 2016/07/21 18:01:55, nednguyen wrote:
> I still think we should reraise after this, otherwise this will not surface
> the failure.

The failure is on the test prior, so the exception that we care about has already been raised for that test. This is simply catching the state of the browser that failed from the last tests and restarting it so this test has a chance to pass. Raising the error again here would fail this test and defeat the whole purpose of this CL in the first place.
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup')
On 2016/07/21 18:35:36, eyaich (ooo july18-july20) wrote:
> On 2016/07/21 18:01:55, nednguyen wrote:
> > I still think we should reraise after this, otherwise this will not surface
> > the failure.
>
> The failure is on the test prior, so the exception that we care about has
> already been raised for that test. This is simply catching the state of the
> browser that failed from the last tests and restarting it so this test has a
> chance to pass. Raising the error again here would fail this test and defeat
> the whole purpose of this CL in the first place.

I think that we may still have a case in which:

test N does not fail
test N + 1's setUp fails.

I would also expect most of the test failures from test N to already trigger a browser restart in test N:
https://cs.chromium.org/chromium/src/content/test/gpu/gpu_tests/gpu_integrati...
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
File content/test/gpu/gpu_tests/gpu_integration_test.py (right):

https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te...
content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup')
On 2016/07/21 18:40:30, nednguyen wrote:
> On 2016/07/21 18:35:36, eyaich (ooo july18-july20) wrote:
> > On 2016/07/21 18:01:55, nednguyen wrote:
> > > I still think we should reraise after this, otherwise this will not
> > > surface the failure.
> >
> > The failure is on the test prior, so the exception that we care about has
> > already been raised for that test. This is simply catching the state of the
> > browser that failed from the last tests and restarting it so this test has
> > a chance to pass. Raising the error again here would fail this test and
> > defeat the whole purpose of this CL in the first place.
>
> I think that we may still have case in which:
>
> test N not fail
> test N + 1's setUp fail.
>
> I would also expect most of the test failure from test N should also trigger
> a browser restart in test N already:
> https://cs.chromium.org/chromium/src/content/test/gpu/gpu_tests/gpu_integrati...

How would test N+1's setUp fail unless tab[0] wasn't available, i.e. the browser wasn't up, from either 1) a previous failure or 2) the browser not being able to start at all, in which case all tests in that process would fail.

If we raise an exception again here, what is the point of this try/catch block other than to restart the browser again? I guess it would restart it for the N+2 test, but then N+1 might fail unnecessarily.

Also, if test N was expected to fail, the browser wouldn't restart regardless of the failure reason. Side note: why don't we restart the browser after the expected-failure case, won't the browser be down after the exception?
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup') On 2016/07/21 18:48:02, eyaich wrote: > On 2016/07/21 18:40:30, nednguyen wrote: > > On 2016/07/21 18:35:36, eyaich (ooo july18-july20) wrote: > > > On 2016/07/21 18:01:55, nednguyen wrote: > > > > I still think we should reraise after this, otherwise this will not > surface > > > the > > > > failure. > > > > > > The failure is on the test prior, so the exception that we care about has > > > already been raised for that test. This is simply catching the state of the > > > browser that failed from the last tests and restarting it so this test has a > > > chance to pass. Raising the error again here would fail this test and > defeat > > > the whole purpose of this CL in the first place. > > > > I think that we may still have case in which: > > > > test N not fail > > test N + 1 's setUp fail. > > > > I would also expect most of the test failure from test N should also trigger a > > browser restart in test N already: > > > https://cs.chromium.org/chromium/src/content/test/gpu/gpu_tests/gpu_integrati... > > How would test N+1's setUp fail unless tab[0] wasn't available, ie the browser > wasn't up from either 1) a previous failure or 2) the browser isn't able to be > started at all in which case all tests in that process would fail. 
Note that all the communication between telemetry & chrome happens asynchronously, so it's possible that:

Test N: Browser is alive
Test N: Telemetry tells Browser to do X
Test N: Browser does X
Test N: Telemetry pokes the browser to get the state of the page, browser returns "foo"
Somewhere between test N & test N + 1: Browser crashes
Test N + 1's setUp: Telemetry contacts the browser again, but it's already crashed & throws CrashException.

> If we raise an exception again here what is the point of this try catch block
> other than to restart the browser again? I guess it would restart it for the
> N+2 test, but then N+1 might fail unnecessarily.

Yes, in case there is an exception, I would prefer that N+1 fails here rather than nothing failing. When there is a failure, adding an extra failure is bad, but not as bad as no failure at all if there is a crash here that wasn't shown in a previous test run.

> Also, if test N was expected to fail the browser wouldn't restart regardless
> of the failure reason. Side note: why don't we restart the browser after the
> expected failure case, won't the browser be down after the exception?
https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te... File content/test/gpu/gpu_tests/gpu_integration_test.py (right): https://codereview.chromium.org/2151983002/diff/60001/content/test/gpu/gpu_te... content/test/gpu/gpu_tests/gpu_integration_test.py:135: self._RestartBrowser('failure in setup') On 2016/07/21 18:57:51, nednguyen wrote: > On 2016/07/21 18:48:02, eyaich wrote: > > On 2016/07/21 18:40:30, nednguyen wrote: > > > On 2016/07/21 18:35:36, eyaich (ooo july18-july20) wrote: > > > > On 2016/07/21 18:01:55, nednguyen wrote: > > > > > I still think we should reraise after this, otherwise this will not > > surface > > > > the > > > > > failure. > > > > > > > > The failure is on the test prior, so the exception that we care about has > > > > already been raised for that test. This is simply catching the state of > the > > > > browser that failed from the last tests and restarting it so this test has > a > > > > chance to pass. Raising the error again here would fail this test and > > defeat > > > > the whole purpose of this CL in the first place. > > > > > > I think that we may still have case in which: > > > > > > test N not fail > > > test N + 1 's setUp fail. > > > > > > I would also expect most of the test failure from test N should also trigger > a > > > browser restart in test N already: > > > > > > https://cs.chromium.org/chromium/src/content/test/gpu/gpu_tests/gpu_integrati... > > > > How would test N+1's setUp fail unless tab[0] wasn't available, ie the browser > > wasn't up from either 1) a previous failure or 2) the browser isn't able to be > > started at all in which case all tests in that process would fail. 
> > Note that all the communication between telemetry & chrome happens > asynchronously, so it's possible that: > > Test N : Browser is aive > Test N : Telemetry tells Browser to do X > Test N : Browser does X > Test N : Telemetry poke browser to get the state of the page, browser return > "foo" > Some where between test N & test N + 1: Browser crashes > Test N + 1.setUp:Telemetry contacts the browser again, but it's already crashed > & throw CrashException. > > > > > If we raise an exception again here what is the point of this try catch block > > other than to restart the browser again? I guess it would restart it for the > > N+2 test, but then N+1 might fail unnecessarily. > > Yes, in case there is the exception, I would prefer N+1 fails here rather than > nothing fails. When there is failure, adding an extra failure is bad, but not > as bad as comparing to no failure at all if there is a crash here that wasn't > shown in previous test run. > > > > > Also, if if test N was expected to fail the browser wouldn't restart > regardless > > of the failure reason. Side note: why don't we restart the browser after the > > expected failure case, won't the browser be down after the exception? Ok I agree that not reporting an error at all is worse than reporting an extra one. Added the raise.
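Putting the thread's conclusion together, here is a self-contained sketch of the agreed setUp behavior: log the exception, restart the browser so the next test gets a healthy one, then re-raise so this test's failure is still surfaced. FakeBrowser and its methods are illustrative stand-ins, not the real telemetry API:

```python
# Sketch of the setUp behavior the thread converged on: log the exception,
# restart the browser so the *next* test gets a healthy one, then re-raise
# so this test's failure is still surfaced. FakeBrowser and its method
# names are illustrative stand-ins, not the real telemetry API.
import logging


class FakeBrowser(object):
  def __init__(self):
    self.alive = True
    self.restarts = 0

  def FirstTab(self):
    if not self.alive:
      raise RuntimeError('browser crashed')
    return 'tab0'

  def Restart(self):
    self.alive = True
    self.restarts += 1


class GpuIntegrationTest(object):
  def __init__(self, browser):
    self.browser = browser
    self.tab = None

  def _RestartBrowser(self, reason):
    logging.warning('Restarting browser due to %s', reason)
    self.browser.Restart()

  def setUp(self):
    try:
      self.tab = self.browser.FirstTab()
    except Exception:
      # Log the full traceback so startup failures can be diagnosed.
      logging.exception('Exception raised during test setup')
      # Restart so the next test has a chance to pass on its own...
      self._RestartBrowser('failure in setup')
      # ...but re-raise so this setUp failure is not silently swallowed.
      raise
```

Re-raising after the restart is what resolves the disagreement above: nothing crashes silently, yet the browser is healthy again for the test that follows.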
lgtm
The CQ bit was checked by eyaich@google.com
The patchset sent to the CQ was uploaded after l-g-t-m from kbr@chromium.org Link to the patchset: https://codereview.chromium.org/2151983002/#ps80001 (title: "Raising error after restarting the browser")
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
Your CL relies on deprecated CQ feature(s): * Specifying master names in CQ_INCLUDE_TRYBOTS part of description without "master." prefix is deprecated: tryserver.chromium.linux tryserver.chromium.mac tryserver.chromium.win For more details, see http://crbug.com/617627.
The CQ bit was unchecked by commit-bot@chromium.org
Try jobs failed on following builders: android_arm64_dbg_recipe on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/android_arm6...)
The CQ bit was checked by eyaich@google.com
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
The CQ bit was checked by eyaich@google.com
The patchset sent to the CQ was uploaded after l-g-t-m from nednguyen@google.com, kbr@chromium.org Link to the patchset: https://codereview.chromium.org/2151983002/#ps100001 (title: "Syncing client")
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
The CQ bit was unchecked by commit-bot@chromium.org
Try jobs failed on following builders: mac_optional_gpu_tests_rel on tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_optional_gpu_...)
On 2016/07/21 22:08:32, commit-bot: I haz the power wrote: > Try jobs failed on following builders: > mac_optional_gpu_tests_rel on tryserver.chromium.mac (JOB_FAILED, > http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_optional_gpu_...) Sorry, I don't know what's going on there. Will ask around. Re-CQ'ing.
The CQ bit was checked by kbr@chromium.org
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
On 2016/07/21 22:47:49, Ken Russell wrote: > On 2016/07/21 22:08:32, commit-bot: I haz the power wrote: > > Try jobs failed on following builders: > > mac_optional_gpu_tests_rel on tryserver.chromium.mac (JOB_FAILED, > > > http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_optional_gpu_...) > > Sorry, I don't know what's going on there. Will ask around. Re-CQ'ing. Filed http://crbug.com/630452 about these failures and suppressing them in https://codereview.chromium.org/2170153002 .
The CQ bit was unchecked by commit-bot@chromium.org
Try jobs failed on following builders: linux_android_rel_ng on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/linux_androi...)
The CQ bit was checked by nednguyen@google.com
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
Message was sent while issue was closed.
Committed patchset #6 (id:100001)
Message was sent while issue was closed.
Description was changed from ========== Adding logic to restart the browser if there is an exception in the setUp of GpuIntegrationTest Note: I am not certain there is a good way to test the setup functionality of GpuIntegrationTest in a unittest. Given that we are using a Fakes, I have added a hack to simulate throwing an error in the setup method, but it is not how it would behave in practice. Any suggestions for a better way to unittest this are appreciated. Dependent on https://codereview.chromium.org/2148283003 landing in telemetry first BUG=628022 CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel ========== to ========== Adding logic to restart the browser if there is an exception in the setUp of GpuIntegrationTest Note: I am not certain there is a good way to test the setup functionality of GpuIntegrationTest in a unittest. Given that we are using a Fakes, I have added a hack to simulate throwing an error in the setup method, but it is not how it would behave in practice. Any suggestions for a better way to unittest this are appreciated. Dependent on https://codereview.chromium.org/2148283003 landing in telemetry first BUG=628022 CQ_INCLUDE_TRYBOTS=tryserver.chromium.linux:linux_optional_gpu_tests_rel;tryserver.chromium.mac:mac_optional_gpu_tests_rel;tryserver.chromium.win:win_optional_gpu_tests_rel Committed: https://crrev.com/202a09250ad4141be8a80cd63881998d8e821f09 Cr-Commit-Position: refs/heads/master@{#407022} ==========
Message was sent while issue was closed.
Patchset 6 (id:??) landed as https://crrev.com/202a09250ad4141be8a80cd63881998d8e821f09 Cr-Commit-Position: refs/heads/master@{#407022} |