Issue 2506263005: Prevent a service worker from keeping itself alive by self postMessage.

Marijn Kruisselbrink

The CQ bit was checked by mek@chromium.org to run a CQ dry run

4 years, 1 month ago (2016-11-18 00:17:24 UTC) #1

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/20001

4 years, 1 month ago (2016-11-18 00:19:09 UTC) #3

Marijn Kruisselbrink

mek@chromium.org changed reviewers: + falken@chromium.org

4 years, 1 month ago (2016-11-18 01:01:17 UTC) #4

Marijn Kruisselbrink

My initial attempt at implementing linking the timeout of message events to the timeout of ...

4 years, 1 month ago (2016-11-18 01:01:18 UTC) #5

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 1 month ago (2016-11-18 01:35:22 UTC) #6

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: android_n5x_swarming_rel on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/android_n5x_swarming_rel/builds/70517)

4 years, 1 month ago (2016-11-18 01:35:23 UTC) #7

Marijn Kruisselbrink

The CQ bit was checked by mek@chromium.org to run a CQ dry run

4 years, 1 month ago (2016-11-18 05:24:57 UTC) #8

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/40001

4 years, 1 month ago (2016-11-18 05:25:24 UTC) #9

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years, 1 month ago (2016-11-18 08:20:06 UTC) #10

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

4 years, 1 month ago (2016-11-18 08:20:07 UTC) #11

Marijn Kruisselbrink

Description was changed from ========== Prevent a service worker from keeping itself alive by self ...

4 years, 1 month ago (2016-11-18 20:30:24 UTC) #12

falken

I think this is a fine approach. I suppose the next steps will be to ...

4 years, 1 month ago (2016-11-21 08:01:48 UTC) #13

I think this is a fine approach.

I suppose the next steps will be to use a similar timeout clamping for foreign
fetch?

Agree that CONTINUE_ON_TIMEOUT didn't seem intuitive but it should be OK as long
as the idle timeout is working. I'm wondering about:

> and I think it might even be possible to end up in a situation with all events
expired and no idle_time_ set)

That seems really bad, and would defeat this mechanism. But if it's working as
intended, the idle_time_ is always set when the worker starts, and never gets
reset to null until the worker stops. Do you see something else?

Overall the entire service worker timeout code seems like it should be simpler,
that's out of the scope of this patch of course.

> As an aside, would it maybe make sense to split up
service_worker_browsertest.cc in a couple of files?

Yes, I think there's just too much friction to add a new file so everyone piles
tests into the same file.

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_browsertest.cc (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_browsertest.cc:1261: const
base::string16 messageMsg = base::ASCIIToUTF16("MESSAGE");
nit: activateMsg isn't used, and naming style should be kMessageMsg

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_browsertest.cc:1319: 
It'd be slightly nicer if this could verify that the worker STOPPED and isn't
starting back up again, since STOPPING could still mean a StartWorker is queued
up and waiting to restart. Do you know if the test can do that?

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_dispatcher_host.cc (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.cc:926: case
SERVICE_WORKER_PROVIDER_FOR_CONTROLLER: {
nit: maybe add some comment like "Clamp timeout to the sending worker's, to
prevent postMessage from keeping workers alive forever."

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.cc:1104:
sent_message_ports, source_info, callback, SERVICE_WORKER_ERROR_FAILED);
Hm, TIMEOUT seems like a better error code here, since if we did barely have
enough time and the worker took longer than 100 ms, we'd get a TIMEOUT error.

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_dispatcher_host.h (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.h:193: const
base::Optional<base::TimeDelta>& timeout,
super nit: would it be better for timeout to come before |callback| for
consistency with DispatchExtendableMessageEventAfterStartWorker?

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_version.cc (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_version.cc:574:
max_request_expiration_time_ = expiration_time;
It seems we never reset max to an earlier time (e.g., when this requests
finishes), so this is really an upper bound on the expiration time of all
outstanding requests. Maybe the comment suggested above should be more precise
then.

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_version.h (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_version.h:807: base::TimeTicks
max_request_expiration_time_;
nit: All these times are hard to keep track off so I'd prefer a comment like
"the expiration time of the request with the latest expiration time".

Marijn Kruisselbrink

The CQ bit was checked by mek@chromium.org to run a CQ dry run

4 years ago (2016-12-01 19:17:21 UTC) #14

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/60001

4 years ago (2016-12-01 19:17:49 UTC) #15

Marijn Kruisselbrink

On 2016/11/21 at 08:01:48, falken wrote: > I suppose the next steps will be to ...

4 years ago (2016-12-01 19:54:30 UTC) #16

On 2016/11/21 at 08:01:48, falken wrote:
> I suppose the next steps will be to use a similar timeout clamping for foreign
fetch?
Yeah, working on that in https://codereview.chromium.org/2518523003

> Agree that CONTINUE_ON_TIMEOUT didn't seem intuitive but it should be OK as
long as the idle timeout is working. I'm wondering about:

One problem with CONTINUE_ON_TIMEOUT and IPC based events (such as currently the
message event), is the code in EmbeddedWorkerRegistry::OnMessageReceived, which
currently "Assume an unhandled message for a stopping worker is because the
message was timed out and its handler removed prior to stopping.". With
CONTINUE_ON_TIMEOUT it suddenly becomes reasonable to get messages for timed out
requests where the worker isn't stopping... Not sure how to best deal with that.

> > and I think it might even be possible to end up in a situation with all
events expired and no idle_time_ set)
> 
> That seems really bad, and would defeat this mechanism. But if it's working as
intended, the idle_time_ is always set when the worker starts, and never gets
reset to null until the worker stops. Do you see something else?

Actually I was wrong. I was misreading some of the code, and misinterpreting
test flakiness I was seeing in my attempt at adding a browsertest for this
(where sometimes the worker wouldn't actually stop). So yeah, I'm pretty sure
idle_time_ should be working correctly indeed.

> Overall the entire service worker timeout code seems like it should be
simpler, that's out of the scope of this patch of course.

Agreed.

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_browsertest.cc (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_browsertest.cc:1319: 
On 2016/11/21 at 08:01:48, falken ooo on Dec 1 wrote:
> It'd be slightly nicer if this could verify that the worker STOPPED and isn't
starting back up again, since STOPPING could still mean a StartWorker is queued
up and waiting to restart. Do you know if the test can do that?

Unfortunately I couldn't even figure out how to make the existing assertions not
flakily fail (the test as written seemed to fail about 1% of the time) due to
timing issues with how far the renderer progresses while the browser is doing
things here... After trying all kinds of things I've just deleted this entire
browser test, although it would be nice to have some kind of more end-to-end
test than the unit tests... Just not sure how to reliably write one..

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_dispatcher_host.cc (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.cc:926: case
SERVICE_WORKER_PROVIDER_FOR_CONTROLLER: {
On 2016/11/21 at 08:01:48, falken ooo on Dec 1 wrote:
> nit: maybe add some comment like "Clamp timeout to the sending worker's, to
prevent postMessage from keeping workers alive forever."

Done

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.cc:1104:
sent_message_ports, source_info, callback, SERVICE_WORKER_ERROR_FAILED);
On 2016/11/21 at 08:01:48, falken ooo on Dec 1 wrote:
> Hm, TIMEOUT seems like a better error code here, since if we did barely have
enough time and the worker took longer than 100 ms, we'd get a TIMEOUT error.

Good point, done.

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_dispatcher_host.h (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_dispatcher_host.h:193: const
base::Optional<base::TimeDelta>& timeout,
On 2016/11/21 at 08:01:48, falken ooo on Dec 1 wrote:
> super nit: would it be better for timeout to come before |callback| for
consistency with DispatchExtendableMessageEventAfterStartWorker?

Done

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
File content/browser/service_worker/service_worker_version.h (right):

https://codereview.chromium.org/2506263005/diff/40001/content/browser/service...
content/browser/service_worker/service_worker_version.h:807: base::TimeTicks
max_request_expiration_time_;
On 2016/11/21 at 08:01:48, falken ooo on Dec 1 wrote:
> nit: All these times are hard to keep track off so I'd prefer a comment like
"the expiration time of the request with the latest expiration time".

Done

Marijn Kruisselbrink

The CQ bit was checked by mek@chromium.org to run a CQ dry run

4 years ago (2016-12-02 00:30:39 UTC) #17

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/100001

4 years ago (2016-12-02 00:31:16 UTC) #18

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years ago (2016-12-02 02:40:38 UTC) #19

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_chromium_rel_ng on master.tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/linux_chromium_rel_ng/builds/348475)

4 years ago (2016-12-02 02:40:39 UTC) #20

Marijn Kruisselbrink

The CQ bit was checked by mek@chromium.org to run a CQ dry run

4 years ago (2016-12-03 00:09:45 UTC) #21

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/120001

4 years ago (2016-12-03 00:10:44 UTC) #22

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

4 years ago (2016-12-03 02:01:23 UTC) #23

falken

This patch lgtm. "One problem with CONTINUE_ON_TIMEOUT and IPC based events (such as currently the ...

4 years ago (2016-12-05 02:31:51 UTC) #25

falken

OK I now learned about ServiceWorkerDisaptcherHost::kFilteredMessageClasses, so the BadMessageReceived is not necessarily incorrect. It still ...

4 years ago (2016-12-05 03:32:22 UTC) #26

Marijn Kruisselbrink

On 2016/12/05 at 03:32:22, falken wrote: > OK I now learned about ServiceWorkerDisaptcherHost::kFilteredMessageClasses, so the ...

4 years ago (2016-12-05 18:35:17 UTC) #27

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2506263005/120001

4 years ago (2016-12-05 18:35:56 UTC) #29

commit-bot: I haz the power

CQ is committing da patch. Bot data: {"patchset_id": 120001, "attempt_start_ts": 1480962920875860, "parent_rev": "a857ef03e84feccbebd9b7b95945b613f8e7efad", "commit_rev": "1c3c0932c53a559b80c8f661b80c8b2f64ae970a"}

4 years ago (2016-12-05 20:42:21 UTC) #30

commit-bot: I haz the power

Description was changed from ========== Prevent a service worker from keeping itself alive by self ...

4 years ago (2016-12-05 20:44:41 UTC) #32

commit-bot: I haz the power

4 years ago (2016-12-05 20:44:42 UTC) #33

Message was sent while issue was closed.

Patchset 6 (id:??) landed as
https://crrev.com/f88b52ec1e4aebd7c1f28b13a58bfad92d04ba94
Cr-Commit-Position: refs/heads/master@{#436393}

Issue 2506263005: Prevent a service worker from keeping itself alive by self postMessage. (Closed)

Description

Patch Set 1 #

Patch Set 2 : fix unit test #

Patch Set 3 : Get rid of browser test. #

Patch Set 4 : address further comments #

Patch Set 5 : cleanup #

Patch Set 6 : rebase #

Messages