Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(41)

Issue 2951023002: Only schedule termination task if an unconnected MP bot is dead (Closed)

Created:
3 years, 6 months ago by smut
Modified:
3 years, 6 months ago
Reviewers:
Vadim Sh.
CC:
chromium-reviews, infra-reviews+luci-py_chromium.org
Target Ref:
refs/heads/master
Project:
luci-py
Visibility:
Public.

Description

Only schedule termination task if an unconnected MP bot is dead Previously this was happening no matter what, meaning we were hardly giving any chance to MP bots to actually connect. Bug: 717769 Review-Url: https://codereview.chromium.org/2951023002 Committed: https://github.com/luci/luci-py/commit/3fa4c520dddd82ed190152709e0a54b35faa3bae

Patch Set 1 #

Unified diffs Side-by-side diffs Delta from patch set Stats (+12 lines, -7 lines) Patch
M appengine/swarming/server/lease_management.py View 3 chunks +12 lines, -7 lines 0 comments Download

Messages

Total messages: 14 (6 generated)
smut
I'm surprised nobody noticed/complained. The intention is for MP bots which don't connect within 10 ...
3 years, 6 months ago (2017-06-21 01:20:34 UTC) #2
Vadim Sh.
lol lgmt
3 years, 6 months ago (2017-06-21 01:24:30 UTC) #5
Vadim Sh.
On 2017/06/21 01:24:30, Vadim Sh. wrote: > lol > > lgmt err lgtm
3 years, 6 months ago (2017-06-21 01:24:38 UTC) #6
commit-bot: I haz the power
CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2951023002/40001
3 years, 6 months ago (2017-06-21 01:25:33 UTC) #8
Vadim Sh.
Why we didn't notice? Does MP gives released bots right back to Swarming or it ...
3 years, 6 months ago (2017-06-21 01:26:02 UTC) #9
smut
On 2017/06/21 01:26:02, Vadim Sh. wrote: > Why we didn't notice? Does MP gives released ...
3 years, 6 months ago (2017-06-21 01:29:46 UTC) #10
commit-bot: I haz the power
Committed patchset #1 (id:40001) as https://github.com/luci/luci-py/commit/3fa4c520dddd82ed190152709e0a54b35faa3bae
3 years, 6 months ago (2017-06-21 01:30:09 UTC) #13
smut
3 years, 6 months ago (2017-06-21 01:31:24 UTC) #14
Message was sent while issue was closed.
On 2017/06/21 01:29:46, smut wrote:
> On 2017/06/21 01:26:02, Vadim Sh. wrote:
> > Why we didn't notice? Does MP gives released bots right back to Swarming or
it
> > actually terminates them?
> 
> It actually deletes them, so you have to wait for GCE Backend to spin up a new
> VM and there is definitely a delay. The VMs that arrived within 60 seconds
would
> work as normal. Maybe most VMs arrive very quickly (I think it's unlikely) or
we
> have far too much capacity allocated.

Actually it might be possible that most or enough VMs arrive within 60 seconds.
The 60 second timer only starts once MP says the VM is yours, which only happens
once it has been booted. The only thing that would cause a delay is if startup
scripts take an exceptionally long time to complete.

Powered by Google App Engine
This is Rietveld 408576698