Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(58)

Issue 2547713002: [Findit] Using ts_mon to track swarming/isolated server outages (Closed)

Created:
4 years ago by lijeffrey
Modified:
4 years ago
Reviewers:
chanli, stgao
CC:
chromium-reviews, infra-reviews+infra_chromium.org, Sharu Jiang
Target Ref:
refs/heads/master
Project:
infra
Visibility:
Public.

Description

[Findit] Using ts_mon to track swarming/isolated server outages Swarming server/isolated servers can go down and become unreachable, in which case Findit cannot continue to trigger/monitor swarming tasks whose results are used by try jobs and the flake checker. This change is to include monitoring using ts_mon so such outages can be monitored more effectively. BUG=669215 Committed: https://chromium.googlesource.com/infra/infra/+/dc8bb5ccaee0bffbf50ebeba97b9b3f32af27eda

Patch Set 1 #

Total comments: 6

Patch Set 2 : Addressing comments #

Patch Set 3 : fixing whitespace #

Total comments: 8

Patch Set 4 : Addressing comments #

Total comments: 2

Patch Set 5 : Fixing nit #

Unified diffs Side-by-side diffs Delta from patch set Stats (+28 lines, -6 lines) Patch
M appengine/findit/waterfall/monitoring.py View 1 2 3 4 1 chunk +4 lines, -0 lines 0 comments Download
M appengine/findit/waterfall/swarming_util.py View 1 2 3 4 5 chunks +16 lines, -2 lines 0 comments Download
M appengine/findit/waterfall/test/swarming_util_test.py View 1 2 3 3 chunks +8 lines, -4 lines 0 comments Download

Messages

Total messages: 15 (7 generated)
stgao
https://codereview.chromium.org/2547713002/diff/1/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/1/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/swarmingserverfailures', Maybe unify these two metrics as 'findit/httperrors', and ...
4 years ago (2016-12-01 22:25:15 UTC) #3
lijeffrey
comments addressed, ptal https://codereview.chromium.org/2547713002/diff/1/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/1/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/swarmingserverfailures', On 2016/12/01 22:25:15, stgao (slow ...
4 years ago (2016-12-01 22:54:34 UTC) #4
stgao
lgtm after comments are addressed https://codereview.chromium.org/2547713002/diff/40001/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/40001/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/httperrors', description='Failed http calls ...
4 years ago (2016-12-01 23:04:00 UTC) #5
lijeffrey
https://codereview.chromium.org/2547713002/diff/40001/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/40001/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/httperrors', description='Failed http calls to various servers') On 2016/12/01 ...
4 years ago (2016-12-01 23:25:30 UTC) #7
stgao
lgtm with nit https://codereview.chromium.org/2547713002/diff/60001/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/60001/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/httperrors', description='Failed http requests to various ...
4 years ago (2016-12-01 23:27:51 UTC) #8
lijeffrey
https://codereview.chromium.org/2547713002/diff/60001/appengine/findit/waterfall/monitoring.py File appengine/findit/waterfall/monitoring.py (right): https://codereview.chromium.org/2547713002/diff/60001/appengine/findit/waterfall/monitoring.py#newcode11 appengine/findit/waterfall/monitoring.py:11: 'findit/httperrors', description='Failed http requests to various servers') On 2016/12/01 ...
4 years ago (2016-12-01 23:34:08 UTC) #9
commit-bot: I haz the power
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2547713002/80001
4 years ago (2016-12-01 23:34:15 UTC) #12
commit-bot: I haz the power
4 years ago (2016-12-01 23:49:14 UTC) #15
Message was sent while issue was closed.
Committed patchset #5 (id:80001) as
https://chromium.googlesource.com/infra/infra/+/dc8bb5ccaee0bffbf50ebeba97b9b...

Powered by Google App Engine
This is Rietveld 408576698