OLD | NEW |
1 # Perf Bot Sheriffing | 1 # Perf Bot Sheriffing |
2 | 2 |
3 The perf bot sheriff is responsible for keeping the bots on the chromium.perf | 3 The perf bot sheriff is responsible for keeping the bots on the chromium.perf |
4 waterfall up and running, and triaging performance test failures and flakes. | 4 waterfall up and running, and triaging performance test failures and flakes. |
5 | 5 |
6 **[Rotation calendar](https://calendar.google.com/calendar/embed?src=google.com_2fpmo740pd1unrui9d7cgpbg2k%40group.calendar.google.com)** | 6 **[Rotation calendar](https://calendar.google.com/calendar/embed?src=google.com_2fpmo740pd1unrui9d7cgpbg2k%40group.calendar.google.com)** |
7 | 7 |
8 ## Key Responsibilities | 8 ## Key Responsibilities |
9 | 9 |
10 * [Handle Device and Bot Failures](#Handle-Device-and-Bot-Failures) | 10 * [Handle Device and Bot Failures](#Handle-Device-and-Bot-Failures) |
(...skipping 137 matching lines...)
148 * **Infra** label is appropriate for bugs that are not high priority, but we | 148 * **Infra** label is appropriate for bugs that are not high priority, but we |
149 need infra team's help to triage. For example, the buildbot status page UI | 149 need infra team's help to triage. For example, the buildbot status page UI |
150 is weird or we are getting some infra-related log spam. The infra team works | 150 is weird or we are getting some infra-related log spam. The infra team works |
151 to triage these bugs within 24 hours, so you should ping if you do not get a | 151 to triage these bugs within 24 hours, so you should ping if you do not get a |
152 response. | 152 response. |
153 * **Tests>Telemetry** for telemetry failures. | 153 * **Tests>Telemetry** for telemetry failures. |
154 * **Tests>AutoBisect** for bisect and perf try job failures. | 154 * **Tests>AutoBisect** for bisect and perf try job failures. |
155 | 155 |
156 If you still need help, ask the speed infra chat, or escalate to sullivan@. | 156 If you still need help, ask the speed infra chat, or escalate to sullivan@. |
157 | 157 |
| 158 ### Android Cross-Device Failures |
| 159 |
| 160 When looking at failing Android tests, you may notice tests failing on |
| 161 multiple devices at once. Sometimes (but not always) this means that there |
| 162 is a problem on the host machine. One way this can occur is if a test uses |
| 163 the wrong version of adb in one of its commands. This causes the adb server |
| 164 on the host to reset, which can cause failures in anything trying to |
| 165 communicate with a device via adb during that time. A good tool for |
| 166 diagnosing this is the **Test Trace** step on the Android runs, which is a |
| 167 trace of which tests are running. If all the tests across all the testing |
| 168 shards are failing, the issue may be on the host rather than in the tests. |
| 169 This will no longer be used when the Android bots move to swarming, since |
| 170 each device will be sandboxed from the others and not run from a single |
| 171 point. |
| 172 |
| 173 |
158 ### Clobbering | 174 ### Clobbering |
159 | 175 |
160 Sometimes when a compile step is failing, you may be asked to clobber | 176 Sometimes when a compile step is failing, you may be asked to clobber |
161 [example](https://bugs.chromium.org/p/chromium/issues/detail?id=598955#c7). | 177 [example](https://bugs.chromium.org/p/chromium/issues/detail?id=598955#c7). |
162 Steps to clobber: | 178 Steps to clobber: |
163 | 179 |
164 1. Open the builder page through | 180 1. Open the builder page through |
165 [uberchromegw](https://uberchromegw.corp.google.com/i/chromium.perf). | 181 [uberchromegw](https://uberchromegw.corp.google.com/i/chromium.perf). |
166 2. At the bottom, there is a form titled **"Force build"**. | 182 2. At the bottom, there is a form titled **"Force build"**. |
167 3. Fill out the force build form, including your username, the reason for the | 183 3. Fill out the force build form, including your username, the reason for the |
(...skipping 119 matching lines...)
287 | 303 |
288 **[Pri-2 bugs](https://bugs.chromium.org/p/chromium/issues/list?can=2&q=label%3APerformance-Sheriff-BotHealth+label%3APri-2)** | 304 **[Pri-2 bugs](https://bugs.chromium.org/p/chromium/issues/list?can=2&q=label%3APerformance-Sheriff-BotHealth+label%3APri-2)** |
289 are for disabled tests. These should be pinged weekly, and work towards fixing | 305 are for disabled tests. These should be pinged weekly, and work towards fixing |
290 should be ongoing when the sheriff is not working on a Pri-1 issue. Here is the | 306 should be ongoing when the sheriff is not working on a Pri-1 issue. Here is the |
291 [list of Pri-2 bugs that have not been pinged in a week](https://bugs.chromium.org/p/chromium/issues/list?can=2&q=label:Performance-Sheriff-BotHealth%20label:Pri-2%20modified-before:today-7&sort=modified). | 307 [list of Pri-2 bugs that have not been pinged in a week](https://bugs.chromium.org/p/chromium/issues/list?can=2&q=label:Performance-Sheriff-BotHealth%20label:Pri-2%20modified-before:today-7&sort=modified). |
292 | 308 |
293 <!-- Unresolved issues: | 309 <!-- Unresolved issues: |
294 1. Do perf sheriffs watch the bisect waterfall? | 310 1. Do perf sheriffs watch the bisect waterfall? |
295 2. Do perf sheriffs watch the internal clank waterfall? | 311 2. Do perf sheriffs watch the internal clank waterfall? |
296 --> | 312 --> |