Issue 1566013002: Add support for bisect bots to post results to dashboard. (Closed)

Created:
4 years, 11 months ago by chrisphan

Modified:
4 years, 10 months ago

Reviewers:
prasadv, qyearsley

CC:
catapult-reviews_chromium.org, perf-dashboard-reviews_chromium.org

Base URL:
https://github.com/catapult-project/catapult.git@master

Target Ref:
refs/heads/master

Project:
catapult

Visibility:
Public.

Description

Add support for bisect bots to post results to dashboard.

High level changes:

update_bug_with_results.py
 - Removed the need to query rietveld, buildbucket, and buildbots. This makes it simpler: all the necessary results now come from one place, the bots.
 - Removed tracking of infra failures. The current method is not very reliable.
 - TryJobs are no longer removed.

post_bisect_results.py
 - To keep it simple, we verify the data and save it directly to a datastore JSONProperty.

bisect_report.py
 - This is where we create the report based on TryJob states: whether they failed, staled, or completed.

try_job.py
 - Added 'results_data', which is data directly from bisect bots.
 - Added a 'staled' state.

BUG=catapult:#1869

Committed: https://chromium.googlesource.com/external/github.com/catapult-project/catapult/+/4e0a366a9842da32ef20e8af466e5d0d7519d5e1

Patch Set 1 #

Total comments: 29

Patch Set 2 : addressed comments #

Total comments: 34

Patch Set 3 : address comments #

Total comments: 8

Patch Set 4 : rebase #

Total comments: 5

Patch Set 5 : address comments #

Patch Set 6 : check confidence to cc author #

Total comments: 1

Patch Set 7 : rebase #

Patch Set 8 : rebase #

Created: 4 years, 10 months ago

Stats (+913 lines, -1875 lines)

M	dashboard/dashboard/bisect_fyi.py	4 chunks	+27 lines, -39 lines	0 comments
M	dashboard/dashboard/bisect_fyi_test.py	2 chunks	+3 lines, -6 lines	0 comments
A	dashboard/dashboard/bisect_report.py	1 chunk	+143 lines, -0 lines	0 comments
A	dashboard/dashboard/bisect_report_test.py	1 chunk	+197 lines, -0 lines	0 comments
M	dashboard/dashboard/bisect_stats.py	1 chunk	+3 lines, -1 line	0 comments
M	dashboard/dashboard/dispatcher.py	2 chunks	+2 lines, -0 lines	0 comments
M	dashboard/dashboard/email_template.py	6 chunks	+45 lines, -43 lines	0 comments
M	dashboard/dashboard/models/try_job.py	3 chunks	+14 lines, -2 lines	0 comments
A	dashboard/dashboard/post_bisect_results.py	1 chunk	+111 lines, -0 lines	0 comments
A	dashboard/dashboard/post_bisect_results_test.py	1 chunk	+98 lines, -0 lines	0 comments
M	dashboard/dashboard/update_bug_with_results.py	12 chunks	+95 lines, -682 lines	0 comments
M	dashboard/dashboard/update_bug_with_results_test.py	11 chunks	+144 lines, -1102 lines	0 comments
M	dashboard/dashboard/utils.py	2 chunks	+31 lines, -0 lines	0 comments

Messages

Total messages: 22 (6 generated)

chrisphan

Docs and tests are in the works. https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/update_bug_with_results_test.py File dashboard/dashboard/update_bug_with_results_test.py (right): https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/update_bug_with_results_test.py#newcode1 dashboard/dashboard/update_bug_with_results_test.py:1: # ...

4 years, 11 months ago (2016-01-06 23:28:10 UTC) #2

chrisphan

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/email_template.py File dashboard/dashboard/email_template.py (right): https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/email_template.py#newcode77 dashboard/dashboard/email_template.py:77: _PERF_TRY_EMAIL_TEXT_BODY = """ Prasad, do you think all these ...

4 years, 11 months ago (2016-01-06 23:33:40 UTC) #3

chrisphan

Description was changed from ========== Add support for bisect bots to post results to dashboard. ...

4 years, 11 months ago (2016-01-07 00:00:08 UTC) #4

chrisphan

Description was changed from ========== Add support for bisect bots to post results to dashboard. ...

4 years, 11 months ago (2016-01-07 00:02:29 UTC) #5

qyearsley

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/bisect_report.py File dashboard/dashboard/bisect_report.py (right): https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/bisect_report.py#newcode5 dashboard/dashboard/bisect_report.py:5: """URL endpoint for a cron job to update bugs ...

4 years, 11 months ago (2016-01-11 22:16:45 UTC) #6

chrisphan

4 years, 11 months ago (2016-01-13 00:32:54 UTC) #7

Updated tests for:

bisect_report_test.py
update_bug_with_results_test.py

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
File dashboard/dashboard/bisect_report.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/bisect_report.py:5: """URL endpoint for a cron job to update
bugs after bisects."""
On 2016/01/11 22:16:44, qyearsley wrote:
> File docstring should be removed or updated.

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
File dashboard/dashboard/bisect_report_test.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/bisect_report_test.py:159: pass
On 2016/01/11 22:16:44, qyearsley wrote:
> Pending implementation.
> 
> When you implement this, it may be a good idea to avoid big global strings,
> because they tend to make the tests more fragile and harder to read.
> 
> Even if you use a big literal string, if possible it's good to put them in the
> one test method where they're used, and when possible, it's less fragile to
> assert that just one or two key things is contained in the string.

Agree, in this case, I think it's useful to see the expected layout in a bug
message.  LMK.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
File dashboard/dashboard/post_bisect_results.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:11: import re
On 2016/01/11 22:16:45, qyearsley wrote:
> Some of these appear unused; you can run pylint --enable=W0611 to find unused
> imports.

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:27: 'status': ['completed', 'failed',
'pending', 'aborted'],
On 2016/01/11 22:16:45, qyearsley wrote:
> Nit: 4 space indent

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:37: """URL endpoint to post data to
the dashboard."""
On 2016/01/11 22:16:44, qyearsley wrote:
> This docstring can be removed.

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:40: """Validates data parameter and
save to TryJob entity.
On 2016/01/11 22:16:45, qyearsley wrote:
> s/save/saves/
> 
> Is there something that implicitly happens when _UpdateTryJob is called? When
> is a post made to crbug?

This handler serves bots posting partial and full results, so we'll simply
validate and store the results.  Posting to crbug will still happen in the same
place, update_bug_with_results.py.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:42: Bisect results come from a "data"
parameter, which is a JSON encoding of a
On 2016/01/11 22:16:45, qyearsley wrote:
> I advocate changing the name from "data" to "bisect_results".

This comes from a generic post_json.py which is shared with posting data to
/add_point.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:95: """
On 2016/01/11 22:16:44, qyearsley wrote:
> I wonder whether this can be put somewhere else and be reused for other places
> where we're looking at received data or requests.
> 
> Anyway for this CL it can be kept here, but it might be nice to re-use it.

Acknowledged.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/post_bisect_results.py:99: act_type = type(actual)
On 2016/01/11 22:16:45, qyearsley wrote:
> I think it's a little more readable (in general) if you avoid abbreviations,
> and use names like expected_type and actual_type instead -- "exp" and "act" in
> different contexts can mean different things (expiry, experimental, expansion;
> active, action, actor...)

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
File dashboard/dashboard/update_bug_with_results.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/update_bug_with_results.py:97: # Do we want to send a FYI
Bisect email here?
On 2016/01/11 22:16:45, qyearsley wrote:
> Currently we're sending it below in _CheckFYIBisectJob. Seems like this if
> clause is just for the case where we want to stop retrying a bisect job (and
> possibly tell people that the bisect job repeatedly failed).
> 
> Side note: If you think it improves readability, you could add a little
> function:
> 
> def _IsStale(job):
>   if not job.last_ran_timestamp:
>     return False
>   time_since_last_ran = datetime.datetime.now() - job.last_ran_timestamp
>   return time_since_last_ran > _STALE_TRYJOB_DELTA

Added a TODO for sending an FYI email on staled bisects.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboa...
dashboard/dashboard/update_bug_with_results.py:309: # TODO(chrisphan): Use the
new quick_logger.
On 2016/01/11 22:16:45, qyearsley wrote:
> What's the new quick_logger? A new log name? This comment can be made more
> specific.
> 
> Also, the input of this function is try_job entity, right? Optionally you can
> add a docstring here.

Removing this for now.  I have a quick_logger update for displaying clearer
bisect results.  Will add it later.

chrisphan

https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboard/post_bisect_results.py File dashboard/dashboard/post_bisect_results.py (right): https://chromiumcodereview-hr.appspot.com/1566013002/diff/1/dashboard/dashboard/post_bisect_results.py#newcode40 dashboard/dashboard/post_bisect_results.py:40: """Validates data parameter and save to TryJob entity. On ...

4 years, 11 months ago (2016-01-13 00:35:35 UTC) #8

qyearsley

4 years, 11 months ago (2016-01-18 21:10:25 UTC) #9

Really happy with this change, especially because it simplifies
update_bug_with_results.py.

Prasad should review email_template.py and bisect_fyi.py.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
File dashboard/dashboard/bisect_report.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report.py:9: _CONFIDENCE_THRESHOLD = 99.5
This appears to be unused here now -- this "confidence threshold" is used to
determine whether the bisect results "status" should be "Positive" or "Negative".

In the current draft of the auto-bisect side changes
(https://codereview.chromium.org/1573293002), this is changed so that status
doesn't indicate "Positive" or "Negative".

I think it seems OK to put the deciding of positive/negative status on the
dashboard side, but it's also OK to decide on the bisect side and send it over.

Note that the "Positive"/"Negative" status was used to decide whether to update
bugs or not -- bugs are only updated for positive results. I think we want to
keep that behavior.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
File dashboard/dashboard/bisect_report_test.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report_test.py:64: }
Formatting should be adjusted here; see comments in post_bisect_results_test.py

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report_test.py:186:
self.assertEqual(_LOG_FAILED_BISECT, bisect_report.GetReport(job))
I think that the literal expected string directly in the test method here is a
good idea, even if it's very long and involves multi-line strings -- the reason
is that this puts the example expected string in the context of where it's
expected, and means that the reader doesn't need to scroll up and down.

Additionally, in some cases instead of using
 
 self.assertEqual(<full expected text>, bisect_report.GetReport(job))

it might be less fragile while keeping the same coverage if we use

 self.assertIn(<key part>, bisect_report.GetReport(job))
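
As a rough illustration of the `assertIn` approach, here is a minimal sketch. The `GetReport` function below is a hypothetical stand-in with made-up report text; it is not the real `bisect_report.GetReport`.

```python
import unittest


def GetReport(job):
    """Hypothetical stand-in that returns report text for a job dict."""
    return (
        'Bisect failed for bug %d.\n'
        'Status: %s\n'
        'Please contact the team if this persists.\n' % (
            job['bug_id'], job['status']))


class GetReportTest(unittest.TestCase):

    def testGetReport_FailedJob_MentionsStatus(self):
        job = {'bug_id': 12345, 'status': 'failed'}
        # Asserting on the key part keeps the test stable if the
        # surrounding wording of the report changes.
        self.assertIn('Status: failed', GetReport(job))
```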

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/mod...
File dashboard/dashboard/models/try_job.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/mod...
dashboard/dashboard/models/try_job.py:36: # Bisect run status (e.g., started,
failed).
I think this comment actually doesn't tell you anything besides what the
property name and choices already tell you, so it could be removed.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/mod...
dashboard/dashboard/models/try_job.py:39: choices=['started', 'failed',
'staled', 'completed'],
At first I was thinking "stale" might be better than "staled", since it is a much
more common word than staled (see
https://www.google.com/trends/explore#q=stale%2C%20staled) and it's an adjective
(not just a verb) so it works here.

But now I think staled is also good, since it seems more consistent with the
other status strings which all end in "ed".

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
File dashboard/dashboard/post_bisect_results.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results.py:121: job.put()
When are the results posted to the issue tracker?

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results.py:122:
update_bug_with_results.UpdateQuickLog(job)
The name of this function indicates that all it does is update a TryJob entity
with results_data, and updating the quick log seems to be a separate thing.
Could this be moved outside (after _UpdateTryJob is called on line 71)?

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results.py:126: """Updates bisect job's result
data with known values."""
What other parts of the bisect job's result data might be updated?

If there's nothing else for now, then for now this function could be called
something more specific, like _SetResultsIssueURL.

Perhaps it could also take a results dict instead of a TryJob?

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results.py:131: def _IssueURL(job):
Optional docstring:

"""Returns a URL for information about a bisect try job."""

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
File dashboard/dashboard/post_bisect_results_test.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:29: 'change': '',
I wonder what the 'change' field is for -- is it regression vs improvement,
increase vs decrease, or relative change as a percentage?

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:60: }]
If you think it improves readability you could change the formatting here to:

    'revision_data': [
        {
            'depot_name': 'chromium',
            ...
        },
        {
            'depot_name': 'chromium',
            ...
        }
    ]

(Generally I feel that being able to see structure through indentation level is
more important than saving space.)

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:61: }
Closing brace should have no indent

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:142: {'values':
{'nested_values': 'orange'}})
Each of the failing and passing cases could be split into separate test methods,
e.g.

  testValidate_StringNotInOptionList_Fails
  testValidate_StringInOptionList_Passes
  testValidate_IntRequiredStringGiven_Fails
  testValidate_StringRequiredStringGiven_Passes
  testValidate_TypeNoneStringGiven_Passes
  etc.

The advantage of separating them into separate test methods is:
 1. When it fails, it's clear from the method name which behavior isn't working
 2. The test runner can run the methods in parallel, which is usually faster
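
A split along those lines might look like the sketch below. The `IsValid` helper is a simplified, hypothetical validator written only so the example runs on its own; the test method names follow the ones suggested above.

```python
import unittest

_STATUS_OPTIONS = ['completed', 'failed', 'pending', 'aborted']


def IsValid(expected, actual):
    """Hypothetical validator used only for this example.

    A list means allowed options, a type means required type, and None
    means no constraint.
    """
    if expected is None:
        return True
    if isinstance(expected, list):
        return actual in expected
    return isinstance(actual, expected)


class ValidateTest(unittest.TestCase):

    def testValidate_StringNotInOptionList_Fails(self):
        self.assertFalse(IsValid(_STATUS_OPTIONS, 'orange'))

    def testValidate_StringInOptionList_Passes(self):
        self.assertTrue(IsValid(_STATUS_OPTIONS, 'completed'))

    def testValidate_IntRequiredStringGiven_Fails(self):
        self.assertFalse(IsValid(int, '1'))

    def testValidate_TypeNoneStringGiven_Passes(self):
        self.assertTrue(IsValid(None, 'anything'))
```

With one behavior per method, a failure report names the broken case directly.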

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/upd...
File dashboard/dashboard/update_bug_with_results.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/upd...
dashboard/dashboard/update_bug_with_results.py:1: # Copyright 2016 The Chromium
Authors. All rights reserved.
Summary of changes in this file:

Removed:
  UnexpectedJsonError
  _CheckPerfTryJob
  _ParseCloudLinksFromOutput
  _LoadConfigFromString
  _GetPerfTryJobResults
  _GetBisectResults
  _FetchBuildData
  _GetBotFailureInfo
  _GetPartialBisectResult
  _ValidateAndConvertBuildbucketResponse
  _ValidateRietveldResponse
  _CheckBisectBotForInfraFailure
  _GetBisectScriptStepIndex
  _LogBisectInfraFailure
  _BisectResultIsPositive
  _BeautifyContent
  _FetchURL
  _FetchRietveldIssueJSON
  _RietveldIssueURL
  _BuildbucketStatusToStatusConstant

Changed/renamed:
  _PostFailedResult, _PostSuccessfulResult -> _PostResult
  _GetReviewersFromBisectLog -> _GetReviewersFromCulpritData

Added:
  _UpdateQuickLog

Does that look right?

Question: After this CL, tracing perf try job results (for try jobs started
using the trace button in the dashboard) won't be checked from here, right? If
this is the case, then you should file a bug to make sure that trace perf try
job results are surfaced; we may also want to send those to the dashboard.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/upd...
dashboard/dashboard/update_bug_with_results.py:316: return
If a report is always expected when this function is called, this could
potentially raise an error or log a warning.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/upd...
File dashboard/dashboard/update_bug_with_results_test.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/upd...
dashboard/dashboard/update_bug_with_results_test.py:69: }
Formatting; see post_bisect_results_test.py

chrisphan

4 years, 11 months ago (2016-01-20 21:47:19 UTC) #10

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/bisect_report.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/bisect_report.py:9: _CONFIDENCE_THRESHOLD = 99.5
On 2016/01/18 21:10:24, qyearsley wrote:
> This appears to be unused here now -- this "confidence threshold" is used to
> determine whether the bisect results "status" should be "Positive" or
> "Negative".
> 
> In the current draft of the auto-bisect side changes
> (https://codereview.chromium.org/1573293002), this is changed so that status
> doesn't indicate "Positive" or "Negative".
> 
> I think it seems OK to put the deciding of positive/negative status on the
> dashboard side, but it's also OK to decide on the bisect side and send it
> over.
> 
> Note that the "Positive"/"Negative" status was used to decide whether to
> update bugs or not -- bugs are only updated for positive results. I think we
> want to keep that behavior.

Acknowledged.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/bisect_report_test.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/bisect_report_test.py:64: }
On 2016/01/18 21:10:24, qyearsley wrote:
> Formatting should be adjusted here; see comments in
> post_bisect_results_test.py

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/bisect_report_test.py:186:
self.assertEqual(_LOG_FAILED_BISECT, bisect_report.GetReport(job))
On 2016/01/18 21:10:24, qyearsley wrote:
> I think that the literal expected string directly in the test method here is a
> good idea, even if it's very long and involves multi-line strings -- the
> reason is that this puts the example expected string in the context of where
> it's expected, and means that the reader doesn't need to scroll up and down.
> 
> Additionally, in some cases instead of using
>  
>  self.assertEqual(<full expected text>, bisect_report.GetReport(job))
> 
> it might be less fragile while keeping the same coverage if we use
> 
>  self.assertIn(<key part>, bisect_report.GetReport(job))

Done. Though I think tests can be used to see examples, and that would be easier
with the string at the top; it would also be reusable and easy to update.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/post_bisect_results.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results.py:121: job.put()
On 2016/01/18 21:10:25, qyearsley wrote:
> When is the results posted to the issue tracker?

In the update_bug_with_results.py cron job.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results.py:122:
update_bug_with_results.UpdateQuickLog(job)
On 2016/01/18 21:10:24, qyearsley wrote:
> The name of this function indicates that all it does is update a TryJob entity
> with results_data, and updating the quick log seems to be a separate thing.
> Could this be moved outside (after _UpdateTryJob is called on line 71)?

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results.py:126: """Updates bisect job's result
data with known values."""
On 2016/01/18 21:10:24, qyearsley wrote:
> What other parts of the bisect job's result data might be updated?

There were a few things, which I moved to the bot side.  I think there will be
others in the future.

> 
> If there's nothing else for now, then for now this function could be called
> something more specific, like _SetResultsIssueURL.

Done.  Moved to _UpdateTryJob.

> 
> Perhaps it could also take a results dict instead of a TryJob?

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results.py:131: def _IssueURL(job):
On 2016/01/18 21:10:25, qyearsley wrote:
> Optional docstring:
> 
> """Returns a URL for information about a bisect try job."""

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/post_bisect_results_test.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results_test.py:29: 'change': '',
On 2016/01/18 21:10:25, qyearsley wrote:
> I wonder what the 'change' field is for -- is it regression vs improvement,
> increase vs decrease, or relative change as a percentage?

For legacy bisect, I see either a relative change percentage or a string
message.  For the bisect recipe, I see a relative change percentage.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results_test.py:60: }]
On 2016/01/18 21:10:25, qyearsley wrote:
> If you think it improves readability you could change the formatting here to:
> 
>     'revision_data': [
>         {
>             'depot_name': 'chromium',
>             ...
>         },
>         {
>             'depot_name': 'chromium',
>             ...
>         }
>     ]
> 
> (Generally I feel that being able to see structure through indentation level
> is more important than saving space.)

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results_test.py:61: }
On 2016/01/18 21:10:25, qyearsley wrote:
> Closing brace should have no indent

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/post_bisect_results_test.py:142: {'values':
{'nested_values': 'orange'}})
On 2016/01/18 21:10:25, qyearsley wrote:
> Each of the failing and passing cases could be split into separate test
> methods, e.g.
> 
>   testValidate_StringNotInOptionList_Fails
>   testValidate_StringInOptionList_Passes
>   testValidate_IntRequiredStringGiven_Fails
>   testValidate_StringRequiredStringGiven_Passes
>   testValidate_TypeNoneStringGiven_Passes
>   etc.
> 
> The advantage of separating them into separate test methods is:
>  1. When it fails, it's clear from the method name which behavior isn't
>     working
>  2. The test runner can run the methods in parallel, which is usually faster

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/update_bug_with_results.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/update_bug_with_results.py:1: # Copyright 2016 The Chromium
Authors. All rights reserved.
On 2016/01/18 21:10:25, qyearsley wrote:
> Summary of changes in this file:
> 
> Removed:
>   UnexpectedJsonError
>   _CheckPerfTryJob
>   _ParseCloudLinksFromOutput
>   _LoadConfigFromString
>   _GetPerfTryJobResults
>   _GetBisectResults
>   _FetchBuildData
>   _GetBotFailureInfo
>   _GetPartialBisectResult
>   _ValidateAndConvertBuildbucketResponse
>   _ValidateRietveldResponse
>   _CheckBisectBotForInfraFailure
>   _GetBisectScriptStepIndex
>   _LogBisectInfraFailure
>   _BisectResultIsPositive
>   _BeautifyContent
>   _FetchURL
>   _FetchRietveldIssueJSON
>   _RietveldIssueURL
>   _BuildbucketStatusToStatusConstant
> 
> Changed/renamed:
>   _PostFailedResult, _PostSuccessfulResult -> _PostResult
>   _GetReviewersFromBisectLog -> _GetReviewersFromCulpritData
> 
> Added:
>   _UpdateQuickLog
> 
> Does that look right?

That's correct.

> 
> Question: After this CL, tracing perf try job results (for try jobs starting
> using the trace button in the dashboard) won't be checked from here, right? If
> this is the case, then you should file a bug to make sure that trace perf try
> job results are surfaced; we may also want to send those to the dashboard.

Perf try results should be uploaded as well in this CL: 1573293002, unless I'm
missing something?

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/update_bug_with_results.py:316: return
On 2016/01/18 21:10:25, qyearsley wrote:
> If a report is always expected when this function is called, this could
> potentially raise an error or log a warning.

Done.

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
File dashboard/dashboard/update_bug_with_results_test.py (right):

https://chromiumcodereview-hr.appspot.com/1566013002/diff/20001/dashboard/das...
dashboard/dashboard/update_bug_with_results_test.py:69: }
On 2016/01/18 21:10:25, qyearsley wrote:
> Formatting; see post_bisect_results_test.py

Done.

qyearsley

4 years, 11 months ago (2016-01-26 18:43:27 UTC) #11

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/bisect_...
File dashboard/dashboard/bisect_report_test.py (right):

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/bisect_...
dashboard/dashboard/bisect_report_test.py:159: pass
On 2016/01/13 00:32:54, chrisphan wrote:
> On 2016/01/11 22:16:44, qyearsley wrote:
> > Pending implementation.
> > 
> > When you implement this, it may be a good idea to avoid big global strings,
> > because they tend to make the tests more fragile and harder to read.
> > 
> > Even if you use a big literal string, if possible it's good to put them in
> > the one test method where they're used, and when possible, it's less fragile
> > to assert that just one or two key things is contained in the string.
> 
> Agree, in this case, I think it's useful to see the expected layout in a bug
> message.  LMK.

Putting the expected layout in a bug message is also OK, and I agree that it's
helpful to see at least one full expected bug message string in the test.

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/post_bi...
File dashboard/dashboard/post_bisect_results.py (right):

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/post_bi...
dashboard/dashboard/post_bisect_results.py:40: """Validates data parameter and
save to TryJob entity.
On 2016/01/13 00:35:35, chrisphan wrote:
> On 2016/01/13 00:32:54, chrisphan wrote:
> > On 2016/01/11 22:16:45, qyearsley wrote:
> > > s/save/saves/
> > > 
> > > Is there something that implicitly happens when _UpdateTryJob is called?
> > > When is a post made to crbug?
> > 
> > This handler will serve bots posting partial and full results.  So we'll
> > simply validate and store the results.  Posting to crbug will happen in the
> > same place, update_bug_with_results.py.
> 
> Also, we are updating quick-log.  So at all times, quick-log will have the
> latest bisect results.

Ah, that makes sense :-)

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/post_bi...
dashboard/dashboard/post_bisect_results.py:42: Bisect results come from a "data"
parameter, which is a JSON encoding of a
On 2016/01/13 00:32:54, chrisphan wrote:
> On 2016/01/11 22:16:45, qyearsley wrote:
> > I advocate changing the name from "data" to "bisect_results".
> 
> This comes from a generic post_json.py which is shared with posting data to
> /add_point.

Alright, seems good, especially since it's noted here that it's a JSON dict with
fields "master", "bot" and "test".

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/update_...
File dashboard/dashboard/update_bug_with_results_test.py (right):

https://codereview.chromium.org/1566013002/diff/1/dashboard/dashboard/update_...
dashboard/dashboard/update_bug_with_results_test.py:1: # Copyright 2016 The
Chromium Authors. All rights reserved.
On 2016/01/06 23:28:10, chrisphan wrote:
> Currently being worked on:
>  - update tests that are left here
>  - add applicable new tests
> 
> The tests that were removed are tests that are no longer applicable.  For
> example, testGet_PerfTryJob asserts that bisect results data exists, but with
> the posting method, data is only as good as the bots posting it.
> 

Acknowledged.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
File dashboard/dashboard/bisect_report.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report.py:9: _CONFIDENCE_THRESHOLD = 99.5
On 2016/01/20 21:47:18, chrisphan wrote:
> On 2016/01/18 21:10:24, qyearsley wrote:
> > This appears to be unused here now -- this "confidence threshold" is used to
> > determine whether the bisect results "status" should be "Positive" or
> > "Negative".
> > 
> > In the current draft of the auto-bisect side changes
> > (https://codereview.chromium.org/1573293002), this is changed so that status
> > doesn't indicate "Positive" or "Negative".
> > 
> > I think it seems OK to put the deciding of positive/negative status on the
> > dashboard side, but it's also OK to decide on the bisect side and send it
> > over.
> > 
> > Note that the "Positive"/"Negative" status was used to decide whether to
> > update bugs or not -- bugs are only updated for positive results. I think we
> > want to keep that behavior.
> 
> Acknowledged.

_CONFIDENCE_THRESHOLD is still unused; if it remains unused, it should be removed.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
File dashboard/dashboard/bisect_report_test.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report_test.py:186:
self.assertEqual(_LOG_FAILED_BISECT, bisect_report.GetReport(job))
On 2016/01/20 21:47:19, chrisphan wrote:
> On 2016/01/18 21:10:24, qyearsley wrote:
> > I think that the literal expected string directly in the test method here is
> > a good idea, even if it's very long and involves multi-line strings -- the
> > reason is that this puts the example expected string in the context of where
> > it's expected, and means that the reader doesn't need to scroll up and down.
> > 
> > Additionally, in some cases instead of using
> >  
> >  self.assertEqual(<full expected text>, bisect_report.GetReport(job))
> > 
> > it might be less fragile while keeping the same coverage if we use
> > 
> >  self.assertIn(<key part>, bisect_report.GetReport(job))
> 
> Done. Though I think tests can be used to see examples, and the samples would
> be easier to find at the top; plus they would be reusable and easy to update.

That's true, I agree.

Although, in the case of start_try_job_test and update_bug_with_results_test,
there are a lot of similar, long samples, and when reading the test methods below,
I felt like I had to jump up and down to see what was being tested.
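
As a rough illustration of the two assertion styles discussed in this thread, here is a minimal sketch. `get_report` is a hypothetical stand-in for bisect_report.GetReport, and the report text and job fields are invented for the example; the real report format differs.

```python
import unittest


def get_report(job):
  # Hypothetical stand-in for bisect_report.GetReport; not the real format.
  return (
      'Bisect job status: %s\n'
      'Bug ID: %d\n'
      'Test: %s\n' % (job['status'], job['bug_id'], job['test']))


class ReportTest(unittest.TestCase):

  JOB = {'status': 'failed', 'bug_id': 12345, 'test': 'page_cycler'}

  def testGetReport_FullText(self):
    # The full expected string sits next to the assertion, so the reader
    # sees the layout without scrolling. Any formatting change breaks it.
    expected = (
        'Bisect job status: failed\n'
        'Bug ID: 12345\n'
        'Test: page_cycler\n')
    self.assertEqual(expected, get_report(self.JOB))

  def testGetReport_KeyPart(self):
    # Checks only the part this test cares about, so unrelated layout
    # changes do not cause a failure.
    self.assertIn('Bisect job status: failed', get_report(self.JOB))
```

Keeping one full-text test per report type and using assertIn elsewhere is one way to get both a visible example and less fragile tests.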

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
File dashboard/dashboard/post_bisect_results.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results.py:121: job.put()
On 2016/01/20 21:47:19, chrisphan wrote:
> On 2016/01/18 21:10:25, qyearsley wrote:
> > When are the results posted to the issue tracker?
> 
> In the update_bug_with_results.py cron job.

Since this is one thing that I was uncertain about while reading the code
initially, I think it would be helpful to add a note here saying that the
results will actually be posted to the bug in update_bug_with_results.

Alternatively, we could also consider posting the results within this same
request. Possible advantages:
 (1) full results will be posted as soon as they're ready
 (2) this may allow the bisect to know whether or not the results failed to be
posted, and potentially output a warning there.

Either way is fine though.

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
File dashboard/dashboard/post_bisect_results_test.py (right):

https://codereview.chromium.org/1566013002/diff/20001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:29: 'change': '',
On 2016/01/20 21:47:19, chrisphan wrote:
> On 2016/01/18 21:10:25, qyearsley wrote:
> > I wonder what the 'change' field is for -- is it regression vs improvement,
> > increase vs decrease, or relative change as a percentage?
> 
> For legacy bisect, I see a relative change percentage or a string message;
> for the bisect recipe, a relative change percentage.

Ah, right. Could you put in a sample number (e.g. "10%")?

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/bis...
File dashboard/dashboard/bisect_report_test.py (right):

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report_test.py:47: 'result': 'good'
Indentation

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/bis...
dashboard/dashboard/bisect_report_test.py:56: },{
There should be a space after the comma.

Also, I slightly prefer putting the opening brace on its own line, e.g.

    {
        ...
    },
    {
        ...
    },

(This style makes it slightly easier to copy/cut/delete one dict in a list)

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/pos...
File dashboard/dashboard/post_bisect_results_test.py (right):

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/pos...
dashboard/dashboard/post_bisect_results_test.py:152: {'values':
{'nested_values': 'orange'}})
I like the way these tests are written now :-)

Of course, now these tests are in utils_test and after rebasing, you can now use
utils.Validate in post_bisect_results.

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/upd...
File dashboard/dashboard/update_bug_with_results.py (right):

https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/upd...
dashboard/dashboard/update_bug_with_results.py:259: emails =
[culprit_data['email']] or []
This will always be equivalent to [culprit_data['email']].

Even if culprit_data['email'] is None, a non-empty array is always considered
true so the other side of the `or` will never be used.

Maybe this should be:

emails = [culprit_data['email']] if culprit_data['email'] else []
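
The truthiness point can be checked in a few lines. This is a minimal sketch, assuming a `culprit_data` dict whose 'email' value may be None; the two helper functions and the sample address are hypothetical and only wrap the two expressions being compared.

```python
def emails_buggy(culprit_data):
  # A one-element list is truthy even when its element is None,
  # so the `or []` branch is never taken.
  return [culprit_data['email']] or []


def emails_fixed(culprit_data):
  # Test the value itself, not the list wrapped around it.
  return [culprit_data['email']] if culprit_data['email'] else []


print(emails_buggy({'email': None}))
print(emails_fixed({'email': None}))
print(emails_fixed({'email': 'author@example.com'}))
```

With a None email, the first expression yields a list containing None, while the conditional form yields an empty list.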

chrisphan

Rebased. https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/bisect_report_test.py File dashboard/dashboard/bisect_report_test.py (right): https://codereview.chromium.org/1566013002/diff/40001/dashboard/dashboard/bisect_report_test.py#newcode47 dashboard/dashboard/bisect_report_test.py:47: 'result': 'good' On 2016/01/26 18:43:27, qyearsley wrote: > ...

4 years, 10 months ago (2016-02-09 20:34:40 UTC) #12

prasadv

https://codereview.chromium.org/1566013002/diff/60001/dashboard/dashboard/bisect_fyi.py File dashboard/dashboard/bisect_fyi.py (right): https://codereview.chromium.org/1566013002/diff/60001/dashboard/dashboard/bisect_fyi.py#newcode128 dashboard/dashboard/bisect_fyi.py:128: expected = set(config.keys()) Here we actually want to compare ...

4 years, 10 months ago (2016-02-09 21:09:41 UTC) #13

chrisphan

https://chromiumcodereview-hr.appspot.com/1566013002/diff/60001/dashboard/dashboard/bisect_fyi.py File dashboard/dashboard/bisect_fyi.py (right): https://chromiumcodereview-hr.appspot.com/1566013002/diff/60001/dashboard/dashboard/bisect_fyi.py#newcode128 dashboard/dashboard/bisect_fyi.py:128: expected = set(config.keys()) On 2016/02/09 21:09:41, prasadv wrote: > ...

4 years, 10 months ago (2016-02-09 22:10:40 UTC) #14

chrisphan

https://chromiumcodereview-hr.appspot.com/1566013002/diff/100001/dashboard/dashboard/update_bug_with_results.py File dashboard/dashboard/update_bug_with_results.py (right): https://chromiumcodereview-hr.appspot.com/1566013002/diff/100001/dashboard/dashboard/update_bug_with_results.py#newcode260 dashboard/dashboard/update_bug_with_results.py:260: if results_data.get('score') < _CONFIDENCE_LEVEL_TO_CC_AUTHOR: Check confidence level before ccing ...

4 years, 10 months ago (2016-02-09 23:42:36 UTC) #15

chrisphan

On 2016/02/09 23:42:36, chrisphan wrote: > https://chromiumcodereview-hr.appspot.com/1566013002/diff/100001/dashboard/dashboard/update_bug_with_results.py > File dashboard/dashboard/update_bug_with_results.py (right): > > https://chromiumcodereview-hr.appspot.com/1566013002/diff/100001/dashboard/dashboard/update_bug_with_results.py#newcode260 > ...

4 years, 10 months ago (2016-02-16 19:13:36 UTC) #16

qyearsley

On 2016/02/16 19:13:36, chrisphan wrote: > On 2016/02/09 23:42:36, chrisphan wrote: > > > https://chromiumcodereview-hr.appspot.com/1566013002/diff/100001/dashboard/dashboard/update_bug_with_results.py ...

4 years, 10 months ago (2016-02-18 00:45:45 UTC) #17

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1566013002/140001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1566013002/140001

4 years, 10 months ago (2016-02-18 19:13:48 UTC) #20

commit-bot: I haz the power

Description was changed from ========== Add support for bisect bots to post results to dashboard. ...

4 years, 10 months ago (2016-02-18 19:24:42 UTC) #21

commit-bot: I haz the power

4 years, 10 months ago (2016-02-18 19:24:43 UTC) #22

Message was sent while issue was closed.

Committed patchset #8 (id:140001) as
https://chromium.googlesource.com/external/github.com/catapult-project/catapu...
