Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(1714)

Issue 268673007: Extracting page shingle hashes for similarity detection. (Closed)

Created:
6 years, 7 months ago by zysxqn
Modified:
6 years, 7 months ago
Reviewers:
mattm, noelutz, noé
CC:
chromium-reviews
Base URL:
https://chromium.googlesource.com/chromium/src.git@master
Visibility:
Public.

Description

Extracting page shingle hashes for similarity detection. BUG= Committed: https://src.chromium.org/viewvc/chrome?view=rev&revision=269976

Patch Set 1 #

Patch Set 2 : Fix some typo #

Patch Set 3 : Fix a reference problem. #

Total comments: 14

Patch Set 4 : Address 1st round comment #

Total comments: 13

Patch Set 5 : Address 2nd round comment #

Patch Set 6 : Address 3rd round comment #

Patch Set 7 : Fix a nit #

Total comments: 12

Patch Set 8 : Address 4th round comment. #

Unified diffs Side-by-side diffs Delta from patch set Stats (+246 lines, -48 lines) Patch
M chrome/common/safe_browsing/client_model.proto View 1 2 3 1 chunk +6 lines, -0 lines 0 comments Download
M chrome/common/safe_browsing/csd.proto View 1 2 3 4 1 chunk +3 lines, -0 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_classifier.h View 2 chunks +3 lines, -0 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_classifier.cc View 1 2 3 4 chunks +9 lines, -0 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_classifier_browsertest.cc View 1 2 3 4 1 chunk +2 lines, -0 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_term_feature_extractor.h View 1 2 3 4 5 6 7 7 chunks +17 lines, -13 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc View 1 2 3 4 5 6 7 7 chunks +31 lines, -17 lines 0 comments Download
M chrome/renderer/safe_browsing/phishing_term_feature_extractor_unittest.cc View 1 2 3 4 5 10 chunks +156 lines, -17 lines 0 comments Download
M chrome/renderer/safe_browsing/scorer.h View 1 2 3 2 chunks +7 lines, -1 line 0 comments Download
M chrome/renderer/safe_browsing/scorer.cc View 1 2 3 1 chunk +8 lines, -0 lines 0 comments Download
M chrome/renderer/safe_browsing/scorer_unittest.cc View 1 2 3 2 chunks +4 lines, -0 lines 0 comments Download

Messages

Total messages: 19 (0 generated)
zysxqn
6 years, 7 months ago (2014-05-01 21:11:54 UTC) #1
mattm
https://codereview.chromium.org/268673007/diff/30001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/30001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode252 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:252: } each of the cases here duplicates the code ...
6 years, 7 months ago (2014-05-06 01:00:14 UTC) #2
zysxqn
https://codereview.chromium.org/268673007/diff/30001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/30001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode252 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:252: } On 2014/05/06 01:00:14, mattm wrote: > each of ...
6 years, 7 months ago (2014-05-06 20:56:57 UTC) #3
noelutz
Nice. A few comments. noe. https://codereview.chromium.org/268673007/diff/50001/chrome/common/safe_browsing/client_model.proto File chrome/common/safe_browsing/client_model.proto (right): https://codereview.chromium.org/268673007/diff/50001/chrome/common/safe_browsing/client_model.proto#newcode93 chrome/common/safe_browsing/client_model.proto:93: optional int32 max_shingles_per_page = ...
6 years, 7 months ago (2014-05-06 21:40:18 UTC) #4
zysxqn
https://codereview.chromium.org/268673007/diff/50001/chrome/common/safe_browsing/client_model.proto File chrome/common/safe_browsing/client_model.proto (right): https://codereview.chromium.org/268673007/diff/50001/chrome/common/safe_browsing/client_model.proto#newcode93 chrome/common/safe_browsing/client_model.proto:93: optional int32 max_shingles_per_page = 9 [default = 200]; On ...
6 years, 7 months ago (2014-05-07 19:29:18 UTC) #5
zysxqn
Please have another look. Thanks! https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode230 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:230: if (shingle_hashes_->size() < max_shingles_per_page_) ...
6 years, 7 months ago (2014-05-09 21:34:24 UTC) #6
mattm
https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode230 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:230: if (shingle_hashes_->size() < max_shingles_per_page_) { On 2014/05/09 21:34:24, zysxqn ...
6 years, 7 months ago (2014-05-09 23:28:10 UTC) #7
noelutz
https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode230 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:230: if (shingle_hashes_->size() < max_shingles_per_page_) { On 2014/05/09 23:28:10, mattm ...
6 years, 7 months ago (2014-05-10 01:01:19 UTC) #8
zysxqn
On 2014/05/10 01:01:19, noelutz wrote: > https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc > File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): > > https://codereview.chromium.org/268673007/diff/50001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode230 > ...
6 years, 7 months ago (2014-05-12 17:43:20 UTC) #9
zysxqn
No changes of the code. But replies to the previous comments. Please have another look. ...
6 years, 7 months ago (2014-05-12 17:43:56 UTC) #10
noelutz
lgtm Please wait for Matt's OK was well.
6 years, 7 months ago (2014-05-12 18:01:29 UTC) #11
mattm
https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode248 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:248: // which are relatively expensive. On 2014/05/12 17:43:57, zysxqn ...
6 years, 7 months ago (2014-05-12 20:08:14 UTC) #12
zysxqn
https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode248 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:248: // which are relatively expensive. On 2014/05/12 20:08:14, mattm ...
6 years, 7 months ago (2014-05-12 21:09:33 UTC) #13
noelutz
lgtm https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode248 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:248: // which are relatively expensive. On 2014/05/12 21:09:34, ...
6 years, 7 months ago (2014-05-12 21:51:08 UTC) #14
zysxqn
https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc File chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc (right): https://codereview.chromium.org/268673007/diff/110001/chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc#newcode248 chrome/renderer/safe_browsing/phishing_term_feature_extractor.cc:248: // which are relatively expensive. On 2014/05/12 20:08:14, mattm ...
6 years, 7 months ago (2014-05-12 22:08:56 UTC) #15
mattm
lgtm
6 years, 7 months ago (2014-05-12 22:18:15 UTC) #16
zysxqn
The CQ bit was checked by zysxqn@google.com
6 years, 7 months ago (2014-05-12 22:23:13 UTC) #17
commit-bot: I haz the power
CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/zysxqn@google.com/268673007/130001
6 years, 7 months ago (2014-05-12 22:24:39 UTC) #18
commit-bot: I haz the power
6 years, 7 months ago (2014-05-13 02:54:57 UTC) #19
Message was sent while issue was closed.
Change committed as 269976

Powered by Google App Engine
This is Rietveld 408576698