Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(379)

Side by Side Diff: chrome/common/safe_browsing/client_model.proto

Issue 268673007: Extracting page shingle hashes for similarity detection. (Closed) Base URL: https://chromium.googlesource.com/chromium/src.git@master
Patch Set: Fix a reference problem. Created 6 years, 7 months ago
Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.
Jump to:
View unified diff | Download patch
OLDNEW
1 // Copyright (c) 2011 The Chromium Authors. All rights reserved. 1 // Copyright (c) 2011 The Chromium Authors. All rights reserved.
2 // Use of this source code is governed by a BSD-style license that can be 2 // Use of this source code is governed by a BSD-style license that can be
3 // found in the LICENSE file. 3 // found in the LICENSE file.
4 // 4 //
5 // This proto represents a machine learning model which is used to compute 5 // This proto represents a machine learning model which is used to compute
6 // the probability that a particular page visited by Chrome is phishing. 6 // the probability that a particular page visited by Chrome is phishing.
7 // 7 //
8 // Note: sine the machine learning model is trained on the server-side and then 8 // Note: sine the machine learning model is trained on the server-side and then
9 // downloaded onto the client it is important that this proto file stays in 9 // downloaded onto the client it is important that this proto file stays in
10 // sync with the server-side copy. Otherwise, the client may not be able to 10 // sync with the server-side copy. Otherwise, the client may not be able to
(...skipping 70 matching lines...) Expand 10 before | Expand all | Expand 10 after
81 // is hashed using sha256. 81 // is hashed using sha256.
82 required bytes prefix = 1; 82 required bytes prefix = 1;
83 83
84 // Network prefix size in bits. Default is an exact-host match. 84 // Network prefix size in bits. Default is an exact-host match.
85 optional int32 size = 2 [default = 128]; 85 optional int32 size = 2 [default = 128];
86 }; 86 };
87 repeated IPSubnet bad_subnet = 7; 87 repeated IPSubnet bad_subnet = 7;
88 88
89 // Murmur hash seed that was used to hash the page words. 89 // Murmur hash seed that was used to hash the page words.
90 optional fixed32 murmur_hash_seed = 8; 90 optional fixed32 murmur_hash_seed = 8;
91
92 // Maximum number of unique shingle hashes per page.
93 optional int32 max_hashes_per_page = 9 [default = 200];
94
95 // The number of words in a shingle.
96 optional int32 shingle_size = 10 [default = 4];
91 } 97 }
OLDNEW
« no previous file with comments | « no previous file | chrome/common/safe_browsing/csd.proto » ('j') | chrome/renderer/safe_browsing/phishing_term_feature_extractor.h » ('J')

Powered by Google App Engine
This is Rietveld 408576698