Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(10)

Issue 6286072: PrefixSet as an alternate to BloomFilter for safe-browsing. (Closed)

Created:
9 years, 10 months ago by Scott Hess - ex-Googler
Modified:
9 years, 6 months ago
Reviewers:
lzheng1, lzheng
CC:
chromium-reviews, Paweł Hajdan Jr., Erik does not do reviews
Visibility:
Public.

Description

PrefixSet as an alternate to BloomFilter for safe-browsing. The safe-browsing prefix data is uniformly distributed across the 32-bit integer space. When sorted, the average delta between items is about 8,000, which can be encoded in a 16-bit integer. PrefixSet takes advantage of this to compress the prefixes into a structure which is relatively efficient to query. The current CL adds the new structure, but continues to use the bloom filter to control things. Histograms are logged to track differences in results. BUG=71832 TEST=none Committed: http://src.chromium.org/viewvc/chrome?view=rev&revision=74487

Patch Set 1 #

Patch Set 2 : Fix windows build. #

Total comments: 7

Patch Set 3 : Example of how a specific set would be stored. #

Unified diffs Side-by-side diffs Delta from patch set Stats (+382 lines, -0 lines) Patch
A chrome/browser/safe_browsing/prefix_set.h View 1 2 1 chunk +92 lines, -0 lines 0 comments Download
A chrome/browser/safe_browsing/prefix_set.cc View 1 1 chunk +104 lines, -0 lines 0 comments Download
A chrome/browser/safe_browsing/prefix_set_unittest.cc View 1 1 chunk +120 lines, -0 lines 0 comments Download
M chrome/browser/safe_browsing/safe_browsing_database.h View 2 chunks +7 lines, -0 lines 0 comments Download
M chrome/browser/safe_browsing/safe_browsing_database.cc View 7 chunks +56 lines, -0 lines 0 comments Download
M chrome/chrome_browser.gypi View 1 2 1 chunk +2 lines, -0 lines 0 comments Download
M chrome/chrome_tests.gypi View 1 2 1 chunk +1 line, -0 lines 0 comments Download

Messages

Total messages: 8 (0 generated)
Scott Hess - ex-Googler
Awhile back I did some experimentation with different ways this could be stored. On Tuesday, ...
9 years, 10 months ago (2011-02-03 01:43:33 UTC) #1
lzheng
looks good. I haven't take a look at the test yet, I will take spend ...
9 years, 10 months ago (2011-02-05 01:03:42 UTC) #2
Scott Hess - ex-Googler
On 2011/02/05 01:03:42, lzheng wrote: > looks good. I haven't take a look at the ...
9 years, 10 months ago (2011-02-05 01:28:42 UTC) #3
Scott Hess - ex-Googler
OK, I'm done changing things. http://codereview.chromium.org/6286072/diff/3001/chrome/browser/safe_browsing/prefix_set.cc File chrome/browser/safe_browsing/prefix_set.cc (right): http://codereview.chromium.org/6286072/diff/3001/chrome/browser/safe_browsing/prefix_set.cc#newcode42 chrome/browser/safe_browsing/prefix_set.cc:42: unsigned delta = sorted[i] ...
9 years, 10 months ago (2011-02-07 19:20:23 UTC) #4
lzheng
http://codereview.chromium.org/6286072/diff/3001/chrome/browser/safe_browsing/prefix_set.cc File chrome/browser/safe_browsing/prefix_set.cc (right): http://codereview.chromium.org/6286072/diff/3001/chrome/browser/safe_browsing/prefix_set.cc#newcode42 chrome/browser/safe_browsing/prefix_set.cc:42: unsigned delta = sorted[i] - prev_prefix; I was thinking ...
9 years, 10 months ago (2011-02-08 01:18:11 UTC) #5
lzheng1
Good analysis. Indeed, we should choose the best k based on m/n ratio. If we ...
9 years, 10 months ago (2011-02-08 02:12:47 UTC) #6
Scott Hess - ex-Googler
Originally, the speedup from using the BloomFilter was that you didn't have to hit the ...
9 years, 10 months ago (2011-02-08 05:10:30 UTC) #7
lzheng
9 years, 10 months ago (2011-02-08 18:19:56 UTC) #8
LGTM.

Powered by Google App Engine
This is Rietveld 408576698