Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(93)

Side by Side Diff: components/url_formatter/top_domains/README

Issue 2784933002: Mitigate spoofing attempt using Latin letters. (Closed)
Patch Set: add similarity check unittests Created 3 years, 8 months ago
Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.
Jump to:
View unified diff | Download patch
OLDNEW
(Empty)
1 * alexa_10k_domains.list
2 It is an input to make_top_domain_list and is made up of list of Alexa
3 top 10k domains (one per line).
4 It's derived from
5 src/tools/perf/page_sets/alexa1-10000-urls.json by running the following:
ncarter (slow) 2017/04/20 22:26:59 IIRC the alexa10000 from page_sets was almost five
6
7 grep http ../../../tools/perf/page_sets/alexa1-10000-urls.json | \
8 sed -r -e 's;^.*"https?://(.*)/".*$;\1;' -e 's/www\.//' | \
9 awk 'BEGIN {FS="."} { printf("%s%s\n", NF > 3 ? "#" : "", $0); } \
10 END {printf ("# for testing\ndigklmo68.com\ndigklmo68.co.uk\n");}' > \
ncarter (slow) 2017/04/20 22:26:58 This would probably be better as a python script,
11 alexa_10k_domains.list
12
13 * alexa_10k_names_and_skeletons.gperf
14
15 It is generated by running make_top_domain_list and checked in.
16 No command line argument needs to be passed.
17
18 $ ninja -C $build_outdir make_top_domain_list
19 $ $build_outdir/make_top_domain_list
20
21 During a build, it is processed by base/dafsa/make_dafsa.py to generate
22 alexa_10k_names_and_skeletons-inc.cc that is included by
23 components/url_formatter/url_formatter.cc
OLDNEW

Powered by Google App Engine
This is Rietveld 408576698