OLD | NEW |
---|---|
(Empty) | |
1 * alexa_10k_domains.list | |
2 It is an input to make_top_domain_list and is made up of list of Alexa | |
3 top 10k domains (one per line). | |
4 It's derived from | |
5 src/tools/perf/page_sets/alexa1-10000-urls.json by running the following: | |
ncarter (slow)
2017/04/20 22:26:59
IIRC the alexa10000 from page_sets was almost five
| |
6 | |
7 grep http ../../../tools/perf/page_sets/alexa1-10000-urls.json | \ | |
8 sed -r -e 's;^.*"https?://(.*)/".*$;\1;' -e 's/www\.//' | \ | |
9 awk 'BEGIN {FS="."} { printf("%s%s\n", NF > 3 ? "#" : "", $0); } \ | |
10 END {printf ("# for testing\ndigklmo68.com\ndigklmo68.co.uk\n");}' > \ | |
ncarter (slow)
2017/04/20 22:26:58
This would probably be better as a python script,
| |
11 alexa_10k_domains.list | |
12 | |
13 * alexa_10k_names_and_skeletons.gperf | |
14 | |
15 It is generated by running make_top_domain_list and checked in. | |
16 No command line argument needs to be passed. | |
17 | |
18 $ ninja -C $build_outdir make_top_domain_list | |
19 $ $build_outdir/make_top_domain_list | |
20 | |
21 During a build, it is processed by base/dafsa/make_dafsa.py to generate | |
22 alexa_10k_names_and_skeletons-inc.cc that is included by | |
23 components/url_formatter/url_formatter.cc | |
OLD | NEW |