Descriptionadd options to specify original domain(s)
- this is part of the implementation of scoring mechanism for next/prev page
links; details are in http://codereview.chromium.org/661883003.
- add options to standalone content extractor to specify original domain if page
being processed is a file://
1) --original-domain pairs with --url
2) --original-domains pairs with --urls, delimited by ';', 1st domain is for
1st url, etc. eval tool will invoke content extractor with this option.
- for --original-domains, different URLs require different DomDistillerOptions
to be passed to GWT distiller as individual --original-domain:
1) rename DistillerFactory::CreateDistiller() to CreateDistillerForUrl(GURL)
2) impl new DistillerFactory class for content extractor to create different
options for different url's, i.e. a specific original domain for each url.
BUG=425952
Committed: https://crrev.com/8240530c2f8395ab9a5d8b63d72247fe01ee8c87
Cr-Commit-Position: refs/heads/master@{#301959}
Patch Set 1 #
Total comments: 4
Patch Set 2 : addressed comment #
Messages
Total messages: 9 (2 generated)
|