DescriptionSkip unrecognized iframes
IFrames are usually useless unless they are recognized by embed extractors.
The result is mostly good. The exception is documented at:
http://crbug.com/641678
Score changes:
https://x20web.corp.google.com/~wychen/domdistillerscore/noiframe/knowledge.html
Average precision: 0.959 → 0.961
https://x20web.corp.google.com/~wychen/domdistillerscore/noiframe/multi-page.html
Average precision: 0.746 → 0.746+
https://x20web.corp.google.com/~wychen/domdistillerscore/noiframe/page-links.html
Average precision: 0.919 → 0.920
https://x20web.corp.google.com/~wychen/domdistillerscore/noiframe/reader-golden.html
Average precision: 0.945 → 0.946
The recall is not changed because less things are extracted.
BUG=641678
R=mdjones@chromium.org
Committed: 50efabe82fa19b3bb9d44e8812907d447d9746a5
Patch Set 1 #
Total comments: 2
Patch Set 2 : address comment #
Depends on Patchset: Dependent Patchsets: Messages
Total messages: 7 (2 generated)
|