Description
Roll dom_distiller_js and add UMA for word count for distilled pages.
For every successfully distilled article, a word count is
submitted to UMA.
The histogram ranges from 1 to 4000 words with 50 buckets.
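To illustrate what a 50-bucket histogram over 1 to 4000 words could look like, here is a minimal standalone sketch. It is not the Chromium implementation: the helper names BucketLowerBounds and BucketIndexFor are hypothetical, and the roughly geometric spacing of bucket boundaries is an assumption made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not a Chromium API): returns the lower bound of each
// bucket. Index 0 is an underflow bucket starting at 0, index 1 starts at
// |minimum|, and the last bucket starts at |maximum| and catches overflow.
// Spacing is assumed to be roughly geometric.
std::vector<int> BucketLowerBounds(int minimum, int maximum,
                                   std::size_t bucket_count) {
  assert(minimum >= 1 && maximum > minimum && bucket_count >= 3);
  std::vector<int> bounds(bucket_count, 0);
  const double log_max = std::log(static_cast<double>(maximum));
  int current = minimum;
  bounds[1] = current;
  for (std::size_t i = 2; i < bucket_count; ++i) {
    // Spread the remaining log-range evenly over the remaining buckets.
    const double log_current = std::log(static_cast<double>(current));
    const double log_ratio =
        (log_max - log_current) / static_cast<double>(bucket_count - i);
    const int next =
        static_cast<int>(std::lround(std::exp(log_current + log_ratio)));
    // Fall back to a width-1 bucket when rounding gives no progress.
    current = (next > current) ? next : current + 1;
    bounds[i] = current;
  }
  return bounds;
}

// Hypothetical helper: index of the bucket that a word count falls into.
// Negative samples are clamped to 0; large samples land in the last bucket.
std::size_t BucketIndexFor(int sample, const std::vector<int>& bounds) {
  const int clamped = std::max(sample, 0);
  const auto it = std::upper_bound(bounds.begin(), bounds.end(), clamped);
  return static_cast<std::size_t>(it - bounds.begin()) - 1;
}
```

Under these assumptions, calling BucketLowerBounds(1, 4000, 50) gives narrow buckets for short articles and progressively wider ones toward 4000 words, so any article of 4000 words or more is counted in the final bucket.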
Changes rolled in from the DOM Distiller repo:
bbf7c01 Add StatisticsInfo to DomDistillerResult proto for number of words.
fc1a5c1 treat non-breaking space as whitespace
de38c78 Expand usage of SimilarSiblingContentExpansion
444a55e reorder table tests
5ff895b add and fix missing tests to suite
970a419 Add SimilarSiblingContentExpansion
da76b1e add new table classification heuristic
BUG=417049
Committed: https://crrev.com/940e7511e50d7afbe62a1df985bec4d07739c73a
Cr-Commit-Position: refs/heads/master@{#296986}
Patch Set 1
Total comments: 4
Patch Set 2: Addressed comments from isherman and rolled dom distiller
Messages
Total messages: 11 (3 generated)