Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(485)

Unified Diff: java/org/chromium/distiller/webdocument/WebText.java

Issue 2401853004: Strip unwanted classNames from all nodes (Closed)
Patch Set: address comments Created 4 years, 2 months ago
Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.
Jump to:
View side-by-side diff with in-line comments
Download patch
Index: java/org/chromium/distiller/webdocument/WebText.java
diff --git a/java/org/chromium/distiller/webdocument/WebText.java b/java/org/chromium/distiller/webdocument/WebText.java
index 95827fb58e5a9502221ec6b0c87446bcdd24928e..1f2b85ea85dd129ca908893e51553255947b4984 100644
--- a/java/org/chromium/distiller/webdocument/WebText.java
+++ b/java/org/chromium/distiller/webdocument/WebText.java
@@ -133,6 +133,7 @@ public class WebText extends WebElement {
DomUtil.makeAllLinksAbsolute(clonedRoot);
DomUtil.stripTargetAttributes(clonedRoot);
DomUtil.stripIds(clonedRoot);
+ DomUtil.stripUnwantedClassNames(clonedRoot);
DomUtil.stripFontColorAttributes(clonedRoot);
DomUtil.stripStyleAttributes(clonedRoot);
// TODO(wychen): if we allow images in WebText later, add stripImageElements().
« no previous file with comments | « java/org/chromium/distiller/DomUtil.java ('k') | javatests/org/chromium/distiller/ContentExtractorTest.java » ('j') | no next file with comments »

Powered by Google App Engine
This is Rietveld 408576698