Chromium Code Reviews

Unified Diff: java/org/chromium/distiller/ContentExtractor.java

Issue 1230583006: Fix for keeping lists structure (Closed) Base URL: https://github.com/chromium/dom-distiller.git@master
Patch Set: StackEntry removed, using WebTag content flag instead. Created 5 years, 4 months ago
Index: java/org/chromium/distiller/ContentExtractor.java
diff --git a/java/org/chromium/distiller/ContentExtractor.java b/java/org/chromium/distiller/ContentExtractor.java
index b9f0b2da0eab6d305cb30548856946b32e971228..7d16dd87f500b846f87830dab7eed78d237e5eac 100644
--- a/java/org/chromium/distiller/ContentExtractor.java
+++ b/java/org/chromium/distiller/ContentExtractor.java
@@ -21,6 +21,7 @@ import com.google.gwt.dom.client.Document;
import com.google.gwt.dom.client.Element;
import com.google.gwt.dom.client.Node;
import com.google.gwt.dom.client.NodeList;
+import org.chromium.distiller.webdocument.filters.WebTagStructureKeeper;
import java.util.ArrayList;
import java.util.LinkedList;
@@ -92,6 +93,7 @@ public class ContentExtractor {
now = DomUtil.getTime();
processDocument(documentInfo.document);
RelevantElements.process(documentInfo.document);
+ WebTagStructureKeeper.process(documentInfo.document);
[Review comment] mdjones, 2015/08/03 23:29:45: This should probably be the last filter to run sin…
LeadImageFinder.process(documentInfo.document);
List<WebImage> images = documentInfo.document.getContentImages();
