java/org/chromium/distiller/ContentExtractor.java - Issue 1230583006: Fix for keeping lists structure

Keyboard Shortcuts

	File
u :	up to issue
j / k :	jump to file after / before current file
J / K :	jump to next file with a comment after / before current file
	Side-by-side diff
i :	toggle intra-line diffs
e :	expand all comments
c :	collapse all comments
s :	toggle showing all comments
n / p :	next / previous diff chunk or comment
N / P :	next / previous comment
<Up> / <Down> :	next / previous line

	Issue
u :	up to list of issues
j / k :	jump to patch after / before current patch
o / <Enter> :	open current patch in side-by-side view
i :	open current patch in unified diff view

	Issue List
j / k :	jump to issue after / before current issue
o / <Enter> :	open current issue

Unified Diff: java/org/chromium/distiller/ContentExtractor.java

Issue 1230583006: Fix for keeping lists structure (Closed) Base URL: https://github.com/chromium/dom-distiller.git@master

Patch Set: StackEntry removed, using WebTag content flag instead. Created 5 years, 4 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

Index: java/org/chromium/distiller/ContentExtractor.java

diff --git a/java/org/chromium/distiller/ContentExtractor.java b/java/org/chromium/distiller/ContentExtractor.java

index b9f0b2da0eab6d305cb30548856946b32e971228..7d16dd87f500b846f87830dab7eed78d237e5eac 100644

--- a/java/org/chromium/distiller/ContentExtractor.java

+++ b/java/org/chromium/distiller/ContentExtractor.java

@@ -21,6 +21,7 @@ import com.google.gwt.dom.client.Document;

import com.google.gwt.dom.client.Element;

import com.google.gwt.dom.client.Node;

import com.google.gwt.dom.client.NodeList;

+import org.chromium.distiller.webdocument.filters.WebTagStructureKeeper;

import java.util.ArrayList;

import java.util.LinkedList;

@@ -92,6 +93,7 @@ public class ContentExtractor {

now = DomUtil.getTime();

processDocument(documentInfo.document);

RelevantElements.process(documentInfo.document);

+ WebTagStructureKeeper.process(documentInfo.document);

mdjones 2015/08/03 23:29:45 This should probably be the last filter to run sin

LeadImageFinder.process(documentInfo.document);

List<WebImage> images = documentInfo.document.getContentImages();