Issue 77553004: Defer setting type in the WebVTT tokenizer until emitting the token

Issue 77553004: Defer setting type in the WebVTT tokenizer until emitting the token (Closed)

Created:
7 years, 1 month ago by fs

Modified:
7 years, 1 month ago

Reviewers:
Nate Chapin, jochen (gone - plz use gerrit)

CC:
blink-reviews, nessy, philipj_slow, gasubic, feature-media-reviews_chromium.org, dglazkov+blink, adamk+blink_chromium.org, vcarbune.chromium

Base URL:
https://chromium.googlesource.com/chromium/blink.git@master

Visibility:
Public.

More Reviews

Description

Defer setting type in the WebVTT tokenizer until emitting the token Remove setting of the token type from begin* (and analog) methods in VTTToken, and add a new setType. Call setType from the token-emitting methods in VTTTokenizer. Remove ASSERTs that no longer apply. Because of this change, VTTTokenizer::haveBufferedCharacterToken can no longer return a value based on the type of token - make it return false always (there should never be a buffered character token - a token should be emitted after each and every call to nextToken that returns true). This also means that "EOF" (read: end-of-string) handling needs to be improved. Adopt the method from the HTML parser, that appends a segment with an EOF marker (a NUL) to the input, and adjust EOF handling to match (make sure to consume the EOF, and exit early and return false if encountering a EOF mark at the start of the FSM). While doing this, also hide the "implementation detail" that SegmentedString is used, by just passing a String to the tokenizer via the constructor. After doing the above, it becomes apparent that the EndTagOpenState is redundant with the EndTagState, so they can be merged. (The former state does not appear in the spec text.) A number of end-of-input cases are fixed: tags.html - "<c." now parses correctly. timestamp.html - "<00:00:00.500" now parses correctly. New tests: entities.html - (A number of FAIL -> PASS transitions here compared to previously - due to actually setting the correct token- type for content with only an entity (or something looking like an entity. Also no longer triggers an assert in Debug.) BUG=319391 Committed: https://src.chromium.org/viewvc/blink?view=rev&revision=162373

Patch Set 1 #

Total comments: 2

Created: 7 years, 1 month ago

Download [raw] [tar.bz2]

	Unified diffs	Side-by-side diffs	Stats (+87 lines, -62 lines)			Patch
A	LayoutTests/media/track/opera/track/webvtt/parsing-cue-data/tests/entities.html	View	1 chunk	+31 lines, -0 lines	0 comments	Download
A	LayoutTests/media/track/opera/track/webvtt/parsing-cue-data/tests/entities-expected.txt	View	1 chunk	+23 lines, -0 lines	0 comments	Download
M	LayoutTests/media/track/opera/track/webvtt/parsing-cue-data/tests/tags-expected.txt	View	1 chunk	+1 line, -1 line	0 comments	Download
M	LayoutTests/media/track/opera/track/webvtt/parsing-cue-data/tests/timestamps-expected.txt	View	1 chunk	+2 lines, -2 lines	0 comments	Download
M	Source/core/html/track/vtt/VTTParser.cpp	View	1 chunk	+2 lines, -3 lines	0 comments	Download
M	Source/core/html/track/vtt/VTTToken.h	View	5 chunks	+1 line, -25 lines	1 comment	Download
M	Source/core/html/track/vtt/VTTTokenizer.h	View	3 chunks	+5 lines, -12 lines	1 comment	Download
M	Source/core/html/track/vtt/VTTTokenizer.cpp	View	7 chunks	+22 lines, -19 lines	0 comments	Download

Messages

Total messages: 6 (0 generated)

Expand Messages | Collapse Messages

jochen (gone - plz use gerrit)

https://codereview.chromium.org/77553004/diff/1/Source/core/html/track/vtt/VTTToken.h File Source/core/html/track/vtt/VTTToken.h (right): https://codereview.chromium.org/77553004/diff/1/Source/core/html/track/vtt/VTTToken.h#newcode107 Source/core/html/track/vtt/VTTToken.h:107: void beginTimestampTag(UChar character) what's the point of having all ...

7 years, 1 month ago (2013-11-20 14:09:37 UTC) #2

On 2013/11/20 14:09:37, jochen wrote: > https://codereview.chromium.org/77553004/diff/1/Source/core/html/track/vtt/VTTToken.h > File Source/core/html/track/vtt/VTTToken.h (right): > > https://codereview.chromium.org/77553004/diff/1/Source/core/html/track/vtt/VTTToken.h#newcode107 > ...

7 years, 1 month ago (2013-11-20 14:21:28 UTC) #3

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/fs@opera.com/77553004/1

7 years, 1 month ago (2013-11-20 14:29:47 UTC) #5

Message was sent while issue was closed.

Change committed as 162373

Expand Messages | Collapse Messages