vm/unicode.cc - Issue 11419259: Fix bug in Utf8::CodePointCount which was causing some strings with latin1

Unified Diff: vm/unicode.cc

Issue 11419259: Fix bug in Utf8::CodePointCount which was causing some strings with latin1 (Closed)
Base URL: http://dart.googlecode.com/svn/branches/bleeding_edge/dart/runtime/

Patch Set: Created 8 years, 1 month ago

Index: vm/unicode.cc
===================================================================
--- vm/unicode.cc (revision 15591)
+++ vm/unicode.cc (working copy)
@@ -53,23 +53,25 @@
 };
-// Returns a count of the number of UTF-8 trail bytes.
-intptr_t Utf8::CodePointCount(const uint8_t* utf8_array,
-                              intptr_t array_len,
-                              Type* type) {
+// Returns the most restricted coding form in which the sequence of utf8
+// characters in 'utf8_array' can be represented in, and the number of
+// code units needed in that form.
+intptr_t Utf8::CodeUnitCount(const uint8_t* utf8_array,
+                             intptr_t array_len,
+                             Type* type) {
   intptr_t len = 0;
   Type char_type = kLatin1;
   for (intptr_t i = 0; i < array_len; i++) {
     uint8_t code_unit = utf8_array[i];
     if (!IsTrailByte(code_unit)) {
       ++len;
-    }
-    if (!IsLatin1SequenceStart(code_unit)) { // > U+00FF
-      if (IsSupplementarySequenceStart(code_unit)) { // >= U+10000
-        char_type = kSupplementary;
-        ++len;
-      } else if (char_type == kLatin1) {
-        char_type = kBMP;
+      if (!IsLatin1SequenceStart(code_unit)) { // > U+00FF
+        if (IsSupplementarySequenceStart(code_unit)) { // >= U+10000
+          char_type = kSupplementary;
+          ++len;
+        } else if (char_type == kLatin1) {
+          char_type = kBMP;
+        }
       }
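
For readers who want to try the patched loop outside the VM, here is a minimal standalone sketch of the new control flow, in which the coding-form check runs only for non-trail bytes. Several things in it are assumptions rather than part of this patch: the `Type` enum and the three predicates are hypothetical stand-ins for the declarations in vm/unicode.h (their byte thresholds are guesses based on the UTF-8 encoding and the comments in the diff), the function is a free function instead of a `Utf8` member, and the tail that stores `char_type` and returns `len` is filled in by assumption because the hunk above is cut off before the end of the function.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the Type enum declared in vm/unicode.h.
enum Type { kLatin1, kBMP, kSupplementary };

// Assumption: a UTF-8 trail byte has the bit pattern 10xxxxxx.
static bool IsTrailByte(uint8_t code_unit) {
  return (code_unit & 0xC0) == 0x80;
}

// Assumption: lead bytes up to 0xC3 start code points <= U+00FF.
static bool IsLatin1SequenceStart(uint8_t code_unit) {
  return code_unit <= 0xC3;
}

// Assumption: lead bytes from 0xF0 start code points >= U+10000.
static bool IsSupplementarySequenceStart(uint8_t code_unit) {
  return code_unit >= 0xF0;
}

// Sketch of the patched loop: classify each sequence by its lead byte only.
intptr_t CodeUnitCount(const uint8_t* utf8_array,
                       intptr_t array_len,
                       Type* type) {
  assert(type != nullptr);
  intptr_t len = 0;
  Type char_type = kLatin1;
  for (intptr_t i = 0; i < array_len; i++) {
    uint8_t code_unit = utf8_array[i];
    if (!IsTrailByte(code_unit)) {
      ++len;  // one code unit per lead byte
      if (!IsLatin1SequenceStart(code_unit)) {  // > U+00FF
        if (IsSupplementarySequenceStart(code_unit)) {  // >= U+10000
          char_type = kSupplementary;
          ++len;  // a supplementary code point takes two UTF-16 code units
        } else if (char_type == kLatin1) {
          char_type = kBMP;
        }
      }
    }
  }
  // Assumed tail: report the coding form and return the code unit count.
  *type = char_type;
  return len;
}
```

Under these assumptions, the two bytes `0xC3 0xA9` (U+00E9) give a count of 1 with `kLatin1`, the three bytes `0xE2 0x82 0xAC` (U+20AC) give 1 with `kBMP`, and the four bytes `0xF0 0x9F 0x98 0x80` (U+1F600) give 2 with `kSupplementary`.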
