Issue 23522018: RenderTextWin: Break runs between any two characters that are not in the same code block

ckocagil

7 years, 3 months ago (2013-09-02 20:56:53 UTC) #1

msw

I like the general idea behind this patch, and I hope Alexei will weigh in ...

7 years, 3 months ago (2013-09-03 16:15:31 UTC) #2

Alexei Svitkine (slow)

Waiting for you to answer Mike's questions. Also, whatever approach is taken should include a ...

7 years, 3 months ago (2013-09-03 17:05:21 UTC) #3

ckocagil

https://codereview.chromium.org/23522018/diff/1/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/1/ui/gfx/render_text_win.cc#newcode602 ui/gfx/render_text_win.cc:602: // Break runs between any two characters that are ...

7 years, 3 months ago (2013-09-04 13:02:50 UTC) #4

Alexei Svitkine (slow)

Thanks for working on this, by the way! (Still waiting for a unit test for ...

7 years, 3 months ago (2013-09-04 14:42:01 UTC) #5

ckocagil

Also added a unit test. https://codereview.chromium.org/23522018/diff/7001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/7001/ui/gfx/render_text_win.cc#newcode601 ui/gfx/render_text_win.cc:601: run_break = run->range.start() + ...

7 years, 3 months ago (2013-09-04 21:35:46 UTC) #6

Alexei Svitkine (slow)

Looks good, thanks for fixing the other erroneous uses of text(). A few more comments. ...

7 years, 3 months ago (2013-09-04 21:52:36 UTC) #7

msw

https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_unittest.cc File ui/gfx/render_text_unittest.cc (right): https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_unittest.cc#newcode1656 ui/gfx/render_text_unittest.cc:1656: const base::string16 kTestString = WideToUTF16(L"x\x25B6y"); nit: Add another test ...

7 years, 3 months ago (2013-09-04 22:05:38 UTC) #8

ckocagil

https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_unittest.cc File ui/gfx/render_text_unittest.cc (right): https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_unittest.cc#newcode1656 ui/gfx/render_text_unittest.cc:1656: const base::string16 kTestString = WideToUTF16(L"x\x25B6y"); On 2013/09/04 22:05:38, msw ...

7 years, 3 months ago (2013-09-05 20:13:14 UTC) #9

Alexei Svitkine (slow)

https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/20001/ui/gfx/render_text_win.cc#newcode621 ui/gfx/render_text_win.cc:621: On 2013/09/05 20:13:15, ckocagil wrote: > On 2013/09/04 21:52:36, ...

7 years, 3 months ago (2013-09-05 20:32:40 UTC) #10

ckocagil

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc#newcode611 ui/gfx/render_text_win.cc:611: if (!ui::IsValidCodePointIndex(layout_text, run_start)) On 2013/09/05 20:32:41, Alexei Svitkine wrote: ...

7 years, 3 months ago (2013-09-05 21:08:57 UTC) #11

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc
File ui/gfx/render_text_win.cc (right):

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc...
ui/gfx/render_text_win.cc:611: if (!ui::IsValidCodePointIndex(layout_text,
run_start))
On 2013/09/05 20:32:41, Alexei Svitkine wrote:
> I'm not a fan of this logic here. Decrementing run_start seems wrong to me,
> since now you're overlapping with the previous run. If we can guarantee that
> run_start is at a valid index (i.e. the previous run_break was at a valid
> index), then this one shouldn't be necessary, I think.

|run_start| doesn't change |run->range.start()|. This allows CharIterator to
start with a valid char index (even if icu checks it, I would rather do it here
explicitly). Surrogate pairs that have halves in different runs will be checked
in both runs; this is intentional.

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc...
ui/gfx/render_text_win.cc:614: if (!ui::IsValidCodePointIndex(layout_text,
run_break))
On 2013/09/05 20:32:41, Alexei Svitkine wrote:
> In what cases could this be the case? For example, your CL is fixing the
> clamping code above, so that shouldn't result in the run_break being bad.
> 
> I guess only if Uniscribe gives us back results via the script item list? If
so,
> it should probably be handled at that stage.

I'm not sure, style ranges maybe? Do they contain code-point indices? Mike, do
you know what causes these runs or if they actually exist?

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc...
ui/gfx/render_text_win.cc:621: run_break = std::min(run_break, iter.array_pos()
+ run_start);
On 2013/09/05 20:32:41, Alexei Svitkine wrote:
> The mins is only necessary because your code above does --run_start, right?

No, it is necessary because of the ++run_length (this allows "iter.pos() +
run_start == run_break + 1"). I could use --run_length, but then surrogate
halves wouldn't be checked in both of the runs they are in.

Alexei Svitkine (slow)

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc#newcode611 ui/gfx/render_text_win.cc:611: if (!ui::IsValidCodePointIndex(layout_text, run_start)) On 2013/09/05 21:08:57, ckocagil wrote: > ...

7 years, 3 months ago (2013-09-06 13:28:41 UTC) #12

Alexei Svitkine (slow)

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc#newcode611 ui/gfx/render_text_win.cc:611: if (!ui::IsValidCodePointIndex(layout_text, run_start)) On 2013/09/06 13:28:42, Alexei Svitkine wrote: ...

7 years, 3 months ago (2013-09-06 13:36:18 UTC) #13

ckocagil

On 2013/09/06 13:36:18, Alexei Svitkine wrote: > https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc > File ui/gfx/render_text_win.cc (right): > > https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc#newcode611 ...

7 years, 3 months ago (2013-09-06 15:50:55 UTC) #14

On 2013/09/06 13:36:18, Alexei Svitkine wrote:
> https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc
> File ui/gfx/render_text_win.cc (right):
> 
>
https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc...
> ui/gfx/render_text_win.cc:611: if (!ui::IsValidCodePointIndex(layout_text,
> run_start))
> On 2013/09/06 13:28:42, Alexei Svitkine wrote:
> > On 2013/09/05 21:08:57, ckocagil wrote:
> > > On 2013/09/05 20:32:41, Alexei Svitkine wrote:
> > > > I'm not a fan of this logic here. Decrementing run_start seems wrong to
> me,
> > > > since now you're overlapping with the previous run. If we can guarantee
> that
> > > > run_start is at a valid index (i.e. the previous run_break was at a
valid
> > > > index), then this one shouldn't be necessary, I think.
> > > 
> > > |run_start| doesn't change |run->range.start()|. This allows CharIterator
to
> > > start with a valid char index (even if icu checks it, I would rather do it
> > here
> > > explicitly). Surrogate pairs that have halves in different runs will be
> > checked
> > > in both runs; this is intentional.
> > 
> > Right, I understand this, but still am not a fan of it. It would be better
if
> > such a case was impossible (i.e. we were guaranteed that the run start is
not
> > the 2nd half of a surrogate pair). Otherwise, it makes the logic below quite
> > strange. For example, |first_block| is assigned iter.get(), but if this
> happened
> > than it would correspond to something starting at a previous run.
> > 
> > We should just make sure not to have run boundaries split surrogate pairs
and
> > then this extra logic won't be necessary.
> 
> Plus, looking at the UTF16CharIterator() code, it shouldn't crash or anything
if
> we pass it a fragment that splits a surrogate pair. So I'd say just remove
these
> isValidCodePointIndex() checks in this block and add a TODO elsewhere to
ensure
> we don't split surrogate pairs when splitting runs.

Done.

msw

https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc File ui/gfx/render_text_win.cc (right): https://codereview.chromium.org/23522018/diff/31001/ui/gfx/render_text_win.cc#newcode614 ui/gfx/render_text_win.cc:614: if (!ui::IsValidCodePointIndex(layout_text, run_break)) On 2013/09/05 21:08:57, ckocagil wrote: > ...

7 years, 3 months ago (2013-09-06 16:51:28 UTC) #16

ckocagil

https://codereview.chromium.org/23522018/diff/39001/ui/gfx/render_text_unittest.cc File ui/gfx/render_text_unittest.cc (right): https://codereview.chromium.org/23522018/diff/39001/ui/gfx/render_text_unittest.cc#newcode1656 ui/gfx/render_text_unittest.cc:1656: const base::string16 kTestString1 = WideToUTF16(L"x\x25B6y"); On 2013/09/06 16:51:28, msw ...

7 years, 3 months ago (2013-09-06 17:26:49 UTC) #17

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/ckocagil@chromium.org/23522018/45001

7 years, 3 months ago (2013-09-06 17:33:13 UTC) #20

Message was sent while issue was closed.

Change committed as 221776

Issue 23522018: RenderTextWin: Break runs between any two characters that are not in the same code block (Closed)

Description

Patch Set 1 #

Patch Set 2 : use CharIterator #

Patch Set 3 : addressed comments #

Patch Set 4 : fixed for partial surrogates #

Patch Set 5 : added test case #

Patch Set 6 : don't handle bad ranges #

Patch Set 7 : Mike's comments #

Messages