Created: 3 years, 7 months ago by jungshik at Google
Modified: 3 years, 7 months ago
Reviewers: Peter Kasting
CC: chromium-reviews
Target Ref: refs/heads/master
Project: chromium
Visibility: Public.
Description
Block Tifinagh + Latin mix
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
Review-Url: https://codereview.chromium.org/2894313002
Cr-Commit-Position: refs/heads/master@{#474199}
Committed: https://chromium.googlesource.com/chromium/src/+/ae6f339fba0736224fdca0b96d2bb1cda42d8ad3
Patch Set 1 #
Total comments: 2
Patch Set 2 : rebased #
Messages
Total messages: 22 (11 generated)
Description was changed from
==========
Block Tifinagh + Latin mix
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
==========
to
==========
Block Tifinagh + Latin mix
Blocking any Tifinagh character from mixing with Latin is a bit too broad a brush for a single character of U+.... in the script confusable with lowercase L.
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
==========
Description was changed from
==========
Block Tifinagh + Latin mix
Blocking any Tifinagh character from mixing with Latin is a bit too broad a brush for a single character of U+.... in the script confusable with lowercase L.
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
==========
to
==========
Block Tifinagh + Latin mix
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
==========
Blocking all Tifinagh characters from mixing with Latin is too broad a brush, because only one character in the script is confusable with Latin. However, it's done this way to facilitate merging to branches. A somewhat more complex regex (like the one used for the o-like and g-like Armenian letters) could have been used, but I chose a simpler one.
On 2017/05/22 06:18:44, jungshik at Google wrote:
> Blocking any Tifinagh characters from mixing with Latin is a bit too broad brush
> because there's only one character in the script that is confusable with Latin.
> However, it's done this way to facilitate merging to branches. A bit more
> complex regex (as used for o-like and g-like Armenian letters could have been
> used), but I chose a simpler one.

Not a single domain (out of ~1 million registered in .com as of March 2017) is affected by this change. I guess some domains registered since then (for testing) would be filtered by this change.
jshin@chromium.org changed reviewers: + pkasting@chromium.org
PTAL
LGTM

https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
File components/url_formatter/idn_spoof_checker.cc (right):

https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
components/url_formatter/idn_spoof_checker.cc:170: // - Disalow mixing of Latin and Tifinagh.
Are there other "Latin + <script>" combos we should pre-emptively block?

https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
components/url_formatter/idn_spoof_checker.cc:186: R"([\p{sc=tfng}].*[a-z]|[a-z].*[\p{sc=tfng}])",
Nit: Feels like there's maybe a less redundant way to write this regex, but it's probably less readable.
On 2017/05/22 18:47:46, Peter Kasting wrote:
> LGTM

Thanks

> https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
> File components/url_formatter/idn_spoof_checker.cc (right):
>
> https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
> components/url_formatter/idn_spoof_checker.cc:170: // - Disalow mixing of Latin
> and Tifinagh.
> Are there other "Latin + <script>" combos we should pre-emptively block?

With https://codereview.chromium.org/2897873002, the need for that became pretty small (top-domain look-alikes are blocked by that CL). Miao has one character that is confusable with one of [a-z]. For a couple of years we have blocked the worst Armenian confusables ('g-like' and 'o-like') from appearing next to Latin. Armenian has others (rather distinct in most fonts), but they'd be blocked by https://codereview.chromium.org/2897873002 for top domains. For branches, I may as well block mixing of Latin and {Miao, Armenian}.

> https://codereview.chromium.org/2894313002/diff/1/components/url_formatter/id...
> components/url_formatter/idn_spoof_checker.cc:186:
> R"([\p{sc=tfng}].*[a-z]|[a-z].*[\p{sc=tfng}])",
> Nit: Feels like there's maybe a less redundant way to write this regex, but it's
> probably less readable.

I couldn't come up with one. There may be. I'll think about it.
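For illustration only (not part of this CL): the regex at idn_spoof_checker.cc:186 matches whenever a label contains both a Tifinagh character and one of [a-z], in either order. A minimal sketch of the same condition without the two mirrored alternatives is a single pass over the code points. MixesLatinAndTifinagh and its helpers are hypothetical names, not Chromium functions. The sketch also assumes the Tifinagh block U+2D30..U+2D7F as a stand-in for \p{sc=tfng}, which may include code points the script property does not, so it can be slightly broader than the regex.

```cpp
#include <string>

// Hypothetical helper: true for code points in the Tifinagh block
// (U+2D30..U+2D7F). This approximates \p{sc=tfng}; it is an assumption
// made for this sketch, not the ICU script property itself.
bool IsInTifinaghBlock(char32_t c) {
  return c >= 0x2D30 && c <= 0x2D7F;
}

// Hypothetical helper: true for ASCII lowercase Latin letters, i.e. [a-z].
bool IsAsciiLowerLatin(char32_t c) {
  return c >= U'a' && c <= U'z';
}

// Returns true if |label| contains at least one [a-z] letter and at least
// one Tifinagh-block code point, in any order. This mirrors the intent of
// "[tfng].*[a-z]|[a-z].*[tfng]" without repeating the pattern twice.
bool MixesLatinAndTifinagh(const std::u32string& label) {
  bool has_latin = false;
  bool has_tifinagh = false;
  for (char32_t c : label) {
    if (IsAsciiLowerLatin(c)) has_latin = true;
    if (IsInTifinaghBlock(c)) has_tifinagh = true;
    if (has_latin && has_tifinagh) return true;  // Both seen: stop early.
  }
  return false;
}
```

A label such as U"a\u2D4F" or U"\u2D4Fa" would be flagged, while an all-Latin or all-Tifinagh label would not. Whether this reads better than the regex is a judgment call; the regex keeps the rule in the same form as the other entries next to it.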
The CQ bit was checked by jshin@chromium.org
CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
The CQ bit was unchecked by commit-bot@chromium.org
Try jobs failed on following builders:
  ios-simulator on master.tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/ios-simulator/bui...)
  ios-simulator-xcode-clang on master.tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/ios-simulator-xco...)
  mac_chromium_compile_dbg_ng on master.tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_chromium_comp...)
The CQ bit was checked by jshin@chromium.org
The patchset sent to the CQ was uploaded after l-g-t-m from pkasting@chromium.org Link to the patchset: https://codereview.chromium.org/2894313002/#ps20001 (title: "rebased")
CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
The CQ bit was unchecked by commit-bot@chromium.org
Try jobs failed on following builders: win_chromium_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_rel_...)
The CQ bit was checked by jshin@chromium.org
CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.or...
CQ is committing da patch. Bot data: {"patchset_id": 20001, "attempt_start_ts": 1495607598432040, "parent_rev": "a1db6b7d87922f09904037e82447170cc97c0a2e", "commit_rev": "ae6f339fba0736224fdca0b96d2bb1cda42d8ad3"}
Message was sent while issue was closed.
Description was changed from
==========
Block Tifinagh + Latin mix
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
==========
to
==========
Block Tifinagh + Latin mix
BUG=chromium:722639
TEST=components_unittests --gtest_filter=*IDNToU*
Review-Url: https://codereview.chromium.org/2894313002
Cr-Commit-Position: refs/heads/master@{#474199}
Committed: https://chromium.googlesource.com/chromium/src/+/ae6f339fba0736224fdca0b96d2b...
==========
Message was sent while issue was closed.
Committed patchset #2 (id:20001) as https://chromium.googlesource.com/chromium/src/+/ae6f339fba0736224fdca0b96d2b...