Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(258)

Issue 2138643002: Reland "Replace ICU with CED for auto encoding detection" (Closed)

Created:
4 years, 5 months ago by Jinsuk Kim
Modified:
4 years, 5 months ago
CC:
blink-reviews, chromium-reviews
Base URL:
https://chromium.googlesource.com/chromium/src.git@master
Target Ref:
refs/pending/heads/master
Project:
chromium
Visibility:
Public.

Description

Reland "Replace ICU with CED for auto encoding detection" Another attempt to land crrev.com/2081653007. The patch was reverted due accidentally having added new static constructors (string), and causing a failure of the "sizes" step of the build. Link to broken build: https://build.chromium.org/p/chromium/builders/Linux%20x64/builds/21618 Relevant log lines: FAILED linux-release-64/sizes/chrome-si/initializers: actual 9, expected 7, better lower The issue was fixed in the library on github.com: https://github.com/google/compact_enc_det/commit/4abb8e48ca45e40bb6e4eb5767f20498e8df0075 https://github.com/google/compact_enc_det/commit/45c3d4a3d7ee40adf271748f3c4177fa293800e1 Tested: ------- $ tools/linux/dump-static-initializers.py out/Release/chrome ... Found 40 static initializers in 7 files. ------- This reverts commit dbd9f219ef9b615d9e116964e46b1a390c9354a1. BUG= Committed: https://crrev.com/ec678d0e5493256ccf2b2b6a752de289b58b9d28 Cr-Commit-Position: refs/heads/master@{#405352}

Patch Set 1 #

Total comments: 2
Unified diffs Side-by-side diffs Delta from patch set Stats (+16 lines, -69 lines) Patch
M third_party/WebKit/Source/platform/BUILD.gn View 1 chunk +1 line, -0 lines 0 comments Download
M third_party/WebKit/Source/platform/DEPS View 1 chunk +1 line, -0 lines 0 comments Download
M third_party/WebKit/Source/platform/blink_platform.gyp View 1 chunk +1 line, -0 lines 0 comments Download
M third_party/WebKit/Source/platform/text/TextEncodingDetector.cpp View 2 chunks +13 lines, -69 lines 2 comments Download

Messages

Total messages: 11 (4 generated)
Jinsuk Kim
4 years, 5 months ago (2016-07-11 05:00:12 UTC) #2
tkent
lgtm
4 years, 5 months ago (2016-07-11 05:03:39 UTC) #3
commit-bot: I haz the power
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2138643002/1
4 years, 5 months ago (2016-07-13 22:03:50 UTC) #5
commit-bot: I haz the power
Committed patchset #1 (id:1)
4 years, 5 months ago (2016-07-13 23:55:41 UTC) #6
commit-bot: I haz the power
CQ bit was unchecked.
4 years, 5 months ago (2016-07-13 23:56:01 UTC) #7
commit-bot: I haz the power
Patchset 1 (id:??) landed as https://crrev.com/ec678d0e5493256ccf2b2b6a752de289b58b9d28 Cr-Commit-Position: refs/heads/master@{#405352}
4 years, 5 months ago (2016-07-13 23:58:27 UTC) #9
jungshik at Google
4 years, 5 months ago (2016-07-19 21:18:30 UTC) #11
Message was sent while issue was closed.
Thank you !  

I'm adding comments below because I couldn't file a bug at github (somehow, I
don't see issues tab there). What's your plan for bug tracking of CED?  Are you
gonna use github or crbug.com?

https://codereview.chromium.org/2138643002/diff/1/third_party/WebKit/Source/p...
File third_party/WebKit/Source/platform/text/TextEncodingDetector.cpp (right):

https://codereview.chromium.org/2138643002/diff/1/third_party/WebKit/Source/p...
third_party/WebKit/Source/platform/text/TextEncodingDetector.cpp:49: false, //
Include 7-bit encodings
The only 7-bit encoding Blink supports is ISO-2022-JP. We can introduce 'HTML5'
mode (#if defined(HTML5)) to suppress the detection of all the non-HTML5
encodings in CED.

https://codereview.chromium.org/2138643002/diff/1/third_party/WebKit/Source/p...
third_party/WebKit/Source/platform/text/TextEncodingDetector.cpp:53:
*detectedEncoding = WTF::TextEncoding(MimeEncodingName(encoding));
MimeEncodingName uses kEncodingTable (
https://cs.chromium.org/chromium/src/third_party/ced/src/util/encodings/encod...
) that has a lot of non-HTML5 encodings or non-HTML5 names. 

Perhaps, we have to introduce 'HTML5' mode (#if defined(HTML5) ...) in CED to
sanitize those names.

Powered by Google App Engine
This is Rietveld 408576698