Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(22)

Issue 224943002: icu local change part1 (Closed)

Created:
6 years, 8 months ago by jungshik at Google
Modified:
6 years, 8 months ago
Reviewers:
Mark Mentovai, jsbell
CC:
chromium-reviews
Visibility:
Public.

Description

ICU 52 local changes part1 1. Remove all the obsolete patches. There are lots of them because most of local patches to ICU 4.6.1 have either been accepted or become obsolete. The largest local patch removed is our patches for CJ word breaker because they were upstreamed. Android didn't apply the CJK word breaker patch to ICU 4.6 to reduce the data size. In a follow-up CL, we'll have an Android-specific change for this issue. Besides, we don't include patches for files we locally add because the patches for new files are redundant. Instead, they're mentioned in README.chromium. 2. We don't need platform-specific headers any more (pmac, plinux, pwin, etc). They're combined into a single file and all platforms we care about are well-supported except for one issue on Android/QNX. putil.patch takes care of it. 3. Breakiterator patches for a few remaining issues. We also use a much smaller Khmer dictionary (upstream fix pending). 4. Converter - Introduced two WHATWG-encoding-standard-compliant mapping tables are added (derived directly from the spec with a script) for EUC-JP and CP866 - Disabled various non-HTML5-encodings such as SCSU,BOCU, UTF-7, CESU-8 saving ~30kB in the code size. Even though we link statically, they're still pulled in as a part of uconv. - Disabled ISO-2022-JP-[1-4] in ucnv2022.c - Removed a number of encoding alias entries in the alias table leading to ~40kB data size reduction. 5. Locale data : Haven't yet updated. We need to trim them substantially. 6. Unihan collation removal is now done with a script (scripts/remove_unihan.sh) 7. Updated timezone data to the latest (2014b) as of today. 8. Customized transliterator for Greek uppercasing 9. Updated data build related patches. The windows data build patch has yet to be updated. 10. The updated ICU data file/assembly source files are not included in this CL. They'll be updated in a separate CL. With all the size reduction changes applied, the data size went down from > 23MB to 12.4MB. However, it's still 2.5MB larger than ICU 4.6.1 data. The locale data trimming will bring it down further. 11. Update README.chromium accordingly. The only exceptions are item #5 and the android entry in item #3 (breakiterator. see #1 above) BUG=132145, 76328 TEST=Following the procedure outlined in README.chromium, one can build the icu data file. R=jsbell@chromium.org, mark@chromium.org Committed: https://src.chromium.org/viewvc/chrome?view=rev&revision=262192

Patch Set 1 #

Patch Set 2 : #

Patch Set 3 : #

Patch Set 4 : fix typos in uconfig.h and putil.patch #

Total comments: 23

Patch Set 5 : review comments addressed; uconv.patch updated #

Patch Set 6 : function indentation changed #

Unified diffs Side-by-side diffs Delta from patch set Stats (+17142 lines, -52254 lines) Patch
M icu52/README.chromium View 1 2 3 4 5 3 chunks +88 lines, -210 lines 0 comments Download
D icu52/patches/breakiterator.patch View 1 2 3 4 5 1 chunk +0 lines, -140 lines 0 comments Download
D icu52/patches/changeset_30255.patch View 1 2 3 4 5 1 chunk +0 lines, -22 lines 0 comments Download
D icu52/patches/clang.patch View 1 2 3 4 5 1 chunk +0 lines, -89 lines 0 comments Download
A icu52/patches/configure.patch View 1 2 3 4 5 1 chunk +13 lines, -0 lines 0 comments Download
M icu52/patches/converters.patch View 1 2 3 4 5 6 chunks +862 lines, -24351 lines 0 comments Download
D icu52/patches/csetdet.patch View 1 2 3 4 5 1 chunk +0 lines, -35 lines 0 comments Download
M icu52/patches/data.build.patch View 1 2 3 4 5 6 chunks +22 lines, -26 lines 0 comments Download
D icu52/patches/gcc46.patch View 1 2 3 4 5 1 chunk +0 lines, -20 lines 0 comments Download
D icu52/patches/ios_timezone.patch View 1 2 3 4 5 1 chunk +0 lines, -76 lines 0 comments Download
A icu52/patches/khmer-dictbe.patch View 1 2 3 4 5 1 chunk +18 lines, -0 lines 0 comments Download
D icu52/patches/nan.patch View 1 2 3 4 5 1 chunk +0 lines, -65 lines 0 comments Download
D icu52/patches/pandroid.h.patch View 1 2 3 4 5 1 chunk +0 lines, -29 lines 0 comments Download
D icu52/patches/platform.patch View 1 2 3 4 5 1 chunk +0 lines, -45 lines 0 comments Download
D icu52/patches/platform.qnx.patch View 1 2 3 4 5 1 chunk +0 lines, -113 lines 0 comments Download
D icu52/patches/pmac.h.patch View 1 2 3 4 5 1 chunk +0 lines, -55 lines 0 comments Download
D icu52/patches/pnacl.h.patch View 1 2 3 4 5 1 chunk +0 lines, -14 lines 0 comments Download
A + icu52/patches/putil.patch View 1 2 3 4 5 1 chunk +11 lines, -43 lines 0 comments Download
D icu52/patches/regex.patch View 1 2 3 4 5 1 chunk +0 lines, -39 lines 0 comments Download
D icu52/patches/rematch.patch View 1 2 3 4 5 1 chunk +0 lines, -62 lines 0 comments Download
D icu52/patches/rtti.patch View 1 2 3 4 5 1 chunk +0 lines, -407 lines 0 comments Download
D icu52/patches/search_collation.patch View 1 2 3 4 5 1 chunk +0 lines, -1083 lines 0 comments Download
D icu52/patches/segmentation.patch View 1 2 3 4 5 1 chunk +0 lines, -3587 lines 0 comments Download
D icu52/patches/si_value.undef.patch View 1 2 3 4 5 1 chunk +0 lines, -14 lines 0 comments Download
D icu52/patches/static.analysis.patch View 1 2 3 4 5 1 chunk +0 lines, -63 lines 0 comments Download
D icu52/patches/ubrk.patch View 1 2 3 4 5 1 chunk +0 lines, -32 lines 0 comments Download
A icu52/patches/uconv.patch View 1 2 3 4 5 1 chunk +333 lines, -0 lines 0 comments Download
D icu52/patches/uloc.patch View 1 2 3 4 5 1 chunk +0 lines, -55 lines 0 comments Download
D icu52/patches/unihan.patch View 1 2 3 4 5 1 chunk +0 lines, -3266 lines 0 comments Download
D icu52/patches/utext.patch View 1 2 3 4 5 1 chunk +0 lines, -76 lines 0 comments Download
D icu52/patches/vs2012.patch View 1 2 3 4 5 1 chunk +0 lines, -29 lines 0 comments Download
M icu52/patches/vscomp.patch View 1 2 3 4 5 1 chunk +22 lines, -52 lines 0 comments Download
M icu52/patches/wpo.patch View 1 2 3 4 5 3 chunks +4 lines, -4 lines 0 comments Download
D icu52/patches/xopen_source.patch View 1 2 3 4 5 1 chunk +0 lines, -28 lines 0 comments Download
A icu52/scripts/eucjp_gen.sh View 1 2 3 4 5 1 chunk +134 lines, -0 lines 0 comments Download
A icu52/scripts/ibm866_gen.sh View 1 2 3 4 5 1 chunk +38 lines, -0 lines 0 comments Download
A icu52/scripts/remove_unihan.sh View 1 2 3 4 5 1 chunk +14 lines, -0 lines 0 comments Download
M icu52/source/common/dictbe.cpp View 1 2 3 4 5 1 chunk +2 lines, -2 lines 0 comments Download
M icu52/source/common/locmap.c View 1 2 3 4 5 2 chunks +6 lines, -0 lines 0 comments Download
M icu52/source/common/putilimp.h View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/common/stringpiece.cpp View 1 2 3 4 5 1 chunk +8 lines, -0 lines 0 comments Download
M icu52/source/common/ucmndata.h View 1 2 3 4 5 1 chunk +14 lines, -0 lines 0 comments Download
M icu52/source/common/ucnv2022.cpp View 1 2 3 4 5 6 chunks +27 lines, -4 lines 0 comments Download
M icu52/source/common/ucnv_bld.cpp View 1 2 3 4 5 1 chunk +10 lines, -1 line 0 comments Download
M icu52/source/common/ucnv_cnv.h View 1 2 3 4 5 1 chunk +5 lines, -0 lines 0 comments Download
M icu52/source/common/ucnv_u7.c View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/common/ucnv_u8.c View 1 2 3 4 5 5 chunks +14 lines, -5 lines 0 comments Download
M icu52/source/common/ucnvbocu.cpp View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/common/ucnvisci.c View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/common/ucnvscsu.c View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/common/udata.cpp View 1 2 3 4 5 3 chunks +3 lines, -3 lines 0 comments Download
M icu52/source/common/unicode/uconfig.h View 1 2 3 4 5 1 chunk +8 lines, -0 lines 0 comments Download
M icu52/source/common/unicode/urename.h View 1 2 3 4 5 2 chunks +8 lines, -0 lines 0 comments Download
M icu52/source/common/unicode/utypes.h View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/configure View 1 2 3 4 5 1 chunk +1 line, -1 line 0 comments Download
M icu52/source/data/Makefile.in View 1 2 3 4 5 3 chunks +12 lines, -3 lines 0 comments Download
A + icu52/source/data/brkitr/brklocal.mk View 1 2 3 4 5 2 chunks +7 lines, -9 lines 0 comments Download
M icu52/source/data/brkitr/line.txt View 1 2 3 4 5 3 chunks +13 lines, -10 lines 0 comments Download
M icu52/source/data/brkitr/word.txt View 1 2 3 4 5 1 chunk +9 lines, -3 lines 0 comments Download
A + icu52/source/data/coll/collocal.mk View 1 2 3 4 5 2 chunks +6 lines, -15 lines 0 comments Download
M icu52/source/data/coll/ja.txt View 1 2 3 4 5 1 chunk +0 lines, -5711 lines 0 comments Download
M icu52/source/data/coll/ko.txt View 1 2 3 4 5 1 chunk +0 lines, -5273 lines 0 comments Download
M icu52/source/data/coll/zh.txt View 1 2 3 4 5 1 chunk +0 lines, -5282 lines 0 comments Download
A + icu52/source/data/curr/reslocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/lang/reslocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A icu52/source/data/lang/tg.txt View 1 2 3 4 5 1 chunk +18 lines, -0 lines 0 comments Download
A + icu52/source/data/locales/reslocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A icu52/source/data/locales/tg.txt View 1 2 3 4 5 1 chunk +17 lines, -0 lines 0 comments Download
M icu52/source/data/mappings/convrtrs.txt View 1 2 3 4 5 11 chunks +80 lines, -646 lines 0 comments Download
A icu52/source/data/mappings/euc-jp-html5.ucm View 1 2 3 4 5 1 chunk +13585 lines, -0 lines 0 comments Download
A icu52/source/data/mappings/ibm-866_html5-2012.ucm View 1 2 3 4 5 1 chunk +274 lines, -0 lines 0 comments Download
A icu52/source/data/mappings/iso-8859_16-2001.ucm View 1 2 3 4 5 1 chunk +301 lines, -0 lines 0 comments Download
A + icu52/source/data/mappings/noop-cns-11643.ucm View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/mappings/noop-gb2312_gl.ucm View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/mappings/noop-iso-ir-165.ucm View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/mappings/ucmlocal.mk View 1 2 3 4 5 2 chunks +5 lines, -8 lines 0 comments Download
M icu52/source/data/mappings/windows-936-2000.ucm View 1 2 3 4 5 5 chunks +6 lines, -1 line 0 comments Download
M icu52/source/data/mappings/windows-949-2000.ucm View 1 2 3 4 5 1 chunk +7 lines, -8 lines 0 comments Download
M icu52/source/data/mappings/windows-950-2000.ucm View 1 2 3 4 5 1 chunk +5 lines, -8 lines 0 comments Download
M icu52/source/data/misc/metaZones.txt View 1 2 3 4 5 15 chunks +39 lines, -24 lines 0 comments Download
M icu52/source/data/misc/timezoneTypes.txt View 1 2 3 4 5 5 chunks +5 lines, -4 lines 0 comments Download
M icu52/source/data/misc/windowsZones.txt View 1 2 3 4 5 11 chunks +12 lines, -17 lines 0 comments Download
M icu52/source/data/misc/zoneinfo64.txt View 1 2 3 4 5 108 chunks +1042 lines, -1052 lines 0 comments Download
A + icu52/source/data/rbnf/rbnflocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/region/reslocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/sprep/spreplocal.mk View 1 2 3 4 5 2 chunks +8 lines, -5 lines 0 comments Download
A + icu52/source/data/translit/css3transform.txt View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
A + icu52/source/data/zone/reslocal.mk View 1 2 3 4 5 0 chunks +-1 lines, --1 lines 0 comments Download
M icu52/source/stubdata/stubdata.c View 1 2 3 4 5 1 chunk +20 lines, -31 lines 0 comments Download
M icu52/source/tools/toolutil/pkg_genc.c View 1 2 3 4 5 3 chunks +10 lines, -2 lines 0 comments Download

Messages

Total messages: 9 (0 generated)
jungshik at Google
Mark, sorry to bother you with this gigantic CL. It's hard to say what to ...
6 years, 8 months ago (2014-04-04 09:29:52 UTC) #1
jungshik at Google
On 2014/04/04 09:29:52, Jungshik Shin wrote: > Mark, sorry to bother you with this gigantic ...
6 years, 8 months ago (2014-04-04 18:04:03 UTC) #2
jsbell
crbug.com/277023 is about CP866, it could be added to the BUG= line. (The CL description/changes ...
6 years, 8 months ago (2014-04-04 18:16:19 UTC) #3
Mark Mentovai
I only looked closely at the scripts and README.chromium. I spot-checked parts of the rest. ...
6 years, 8 months ago (2014-04-04 19:22:51 UTC) #4
jungshik at Google
Thanks a lot for the review. Updated the CL. https://codereview.chromium.org/224943002/diff/30001/icu52/README.chromium File icu52/README.chromium (right): https://codereview.chromium.org/224943002/diff/30001/icu52/README.chromium#newcode44 icu52/README.chromium:44: ...
6 years, 8 months ago (2014-04-04 22:20:26 UTC) #5
jungshik at Google
Can you take another look? Thanks
6 years, 8 months ago (2014-04-07 17:17:43 UTC) #6
jsbell
quick skim lgtm (In the .sh files, the function look like they should be indented ...
6 years, 8 months ago (2014-04-07 17:45:52 UTC) #7
Mark Mentovai
I agree about the indentation. LGTM otherwise.
6 years, 8 months ago (2014-04-07 17:51:22 UTC) #8
jungshik at Google
6 years, 8 months ago (2014-04-07 20:30:20 UTC) #9
Message was sent while issue was closed.
Committed patchset #6 manually as r262192 (presubmit successful).

Powered by Google App Engine
This is Rietveld 408576698