Chromium Code Reviews

Side by Side Diff: source/data/unidata/changes.txt

Issue 1864463002: Add Emoji properties (Closed) Base URL: https://chromium.googlesource.com/chromium/deps/icu.git@master
Patch Set: Created 4 years, 8 months ago
OLD NEW
1 * Copyright (C) 2004-2015, International Business Machines 1 * Copyright (C) 2004-2016, International Business Machines
2 * Corporation and others. All Rights Reserved. 2 * Corporation and others. All Rights Reserved.
3 * 3 *
4 * file name: changes.txt 4 * file name: changes.txt
5 * encoding: US-ASCII 5 * encoding: US-ASCII
6 * tab size: 8 (not used) 6 * tab size: 8 (not used)
7 * indentation:4 7 * indentation:4
8 * 8 *
9 * created on: 2004may06 9 * created on: 2004may06
10 * created by: Markus W. Scherer 10 * created by: Markus W. Scherer
11 * 11 *
(...skipping 26 matching lines...)
38 Aran can be added as USCRIPT_ARABIC_NASTALIQ at any time. 38 Aran can be added as USCRIPT_ARABIC_NASTALIQ at any time.
39 39
40 Adlam, Marchen, and Osage are expected to go into Unicode 9; 40 Adlam, Marchen, and Osage are expected to go into Unicode 9;
41 we should assign Unicode script property value aliases for them 41 we should assign Unicode script property value aliases for them
42 soon after Unicode 8 is released, and add them in ICU 56. 42 soon after Unicode 8 is released, and add them in ICU 56.
43 43
44 Khitan scripts will be encoded later. 44 Khitan scripts will be encoded later.
45 45
46 ---------------------------------------------------------------------------- *** 46 ---------------------------------------------------------------------------- ***
47 47
48 Emoji properties added in ICU 57: http://bugs.icu-project.org/trac/ticket/11802
49
50 Edit preparseucd.py to add & parse new properties.
51 They share the UCD property namespace but are not listed in PropertyAliases.txt.
52
53 Add emoji-data.txt to the input files, from http://www.unicode.org/Public/emoji/
54 Initial data from emoji/2.0/
55
56 ICU_ROOT=~/svn.icu/trunk
57 ICU_SRC_DIR=$ICU_ROOT/src
58 ICUDT=icudt56b
59 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib
60 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in
61 UNIDATA=$ICU_SRC_DIR/source/data/unidata
62
63 Add binary-property constants to uchar.h enum UProperty & UProperty.java.
64
65 ~/svn.icutools/trunk/src/unicode$ py/preparseucd.py ~/unidata/uni80/20151217 $ICU_SRC_DIR ~/svn.icutools/trunk/src
66 (Needs to be run after uchar.h additions, so that the new properties can be picked up by genprops.)
67
68 Data structure: uprops.h/.cpp, corepropsbuilder.cpp, UCharacterProperty.java
69
70 make install, then icutools cmake & make, then
71 ~/svn.icutools/trunk/dbg/unicode/c$ make && genprops/genprops $ICU_SRC_DIR
72
73 Generate Java data as usual, only update pnames.icu & uprops.icu.
74
75 ---------------------------------------------------------------------------- ***
76
48 Unicode 8.0 update for ICU 56 77 Unicode 8.0 update for ICU 56
49 78
50 * Command-line environment setup 79 * Command-line environment setup
51 80
52 ICU_ROOT=~/svn.icu/trunk 81 ICU_ROOT=~/svn.icu/trunk
53 ICU_SRC_DIR=$ICU_ROOT/src 82 ICU_SRC_DIR=$ICU_ROOT/src
54 ICUDT=icudt56b 83 ICUDT=icudt56b
55 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib 84 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib
56 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in 85 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in
57 UNIDATA=$ICU_SRC_DIR/source/data/unidata 86 UNIDATA=$ICU_SRC_DIR/source/data/unidata
(...skipping 2327 matching lines...)
2385 2414
2386 * name matching 2415 * name matching
2387 - read UCD.html 2416 - read UCD.html
2388 2417
2389 * scripts 2418 * scripts
2390 - use new Hrkt=Katakana_Or_Hiragana 2419 - use new Hrkt=Katakana_Or_Hiragana
2391 2420
2392 * ZWJ & ZWNJ 2421 * ZWJ & ZWNJ
2393 - are now part of combining character sequences 2422 - are now part of combining character sequences
2394 - break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ 2423 - break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ
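
Note on the new emoji section (NEW lines 48-73): it adds emoji-data.txt to the inputs parsed by preparseucd.py. As a rough illustration of what parsing such a file involves, here is a minimal sketch. It assumes the file uses the usual UCD-style layout of `code point or range ; property name # comment`. The helper name `parse_emoji_data` and the sample lines are hypothetical and are not taken from preparseucd.py or from the real data file.

```python
import re

# Hypothetical helper for illustration only; not the preparseucd.py code.
# Matches "XXXX ; Name" or "XXXX..YYYY ; Name" (hex code points).
_LINE_RE = re.compile(
    r"^([0-9A-Fa-f]{4,6})(?:\.\.([0-9A-Fa-f]{4,6}))?\s*;\s*(\w+)")


def parse_emoji_data(lines):
    """Yield (start, end, property_name) for each data line.

    Assumes UCD-style lines; comments start with '#'.
    """
    for line in lines:
        # Drop the trailing comment and surrounding whitespace.
        data = line.split("#", 1)[0].strip()
        if not data:
            continue  # blank or comment-only line
        match = _LINE_RE.match(data)
        if match is None:
            raise ValueError("unrecognized line: %r" % line)
        start = int(match.group(1), 16)
        # A single code point is treated as a one-element range.
        end = int(match.group(2), 16) if match.group(2) else start
        yield start, end, match.group(3)


# Illustrative sample in the assumed format, not copied from the real file.
sample = [
    "# sample data",
    "0023 ; Emoji # NUMBER SIGN",
    "1F3FB..1F3FF ; Emoji_Modifier # skin tone modifiers",
    "",
]
ranges = list(parse_emoji_data(sample))
```

Each tuple gives an inclusive code point range and the property it carries. A real tool would then map the property names to the new binary-property constants mentioned on NEW line 63.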