source/data/unidata/changes.txt - Issue 1864463002: Add Emoji properties

Side by Side Diff: source/data/unidata/changes.txt

Issue 1864463002: Add Emoji properties (Closed) Base URL: https://chromium.googlesource.com/chromium/deps/icu.git@master

Patch Set: Created 4 years, 8 months ago


OLD	NEW
1 * Copyright (C) 2004-2015, International Business Machines	1 * Copyright (C) 2004-2016, International Business Machines

2 * Corporation and others. All Rights Reserved.	2 * Corporation and others. All Rights Reserved.

3 *	3 *

4 * file name: changes.txt	4 * file name: changes.txt

5 * encoding: US-ASCII	5 * encoding: US-ASCII

6 * tab size: 8 (not used)	6 * tab size: 8 (not used)

7 * indentation:4	7 * indentation:4

8 *	8 *

9 * created on: 2004may06	9 * created on: 2004may06

10 * created by: Markus W. Scherer	10 * created by: Markus W. Scherer

11 *	11 *

(...skipping 26 matching lines...)
38 Aran can be added as USCRIPT_ARABIC_NASTALIQ at any time.	38 Aran can be added as USCRIPT_ARABIC_NASTALIQ at any time.

39	39

40 Adlam, Marchen, and Osage are expected to go into Unicode 9;	40 Adlam, Marchen, and Osage are expected to go into Unicode 9;

41 we should assign Unicode script property value aliases for them	41 we should assign Unicode script property value aliases for them

42 soon after Unicode 8 is released, and add them in ICU 56.	42 soon after Unicode 8 is released, and add them in ICU 56.

43	43

44 Khitan scripts will be encoded later.	44 Khitan scripts will be encoded later.

45	45

46 ---------------------------------------------------------------------------- ***	46 ---------------------------------------------------------------------------- ***

47	47

	48 Emoji properties added in ICU 57: http://bugs.icu-project.org/trac/ticket/11802

	49

	50 Edit preparseucd.py to add & parse new properties.

	51 They share the UCD property namespace but are not listed in PropertyAliases.txt.
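
Reviewer sketch for the namespace point above, in Python because preparseucd.py is a Python script. The function names and the sample input are hypothetical and not taken from preparseucd.py; the four emoji property names are assumed from emoji-data.txt. The idea is only that properties missing from PropertyAliases.txt have to be registered by hand, and that a name collision should be caught because the namespace is shared.

```python
# Hypothetical sketch, not the actual preparseucd.py code.

def parse_property_aliases(lines):
    """Parse PropertyAliases.txt-style lines into {alias: long_name}."""
    aliases = {}
    for line in lines:
        # Drop trailing comments and skip empty lines.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        # Fields are: short name ; long name [; further aliases ...]
        fields = [f.strip() for f in line.split(";")]
        long_name = fields[1]
        for alias in fields:
            aliases[alias] = long_name
    return aliases


def add_extra_properties(aliases, extra_names):
    """Register properties that PropertyAliases.txt does not list."""
    for name in extra_names:
        # Shared namespace: a new name must not shadow an existing alias.
        if name in aliases:
            raise ValueError("property name collision: " + name)
        aliases[name] = name
    return aliases


# Assumed property names; the sample lines are illustrative only.
EMOJI_PROPERTIES = [
    "Emoji",
    "Emoji_Presentation",
    "Emoji_Modifier",
    "Emoji_Modifier_Base",
]
sample = [
    "# illustrative input",
    "AHex  ; ASCII_Hex_Digit",
    "Alpha ; Alphabetic",
]
table = add_extra_properties(parse_property_aliases(sample), EMOJI_PROPERTIES)
```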

	52

	53 Add emoji-data.txt to the input files, from http://www.unicode.org/Public/emoji/

	54 Initial data from emoji/2.0/
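
Reviewer sketch of reading emoji-data.txt, assuming it uses the usual UCD layout of `code point or range ; property # comment`. The function name and the sample lines are hypothetical and are not copied from preparseucd.py or from the real data file.

```python
# Hypothetical sketch, assuming UCD-style "range ; property # comment" lines.

def parse_emoji_data(lines):
    """Return {property_name: set of code points} from emoji-data.txt lines."""
    props = {}
    for line in lines:
        # Everything after the first '#' is a comment.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        range_field, prop = [f.strip() for f in line.split(";")]
        if ".." in range_field:
            start, end = range_field.split("..")
        else:
            start = end = range_field
        # Ranges in the data file are inclusive at both ends.
        code_points = range(int(start, 16), int(end, 16) + 1)
        props.setdefault(prop, set()).update(code_points)
    return props


# Illustrative input only.
sample = [
    "# sample lines",
    "0023          ; Emoji           # NUMBER SIGN",
    "1F3FB..1F3FF  ; Emoji_Modifier  # skin tone modifiers",
]
emoji_props = parse_emoji_data(sample)
```

Each resulting set can then be handed to whatever step writes the binary-property data; that step is outside this sketch.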

	55

	56 ICU_ROOT=~/svn.icu/trunk

	57 ICU_SRC_DIR=$ICU_ROOT/src

	58 ICUDT=icudt56b

	59 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib

	60 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in

	61 UNIDATA=$ICU_SRC_DIR/source/data/unidata

	62

	63 Add binary-property constants to uchar.h enum UProperty & UProperty.java.

	64

	65 ~/svn.icutools/trunk/src/unicode$ py/preparseucd.py ~/unidata/uni80/20151217 $ICU_SRC_DIR ~/svn.icutools/trunk/src

	66 (Needs to be run after uchar.h additions, so that the new properties can be picked up by genprops.)

	67

	68 Data structure: uprops.h/.cpp, corepropsbuilder.cpp, UCharacterProperty.java

	69

	70 make install, then icutools cmake & make, then

	71 ~/svn.icutools/trunk/dbg/unicode/c$ make && genprops/genprops $ICU_SRC_DIR

	72

	73 Generate Java data as usual, only update pnames.icu & uprops.icu.

	74

	75 ---------------------------------------------------------------------------- ***

	76

48 Unicode 8.0 update for ICU 56	77 Unicode 8.0 update for ICU 56

49	78

50 * Command-line environment setup	79 * Command-line environment setup

51	80

52 ICU_ROOT=~/svn.icu/trunk	81 ICU_ROOT=~/svn.icu/trunk

53 ICU_SRC_DIR=$ICU_ROOT/src	82 ICU_SRC_DIR=$ICU_ROOT/src

54 ICUDT=icudt56b	83 ICUDT=icudt56b

55 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib	84 export LD_LIBRARY_PATH=$ICU_ROOT/dbg/lib

56 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in	85 SRC_DATA_IN=$ICU_SRC_DIR/source/data/in

57 UNIDATA=$ICU_SRC_DIR/source/data/unidata	86 UNIDATA=$ICU_SRC_DIR/source/data/unidata

(...skipping 2327 matching lines...)
2385	2414

2386 * name matching	2415 * name matching

2387 - read UCD.html	2416 - read UCD.html

2388	2417

2389 * scripts	2418 * scripts

2390 - use new Hrkt=Katakana_Or_Hiragana	2419 - use new Hrkt=Katakana_Or_Hiragana

2391	2420

2392 * ZWJ & ZWNJ	2421 * ZWJ & ZWNJ

2393 - are now part of combining character sequences	2422 - are now part of combining character sequences

2394 - break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ	2423 - break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ
