Re: [hackers] [libgrapheme] Update to Unicode 15.0.0 || Laslo Hunhold

From: Hiltjo Posthuma <hiltjo_AT_codemadness.org>
Date: Thu, 15 Sep 2022 09:44:11 +0200

Finally support for the duck emoji!

https://blog.unicode.org/2022/09/announcing-unicode-standard-version-150.html
https://www.unicode.org/announcements/u15-emoji-annc-large.png
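
For anyone who wants to sanity-check the "Total code points" lines in the
quoted diff: each data line in DerivedCoreProperties.txt is either a single
code point or an inclusive "first..last" range, followed by "; Property".
Below is a minimal sketch in C of counting what one such line covers. The
name ucd_line_count is hypothetical and not part of libgrapheme, and it
expects the raw data line without the diff's "+" or "> " prefixes.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper, not part of libgrapheme: return how many code
 * points one line of a UCD property file assigns to `prop`, or 0 if
 * the line is a comment, is blank, or names another property.
 */
unsigned long
ucd_line_count(const char *line, const char *prop)
{
	unsigned long lo, hi;
	char name[64];

	assert(line != NULL && prop != NULL);

	if (sscanf(line, "%lx..%lx ; %63[A-Za-z0-9_]", &lo, &hi, name) == 3) {
		/* range form, e.g. "1E030..1E06D ; Lowercase # ..." */
	} else if (sscanf(line, "%lx ; %63[A-Za-z0-9_]", &lo, name) == 2) {
		/* single code point, e.g. "10FC ; Lowercase # ..." */
		hi = lo;
	} else {
		return 0;
	}

	if (hi < lo || strcmp(name, prop) != 0)
		return 0;

	/* ranges are inclusive on both ends */
	return hi - lo + 1;
}
```

Summed over the five Lowercase lines the diff adds, this gives
1 + 3 + 1 + 6 + 62 = 73, which matches the Lowercase total going from
2471 to 2544.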

On Thu, Sep 15, 2022 at 01:09:50AM +0200, git_AT_suckless.org wrote:
> commit fad432f65f9011175f4fe24d4045ba0d42bdc55e
> Author: Laslo Hunhold <dev_AT_frign.de>
> AuthorDate: Thu Sep 15 01:07:39 2022 +0200
> Commit: Laslo Hunhold <dev_AT_frign.de>
> CommitDate: Thu Sep 15 01:07:39 2022 +0200
>
> Update to Unicode 15.0.0
>
> This was the easiest thing I did all day. Change one variable and
> the rest was automatic. All tests are passing.
>
> I didn't have in mind that a new version was pending a release, but
> it arrived so fittingly just a few days after I introduced the
> automatically generated manuals.
>
> This makes libgrapheme probably one of the first libraries to implement
> Unicode 15.0.0.
>
> Signed-off-by: Laslo Hunhold <dev_AT_frign.de>
>
> diff --git a/Makefile b/Makefile
> index 653175f..eacd8c3 100644
> --- a/Makefile
> +++ b/Makefile
> @@ -4,7 +4,7 @@
>
> VERSION = 1
> MAN_DATE = 2022-09-07
> -UNICODE_VERSION = 14.0.0
> +UNICODE_VERSION = 15.0.0
>
> include config.mk
>
> diff --git a/data/DerivedCoreProperties.txt b/data/DerivedCoreProperties.txt
> index afc2abd..8b482b5 100644
> --- a/data/DerivedCoreProperties.txt
> +++ b/data/DerivedCoreProperties.txt
> @@ -1,11 +1,11 @@
> -# DerivedCoreProperties-14.0.0.txt
> -# Date: 2021-08-12, 23:12:53 GMT
> -# © 2021 Unicode®, Inc.
> +# DerivedCoreProperties-15.0.0.txt
> +# Date: 2022-08-05, 22:17:05 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
>
> # ================================================
>
> @@ -462,6 +462,7 @@ FFE9..FFEC ; Math # Sm [4] HALFWIDTH LEFTWARDS ARROW..HALFWIDTH DOWNWARDS A
> 0BD7 ; Alphabetic # Mc TAMIL AU LENGTH MARK
> 0C00 ; Alphabetic # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE
> 0C01..0C03 ; Alphabetic # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
> +0C04 ; Alphabetic # Mn TELUGU SIGN COMBINING ANUSVARA ABOVE
> 0C05..0C0C ; Alphabetic # Lo [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
> 0C0E..0C10 ; Alphabetic # Lo [3] TELUGU LETTER E..TELUGU LETTER AI
> 0C12..0C28 ; Alphabetic # Lo [23] TELUGU LETTER O..TELUGU LETTER NA
> @@ -497,6 +498,7 @@ FFE9..FFEC ; Math # Sm [4] HALFWIDTH LEFTWARDS ARROW..HALFWIDTH DOWNWARDS A
> 0CE0..0CE1 ; Alphabetic # Lo [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
> 0CE2..0CE3 ; Alphabetic # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> 0CF1..0CF2 ; Alphabetic # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3 ; Alphabetic # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01 ; Alphabetic # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03 ; Alphabetic # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C ; Alphabetic # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> @@ -552,7 +554,7 @@ FFE9..FFEC ; Math # Sm [4] HALFWIDTH LEFTWARDS ARROW..HALFWIDTH DOWNWARDS A
> 0F49..0F6C ; Alphabetic # Lo [36] TIBETAN LETTER NYA..TIBETAN LETTER RRA
> 0F71..0F7E ; Alphabetic # Mn [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
> 0F7F ; Alphabetic # Mc TIBETAN SIGN RNAM BCAD
> -0F80..0F81 ; Alphabetic # Mn [2] TIBETAN VOWEL SIGN REVERSED I..TIBETAN VOWEL SIGN REVERSED II
> +0F80..0F83 ; Alphabetic # Mn [4] TIBETAN VOWEL SIGN REVERSED I..TIBETAN SIGN SNA LDAN
> 0F88..0F8C ; Alphabetic # Lo [5] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN INVERTED MCHU CAN
> 0F8D..0F97 ; Alphabetic # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
> 0F99..0FBC ; Alphabetic # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
> @@ -1053,6 +1055,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 11071..11072 ; Alphabetic # Lo [2] BRAHMI LETTER OLD TAMIL SHORT E..BRAHMI LETTER OLD TAMIL SHORT O
> 11073..11074 ; Alphabetic # Mn [2] BRAHMI VOWEL SIGN OLD TAMIL SHORT E..BRAHMI VOWEL SIGN OLD TAMIL SHORT O
> 11075 ; Alphabetic # Lo BRAHMI LETTER OLD TAMIL LLA
> +11080..11081 ; Alphabetic # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
> 11082 ; Alphabetic # Mc KAITHI SIGN VISARGA
> 11083..110AF ; Alphabetic # Lo [45] KAITHI LETTER A..KAITHI LETTER HA
> 110B0..110B2 ; Alphabetic # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
> @@ -1089,6 +1092,8 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 11234 ; Alphabetic # Mn KHOJKI SIGN ANUSVARA
> 11237 ; Alphabetic # Mn KHOJKI SIGN SHADDA
> 1123E ; Alphabetic # Mn KHOJKI SIGN SUKUN
> +1123F..11240 ; Alphabetic # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> +11241 ; Alphabetic # Mn KHOJKI VOWEL SIGN VOCALIC R
> 11280..11286 ; Alphabetic # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; Alphabetic # Lo MULTANI LETTER GHA
> 1128A..1128D ; Alphabetic # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -1243,12 +1248,22 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 11EE0..11EF2 ; Alphabetic # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> 11EF3..11EF4 ; Alphabetic # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6 ; Alphabetic # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F00..11F01 ; Alphabetic # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F02 ; Alphabetic # Lo KAWI SIGN REPHA
> +11F03 ; Alphabetic # Mc KAWI SIGN VISARGA
> +11F04..11F10 ; Alphabetic # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; Alphabetic # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35 ; Alphabetic # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A ; Alphabetic # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F ; Alphabetic # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40 ; Alphabetic # Mn KAWI VOWEL SIGN EU
> 11FB0 ; Alphabetic # Lo LISU LETTER YHA
> 12000..12399 ; Alphabetic # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; Alphabetic # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; Alphabetic # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; Alphabetic # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; Alphabetic # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; Alphabetic # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; Alphabetic # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; Alphabetic # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; Alphabetic # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; Alphabetic # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -1275,7 +1290,9 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 1AFF5..1AFFB ; Alphabetic # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; Alphabetic # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; Alphabetic # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; Alphabetic # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; Alphabetic # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; Alphabetic # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; Alphabetic # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; Alphabetic # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; Alphabetic # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -1316,16 +1333,21 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 1DF00..1DF09 ; Alphabetic # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; Alphabetic # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; Alphabetic # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; Alphabetic # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> 1E000..1E006 ; Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> 1E008..1E018 ; Alphabetic # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> 1E01B..1E021 ; Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Alphabetic # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Alphabetic # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D ; Alphabetic # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F ; Alphabetic # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E100..1E12C ; Alphabetic # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E137..1E13D ; Alphabetic # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E14E ; Alphabetic # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
> 1E290..1E2AD ; Alphabetic # Lo [30] TOTO LETTER PA..TOTO LETTER A
> 1E2C0..1E2EB ; Alphabetic # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> +1E4D0..1E4EA ; Alphabetic # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; Alphabetic # Lm NAG MUNDARI SIGN OJOD
> 1E7E0..1E7E6 ; Alphabetic # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; Alphabetic # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; Alphabetic # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -1371,14 +1393,15 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 1F150..1F169 ; Alphabetic # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
> 1F170..1F189 ; Alphabetic # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
> 20000..2A6DF ; Alphabetic # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; Alphabetic # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; Alphabetic # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; Alphabetic # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; Alphabetic # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; Alphabetic # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; Alphabetic # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; Alphabetic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; Alphabetic # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
>
> -# Total code points: 133396
> +# Total code points: 137765
>
> # ================================================
>
> @@ -1663,6 +1686,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
> 052F ; Lowercase # L& CYRILLIC SMALL LETTER EL WITH DESCENDER
> 0560..0588 ; Lowercase # L& [41] ARMENIAN SMALL LETTER TURNED AYB..ARMENIAN SMALL LETTER YI WITH STROKE
> 10D0..10FA ; Lowercase # L& [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
> +10FC ; Lowercase # Lm MODIFIER LETTER GEORGIAN NAR
> 10FD..10FF ; Lowercase # L& [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
> 13F8..13FD ; Lowercase # L& [6] CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETTER MV
> 1C80..1C88 ; Lowercase # L& [9] CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SMALL LETTER UNBLENDED UK
> @@ -2012,12 +2036,14 @@ A7D3 ; Lowercase # L& LATIN SMALL LETTER DOUBLE THORN
> A7D5 ; Lowercase # L& LATIN SMALL LETTER DOUBLE WYNN
> A7D7 ; Lowercase # L& LATIN SMALL LETTER MIDDLE SCOTS S
> A7D9 ; Lowercase # L& LATIN SMALL LETTER SIGMOID S
> +A7F2..A7F4 ; Lowercase # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
> A7F6 ; Lowercase # L& LATIN SMALL LETTER REVERSED HALF H
> A7F8..A7F9 ; Lowercase # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
> A7FA ; Lowercase # L& LATIN LETTER SMALL CAPITAL TURNED M
> AB30..AB5A ; Lowercase # L& [43] LATIN SMALL LETTER BARRED ALPHA..LATIN SMALL LETTER Y WITH SHORT RIGHT LEG
> AB5C..AB5F ; Lowercase # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
> AB60..AB68 ; Lowercase # L& [9] LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LETTER TURNED R WITH MIDDLE TILDE
> +AB69 ; Lowercase # Lm MODIFIER LETTER SMALL TURNED W
> AB70..ABBF ; Lowercase # L& [80] CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTER YA
> FB00..FB06 ; Lowercase # L& [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
> FB13..FB17 ; Lowercase # L& [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
> @@ -2065,9 +2091,11 @@ FF41..FF5A ; Lowercase # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH L
> 1D7CB ; Lowercase # L& MATHEMATICAL BOLD SMALL DIGAMMA
> 1DF00..1DF09 ; Lowercase # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0B..1DF1E ; Lowercase # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; Lowercase # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; Lowercase # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E922..1E943 ; Lowercase # L& [34] ADLAM SMALL LETTER ALIF..ADLAM SMALL LETTER SHA
>
> -# Total code points: 2471
> +# Total code points: 2544
>
> # ================================================
>
> @@ -2767,6 +2795,7 @@ FF21..FF3A ; Uppercase # L& [26] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH
> 10C7 ; Cased # L& GEORGIAN CAPITAL LETTER YN
> 10CD ; Cased # L& GEORGIAN CAPITAL LETTER AEN
> 10D0..10FA ; Cased # L& [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
> +10FC ; Cased # Lm MODIFIER LETTER GEORGIAN NAR
> 10FD..10FF ; Cased # L& [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
> 13A0..13F5 ; Cased # L& [86] CHEROKEE LETTER A..CHEROKEE LETTER MV
> 13F8..13FD ; Cased # L& [6] CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETTER MV
> @@ -2837,12 +2866,14 @@ A790..A7CA ; Cased # L& [59] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SM
> A7D0..A7D1 ; Cased # L& [2] LATIN CAPITAL LETTER CLOSED INSULAR G..LATIN SMALL LETTER CLOSED INSULAR G
> A7D3 ; Cased # L& LATIN SMALL LETTER DOUBLE THORN
> A7D5..A7D9 ; Cased # L& [5] LATIN SMALL LETTER DOUBLE WYNN..LATIN SMALL LETTER SIGMOID S
> +A7F2..A7F4 ; Cased # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
> A7F5..A7F6 ; Cased # L& [2] LATIN CAPITAL LETTER REVERSED HALF H..LATIN SMALL LETTER REVERSED HALF H
> A7F8..A7F9 ; Cased # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
> A7FA ; Cased # L& LATIN LETTER SMALL CAPITAL TURNED M
> AB30..AB5A ; Cased # L& [43] LATIN SMALL LETTER BARRED ALPHA..LATIN SMALL LETTER Y WITH SHORT RIGHT LEG
> AB5C..AB5F ; Cased # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
> AB60..AB68 ; Cased # L& [9] LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LETTER TURNED R WITH MIDDLE TILDE
> +AB69 ; Cased # Lm MODIFIER LETTER SMALL TURNED W
> AB70..ABBF ; Cased # L& [80] CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTER YA
> FB00..FB06 ; Cased # L& [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
> FB13..FB17 ; Cased # L& [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
> @@ -2899,12 +2930,14 @@ FF41..FF5A ; Cased # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
> 1D7C4..1D7CB ; Cased # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
> 1DF00..1DF09 ; Cased # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0B..1DF1E ; Cased # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; Cased # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; Cased # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E900..1E943 ; Cased # L& [68] ADLAM CAPITAL LETTER ALIF..ADLAM SMALL LETTER SHA
> 1F130..1F149 ; Cased # So [26] SQUARED LATIN CAPITAL LETTER A..SQUARED LATIN CAPITAL LETTER Z
> 1F150..1F169 ; Cased # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
> 1F170..1F189 ; Cased # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
>
> -# Total code points: 4453
> +# Total code points: 4526
>
> # ================================================
>
> @@ -3054,7 +3087,7 @@ FF41..FF5A ; Cased # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
> 0EB1 ; Case_Ignorable # Mn LAO VOWEL SIGN MAI KAN
> 0EB4..0EBC ; Case_Ignorable # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
> 0EC6 ; Case_Ignorable # Lm LAO KO LA
> -0EC8..0ECD ; Case_Ignorable # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; Case_Ignorable # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0F18..0F19 ; Case_Ignorable # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> 0F35 ; Case_Ignorable # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> 0F37 ; Case_Ignorable # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> @@ -3263,6 +3296,7 @@ FFF9..FFFB ; Case_Ignorable # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLI
> 10AE5..10AE6 ; Case_Ignorable # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> 10D24..10D27 ; Case_Ignorable # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> 10EAB..10EAC ; Case_Ignorable # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> +10EFD..10EFF ; Case_Ignorable # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F46..10F50 ; Case_Ignorable # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> 10F82..10F85 ; Case_Ignorable # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> 11001 ; Case_Ignorable # Mn BRAHMI SIGN ANUSVARA
> @@ -3287,6 +3321,7 @@ FFF9..FFFB ; Case_Ignorable # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLI
> 11234 ; Case_Ignorable # Mn KHOJKI SIGN ANUSVARA
> 11236..11237 ; Case_Ignorable # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; Case_Ignorable # Mn KHOJKI SIGN SUKUN
> +11241 ; Case_Ignorable # Mn KHOJKI VOWEL SIGN VOCALIC R
> 112DF ; Case_Ignorable # Mn KHUDAWADI SIGN ANUSVARA
> 112E3..112EA ; Case_Ignorable # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
> 11300..11301 ; Case_Ignorable # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
> @@ -3348,7 +3383,13 @@ FFF9..FFFB ; Case_Ignorable # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLI
> 11D95 ; Case_Ignorable # Mn GUNJALA GONDI SIGN ANUSVARA
> 11D97 ; Case_Ignorable # Mn GUNJALA GONDI VIRAMA
> 11EF3..11EF4 ; Case_Ignorable # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> -13430..13438 ; Case_Ignorable # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
> +11F00..11F01 ; Case_Ignorable # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F36..11F3A ; Case_Ignorable # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F40 ; Case_Ignorable # Mn KAWI VOWEL SIGN EU
> +11F42 ; Case_Ignorable # Mn KAWI CONJOINER
> +13430..1343F ; Case_Ignorable # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> +13440 ; Case_Ignorable # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13447..13455 ; Case_Ignorable # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 16AF0..16AF4 ; Case_Ignorable # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> 16B30..16B36 ; Case_Ignorable # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> 16B40..16B43 ; Case_Ignorable # Lm [4] PAHAWH HMONG SIGN VOS SEEV..PAHAWH HMONG SIGN IB YAM
> @@ -3382,10 +3423,14 @@ FFF9..FFFB ; Case_Ignorable # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLI
> 1E01B..1E021 ; Case_Ignorable # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Case_Ignorable # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Case_Ignorable # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D ; Case_Ignorable # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F ; Case_Ignorable # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E130..1E136 ; Case_Ignorable # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E137..1E13D ; Case_Ignorable # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E2AE ; Case_Ignorable # Mn TOTO SIGN RISING TONE
> 1E2EC..1E2EF ; Case_Ignorable # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> +1E4EB ; Case_Ignorable # Lm NAG MUNDARI SIGN OJOD
> +1E4EC..1E4EF ; Case_Ignorable # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> 1E8D0..1E8D6 ; Case_Ignorable # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> 1E944..1E94A ; Case_Ignorable # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
> 1E94B ; Case_Ignorable # Lm ADLAM NASALIZATION MARK
> @@ -3394,7 +3439,7 @@ E0001 ; Case_Ignorable # Cf LANGUAGE TAG
> E0020..E007F ; Case_Ignorable # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF ; Case_Ignorable # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 2602
> +# Total code points: 2707
>
> # ================================================
>
> @@ -6617,6 +6662,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 111DC ; ID_Start # Lo SHARADA HEADSTROKE
> 11200..11211 ; ID_Start # Lo [18] KHOJKI LETTER A..KHOJKI LETTER JJA
> 11213..1122B ; ID_Start # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
> +1123F..11240 ; ID_Start # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> 11280..11286 ; ID_Start # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; ID_Start # Lo MULTANI LETTER GHA
> 1128A..1128D ; ID_Start # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -6679,12 +6725,16 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 11D6A..11D89 ; ID_Start # Lo [32] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER SA
> 11D98 ; ID_Start # Lo GUNJALA GONDI OM
> 11EE0..11EF2 ; ID_Start # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> +11F02 ; ID_Start # Lo KAWI SIGN REPHA
> +11F04..11F10 ; ID_Start # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; ID_Start # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> 11FB0 ; ID_Start # Lo LISU LETTER YHA
> 12000..12399 ; ID_Start # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; ID_Start # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; ID_Start # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; ID_Start # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; ID_Start # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; ID_Start # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; ID_Start # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; ID_Start # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; ID_Start # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; ID_Start # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -6707,7 +6757,9 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1AFF5..1AFFB ; ID_Start # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; ID_Start # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; ID_Start # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; ID_Start # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; ID_Start # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; ID_Start # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; ID_Start # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; ID_Start # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; ID_Start # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -6747,11 +6799,15 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1DF00..1DF09 ; ID_Start # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; ID_Start # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; ID_Start # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; ID_Start # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; ID_Start # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E100..1E12C ; ID_Start # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E137..1E13D ; ID_Start # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E14E ; ID_Start # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
> 1E290..1E2AD ; ID_Start # Lo [30] TOTO LETTER PA..TOTO LETTER A
> 1E2C0..1E2EB ; ID_Start # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> +1E4D0..1E4EA ; ID_Start # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; ID_Start # Lm NAG MUNDARI SIGN OJOD
> 1E7E0..1E7E6 ; ID_Start # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; ID_Start # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; ID_Start # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -6793,14 +6849,15 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1EEA5..1EEA9 ; ID_Start # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
> 1EEAB..1EEBB ; ID_Start # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> 20000..2A6DF ; ID_Start # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; ID_Start # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; ID_Start # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; ID_Start # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; ID_Start # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; ID_Start # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; ID_Start # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; ID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; ID_Start # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
>
> -# Total code points: 131997
> +# Total code points: 136345
>
> # ================================================
>
> @@ -7083,6 +7140,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 0CE2..0CE3 ; ID_Continue # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> 0CE6..0CEF ; ID_Continue # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
> 0CF1..0CF2 ; ID_Continue # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3 ; ID_Continue # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01 ; ID_Continue # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03 ; ID_Continue # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C ; ID_Continue # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> @@ -7136,7 +7194,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 0EBD ; ID_Continue # Lo LAO SEMIVOWEL SIGN NYO
> 0EC0..0EC4 ; ID_Continue # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
> 0EC6 ; ID_Continue # Lm LAO KO LA
> -0EC8..0ECD ; ID_Continue # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; ID_Continue # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0ED0..0ED9 ; ID_Continue # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
> 0EDC..0EDF ; ID_Continue # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
> 0F00 ; ID_Continue # Lo TIBETAN SYLLABLE OM
> @@ -7719,6 +7777,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 10E80..10EA9 ; ID_Continue # Lo [42] YEZIDI LETTER ELIF..YEZIDI LETTER ET
> 10EAB..10EAC ; ID_Continue # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> 10EB0..10EB1 ; ID_Continue # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
> +10EFD..10EFF ; ID_Continue # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F00..10F1C ; ID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
> 10F27 ; ID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
> 10F30..10F45 ; ID_Continue # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
> @@ -7781,6 +7840,8 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 11235 ; ID_Continue # Mc KHOJKI SIGN VIRAMA
> 11236..11237 ; ID_Continue # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; ID_Continue # Mn KHOJKI SIGN SUKUN
> +1123F..11240 ; ID_Continue # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> +11241 ; ID_Continue # Mn KHOJKI VOWEL SIGN VOCALIC R
> 11280..11286 ; ID_Continue # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; ID_Continue # Lo MULTANI LETTER GHA
> 1128A..1128D ; ID_Continue # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -7963,12 +8024,27 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 11EE0..11EF2 ; ID_Continue # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> 11EF3..11EF4 ; ID_Continue # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6 ; ID_Continue # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F00..11F01 ; ID_Continue # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F02 ; ID_Continue # Lo KAWI SIGN REPHA
> +11F03 ; ID_Continue # Mc KAWI SIGN VISARGA
> +11F04..11F10 ; ID_Continue # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; ID_Continue # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35 ; ID_Continue # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A ; ID_Continue # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F ; ID_Continue # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40 ; ID_Continue # Mn KAWI VOWEL SIGN EU
> +11F41 ; ID_Continue # Mc KAWI SIGN KILLER
> +11F42 ; ID_Continue # Mn KAWI CONJOINER
> +11F50..11F59 ; ID_Continue # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 11FB0 ; ID_Continue # Lo LISU LETTER YHA
> 12000..12399 ; ID_Continue # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; ID_Continue # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; ID_Continue # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; ID_Continue # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; ID_Continue # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; ID_Continue # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13440 ; ID_Continue # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13441..13446 ; ID_Continue # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> +13447..13455 ; ID_Continue # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 14400..14646 ; ID_Continue # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; ID_Continue # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; ID_Continue # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -8001,7 +8077,9 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 1AFF5..1AFFB ; ID_Continue # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; ID_Continue # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; ID_Continue # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; ID_Continue # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; ID_Continue # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; ID_Continue # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; ID_Continue # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; ID_Continue # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; ID_Continue # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -8058,11 +8136,14 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 1DF00..1DF09 ; ID_Continue # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; ID_Continue # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; ID_Continue # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; ID_Continue # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> 1E000..1E006 ; ID_Continue # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> 1E008..1E018 ; ID_Continue # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> 1E01B..1E021 ; ID_Continue # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; ID_Continue # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; ID_Continue # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D ; ID_Continue # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F ; ID_Continue # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E100..1E12C ; ID_Continue # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E130..1E136 ; ID_Continue # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E137..1E13D ; ID_Continue # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> @@ -8073,6 +8154,10 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 1E2C0..1E2EB ; ID_Continue # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> 1E2EC..1E2EF ; ID_Continue # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> 1E2F0..1E2F9 ; ID_Continue # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> +1E4D0..1E4EA ; ID_Continue # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; ID_Continue # Lm NAG MUNDARI SIGN OJOD
> +1E4EC..1E4EF ; ID_Continue # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> +1E4F0..1E4F9 ; ID_Continue # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E7E0..1E7E6 ; ID_Continue # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; ID_Continue # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; ID_Continue # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -8118,15 +8203,16 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
> 1EEAB..1EEBB ; ID_Continue # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> 1FBF0..1FBF9 ; ID_Continue # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
> 20000..2A6DF ; ID_Continue # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; ID_Continue # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; ID_Continue # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; ID_Continue # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; ID_Continue # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; ID_Continue # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; ID_Continue # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; ID_Continue # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; ID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 135072
> +# Total code points: 139482
>
> # ================================================
>
> @@ -8685,6 +8771,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 111DC ; XID_Start # Lo SHARADA HEADSTROKE
> 11200..11211 ; XID_Start # Lo [18] KHOJKI LETTER A..KHOJKI LETTER JJA
> 11213..1122B ; XID_Start # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
> +1123F..11240 ; XID_Start # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> 11280..11286 ; XID_Start # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; XID_Start # Lo MULTANI LETTER GHA
> 1128A..1128D ; XID_Start # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -8747,12 +8834,16 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 11D6A..11D89 ; XID_Start # Lo [32] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER SA
> 11D98 ; XID_Start # Lo GUNJALA GONDI OM
> 11EE0..11EF2 ; XID_Start # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> +11F02 ; XID_Start # Lo KAWI SIGN REPHA
> +11F04..11F10 ; XID_Start # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; XID_Start # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> 11FB0 ; XID_Start # Lo LISU LETTER YHA
> 12000..12399 ; XID_Start # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; XID_Start # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; XID_Start # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; XID_Start # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; XID_Start # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; XID_Start # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; XID_Start # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; XID_Start # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; XID_Start # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; XID_Start # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -8775,7 +8866,9 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 1AFF5..1AFFB ; XID_Start # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; XID_Start # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; XID_Start # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; XID_Start # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; XID_Start # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; XID_Start # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; XID_Start # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; XID_Start # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; XID_Start # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -8815,11 +8908,15 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 1DF00..1DF09 ; XID_Start # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; XID_Start # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; XID_Start # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; XID_Start # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; XID_Start # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E100..1E12C ; XID_Start # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E137..1E13D ; XID_Start # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E14E ; XID_Start # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
> 1E290..1E2AD ; XID_Start # Lo [30] TOTO LETTER PA..TOTO LETTER A
> 1E2C0..1E2EB ; XID_Start # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> +1E4D0..1E4EA ; XID_Start # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; XID_Start # Lm NAG MUNDARI SIGN OJOD
> 1E7E0..1E7E6 ; XID_Start # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; XID_Start # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; XID_Start # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -8861,14 +8958,15 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 1EEA5..1EEA9 ; XID_Start # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
> 1EEAB..1EEBB ; XID_Start # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> 20000..2A6DF ; XID_Start # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; XID_Start # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; XID_Start # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; XID_Start # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; XID_Start # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; XID_Start # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; XID_Start # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; XID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; XID_Start # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
>
> -# Total code points: 131974
> +# Total code points: 136322
>
> # ================================================
>
> @@ -9147,6 +9245,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 0CE2..0CE3 ; XID_Continue # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> 0CE6..0CEF ; XID_Continue # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
> 0CF1..0CF2 ; XID_Continue # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3 ; XID_Continue # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01 ; XID_Continue # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03 ; XID_Continue # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C ; XID_Continue # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> @@ -9200,7 +9299,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
> 0EBD ; XID_Continue # Lo LAO SEMIVOWEL SIGN NYO
> 0EC0..0EC4 ; XID_Continue # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
> 0EC6 ; XID_Continue # Lm LAO KO LA
> -0EC8..0ECD ; XID_Continue # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; XID_Continue # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0ED0..0ED9 ; XID_Continue # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
> 0EDC..0EDF ; XID_Continue # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
> 0F00 ; XID_Continue # Lo TIBETAN SYLLABLE OM
> @@ -9788,6 +9887,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 10E80..10EA9 ; XID_Continue # Lo [42] YEZIDI LETTER ELIF..YEZIDI LETTER ET
> 10EAB..10EAC ; XID_Continue # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> 10EB0..10EB1 ; XID_Continue # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
> +10EFD..10EFF ; XID_Continue # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F00..10F1C ; XID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
> 10F27 ; XID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
> 10F30..10F45 ; XID_Continue # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
> @@ -9850,6 +9950,8 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 11235 ; XID_Continue # Mc KHOJKI SIGN VIRAMA
> 11236..11237 ; XID_Continue # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; XID_Continue # Mn KHOJKI SIGN SUKUN
> +1123F..11240 ; XID_Continue # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> +11241 ; XID_Continue # Mn KHOJKI VOWEL SIGN VOCALIC R
> 11280..11286 ; XID_Continue # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; XID_Continue # Lo MULTANI LETTER GHA
> 1128A..1128D ; XID_Continue # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -10032,12 +10134,27 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 11EE0..11EF2 ; XID_Continue # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> 11EF3..11EF4 ; XID_Continue # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6 ; XID_Continue # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F00..11F01 ; XID_Continue # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F02 ; XID_Continue # Lo KAWI SIGN REPHA
> +11F03 ; XID_Continue # Mc KAWI SIGN VISARGA
> +11F04..11F10 ; XID_Continue # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; XID_Continue # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35 ; XID_Continue # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A ; XID_Continue # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F ; XID_Continue # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40 ; XID_Continue # Mn KAWI VOWEL SIGN EU
> +11F41 ; XID_Continue # Mc KAWI SIGN KILLER
> +11F42 ; XID_Continue # Mn KAWI CONJOINER
> +11F50..11F59 ; XID_Continue # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 11FB0 ; XID_Continue # Lo LISU LETTER YHA
> 12000..12399 ; XID_Continue # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; XID_Continue # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; XID_Continue # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; XID_Continue # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; XID_Continue # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; XID_Continue # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13440 ; XID_Continue # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13441..13446 ; XID_Continue # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> +13447..13455 ; XID_Continue # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 14400..14646 ; XID_Continue # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; XID_Continue # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; XID_Continue # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -10070,7 +10187,9 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 1AFF5..1AFFB ; XID_Continue # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; XID_Continue # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; XID_Continue # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; XID_Continue # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; XID_Continue # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; XID_Continue # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; XID_Continue # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; XID_Continue # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; XID_Continue # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -10127,11 +10246,14 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 1DF00..1DF09 ; XID_Continue # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; XID_Continue # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; XID_Continue # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; XID_Continue # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> 1E000..1E006 ; XID_Continue # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> 1E008..1E018 ; XID_Continue # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> 1E01B..1E021 ; XID_Continue # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; XID_Continue # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; XID_Continue # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D ; XID_Continue # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F ; XID_Continue # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E100..1E12C ; XID_Continue # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E130..1E136 ; XID_Continue # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E137..1E13D ; XID_Continue # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> @@ -10142,6 +10264,10 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 1E2C0..1E2EB ; XID_Continue # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> 1E2EC..1E2EF ; XID_Continue # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> 1E2F0..1E2F9 ; XID_Continue # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> +1E4D0..1E4EA ; XID_Continue # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; XID_Continue # Lm NAG MUNDARI SIGN OJOD
> +1E4EC..1E4EF ; XID_Continue # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> +1E4F0..1E4F9 ; XID_Continue # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E7E0..1E7E6 ; XID_Continue # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; XID_Continue # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; XID_Continue # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -10187,15 +10313,16 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
> 1EEAB..1EEBB ; XID_Continue # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> 1FBF0..1FBF9 ; XID_Continue # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
> 20000..2A6DF ; XID_Continue # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; XID_Continue # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; XID_Continue # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; XID_Continue # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; XID_Continue # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; XID_Continue # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; XID_Continue # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; XID_Continue # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; XID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 135053
> +# Total code points: 139463
>
> # ================================================
>
> @@ -10206,7 +10333,7 @@ E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTO
> # + Variation_Selector
> # - White_Space
> # - FFF9..FFFB (Interlinear annotation format characters)
> -# - 13430..13438 (Egyptian hieroglyph format characters)
> +# - 13430..13440 (Egyptian hieroglyph format characters)
> # - Prepended_Concatenation_Mark (Exceptional format characters that should be visible)
>
> 00AD ; Default_Ignorable_Code_Point # Cf SOFT HYPHEN
> @@ -10351,7 +10478,7 @@ E01F0..E0FFF ; Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<rese
> 0E47..0E4E ; Grapheme_Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
> 0EB1 ; Grapheme_Extend # Mn LAO VOWEL SIGN MAI KAN
> 0EB4..0EBC ; Grapheme_Extend # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
> -0EC8..0ECD ; Grapheme_Extend # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; Grapheme_Extend # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0F18..0F19 ; Grapheme_Extend # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> 0F35 ; Grapheme_Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> 0F37 ; Grapheme_Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> @@ -10490,6 +10617,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
> 10AE5..10AE6 ; Grapheme_Extend # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> 10D24..10D27 ; Grapheme_Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> 10EAB..10EAC ; Grapheme_Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> +10EFD..10EFF ; Grapheme_Extend # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F46..10F50 ; Grapheme_Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> 10F82..10F85 ; Grapheme_Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> 11001 ; Grapheme_Extend # Mn BRAHMI SIGN ANUSVARA
> @@ -10512,6 +10640,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
> 11234 ; Grapheme_Extend # Mn KHOJKI SIGN ANUSVARA
> 11236..11237 ; Grapheme_Extend # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; Grapheme_Extend # Mn KHOJKI SIGN SUKUN
> +11241 ; Grapheme_Extend # Mn KHOJKI VOWEL SIGN VOCALIC R
> 112DF ; Grapheme_Extend # Mn KHUDAWADI SIGN ANUSVARA
> 112E3..112EA ; Grapheme_Extend # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
> 11300..11301 ; Grapheme_Extend # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
> @@ -10579,6 +10708,12 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
> 11D95 ; Grapheme_Extend # Mn GUNJALA GONDI SIGN ANUSVARA
> 11D97 ; Grapheme_Extend # Mn GUNJALA GONDI VIRAMA
> 11EF3..11EF4 ; Grapheme_Extend # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> +11F00..11F01 ; Grapheme_Extend # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F36..11F3A ; Grapheme_Extend # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F40 ; Grapheme_Extend # Mn KAWI VOWEL SIGN EU
> +11F42 ; Grapheme_Extend # Mn KAWI CONJOINER
> +13440 ; Grapheme_Extend # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13447..13455 ; Grapheme_Extend # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 16AF0..16AF4 ; Grapheme_Extend # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> 16B30..16B36 ; Grapheme_Extend # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> 16F4F ; Grapheme_Extend # Mn MIAO SIGN CONSONANT MODIFIER BAR
> @@ -10605,15 +10740,17 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
> 1E01B..1E021 ; Grapheme_Extend # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Grapheme_Extend # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Grapheme_Extend # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E08F ; Grapheme_Extend # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E130..1E136 ; Grapheme_Extend # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E2AE ; Grapheme_Extend # Mn TOTO SIGN RISING TONE
> 1E2EC..1E2EF ; Grapheme_Extend # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> +1E4EC..1E4EF ; Grapheme_Extend # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> 1E8D0..1E8D6 ; Grapheme_Extend # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> 1E944..1E94A ; Grapheme_Extend # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
> E0020..E007F ; Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 2090
> +# Total code points: 2125
>
> # ================================================
>
> @@ -10913,6 +11050,7 @@ E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELE
> 0CE0..0CE1 ; Grapheme_Base # Lo [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
> 0CE6..0CEF ; Grapheme_Base # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
> 0CF1..0CF2 ; Grapheme_Base # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3 ; Grapheme_Base # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D02..0D03 ; Grapheme_Base # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C ; Grapheme_Base # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> 0D0E..0D10 ; Grapheme_Base # Lo [3] MALAYALAM LETTER E..MALAYALAM LETTER AI
> @@ -11965,6 +12103,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 11232..11233 ; Grapheme_Base # Mc [2] KHOJKI VOWEL SIGN O..KHOJKI VOWEL SIGN AU
> 11235 ; Grapheme_Base # Mc KHOJKI SIGN VIRAMA
> 11238..1123D ; Grapheme_Base # Po [6] KHOJKI DANDA..KHOJKI ABBREVIATION SIGN
> +1123F..11240 ; Grapheme_Base # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> 11280..11286 ; Grapheme_Base # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; Grapheme_Base # Lo MULTANI LETTER GHA
> 1128A..1128D ; Grapheme_Base # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -12080,6 +12219,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 11A9D ; Grapheme_Base # Lo SOYOMBO MARK PLUTA
> 11A9E..11AA2 ; Grapheme_Base # Po [5] SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPLE FLAME..SOYOMBO TERMINAL MARK-2
> 11AB0..11AF8 ; Grapheme_Base # Lo [73] CANADIAN SYLLABICS NATTILIK HI..PAU CIN HAU GLOTTAL STOP FINAL
> +11B00..11B09 ; Grapheme_Base # Po [10] DEVANAGARI HEAD MARK..DEVANAGARI SIGN MINDU
> 11C00..11C08 ; Grapheme_Base # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
> 11C0A..11C2E ; Grapheme_Base # Lo [37] BHAIKSUKI LETTER E..BHAIKSUKI LETTER HA
> 11C2F ; Grapheme_Base # Mc BHAIKSUKI VOWEL SIGN AA
> @@ -12109,6 +12249,15 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 11EE0..11EF2 ; Grapheme_Base # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> 11EF5..11EF6 ; Grapheme_Base # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> 11EF7..11EF8 ; Grapheme_Base # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> +11F02 ; Grapheme_Base # Lo KAWI SIGN REPHA
> +11F03 ; Grapheme_Base # Mc KAWI SIGN VISARGA
> +11F04..11F10 ; Grapheme_Base # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; Grapheme_Base # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35 ; Grapheme_Base # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F3E..11F3F ; Grapheme_Base # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F41 ; Grapheme_Base # Mc KAWI SIGN KILLER
> +11F43..11F4F ; Grapheme_Base # Po [13] KAWI DANDA..KAWI PUNCTUATION CLOSING SPIRAL
> +11F50..11F59 ; Grapheme_Base # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 11FB0 ; Grapheme_Base # Lo LISU LETTER YHA
> 11FC0..11FD4 ; Grapheme_Base # No [21] TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH..TAMIL FRACTION DOWNSCALING FACTOR KIIZH
> 11FD5..11FDC ; Grapheme_Base # So [8] TAMIL SIGN NEL..TAMIL SIGN MUKKURUNI
> @@ -12121,7 +12270,8 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 12480..12543 ; Grapheme_Base # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; Grapheme_Base # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> 12FF1..12FF2 ; Grapheme_Base # Po [2] CYPRO-MINOAN SIGN CM301..CYPRO-MINOAN SIGN CM302
> -13000..1342E ; Grapheme_Base # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; Grapheme_Base # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; Grapheme_Base # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; Grapheme_Base # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; Grapheme_Base # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; Grapheme_Base # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -12159,7 +12309,9 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1AFF5..1AFFB ; Grapheme_Base # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; Grapheme_Base # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; Grapheme_Base # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; Grapheme_Base # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; Grapheme_Base # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; Grapheme_Base # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; Grapheme_Base # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; Grapheme_Base # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; Grapheme_Base # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -12180,6 +12332,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1D1AE..1D1EA ; Grapheme_Base # So [61] MUSICAL SYMBOL PEDAL MARK..MUSICAL SYMBOL KORON
> 1D200..1D241 ; Grapheme_Base # So [66] GREEK VOCAL NOTATION SYMBOL-1..GREEK INSTRUMENTAL NOTATION SYMBOL-54
> 1D245 ; Grapheme_Base # So GREEK MUSICAL LEIMMA
> +1D2C0..1D2D3 ; Grapheme_Base # No [20] KAKTOVIK NUMERAL ZERO..KAKTOVIK NUMERAL NINETEEN
> 1D2E0..1D2F3 ; Grapheme_Base # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
> 1D300..1D356 ; Grapheme_Base # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
> 1D360..1D378 ; Grapheme_Base # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
> @@ -12233,6 +12386,8 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1DF00..1DF09 ; Grapheme_Base # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; Grapheme_Base # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; Grapheme_Base # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; Grapheme_Base # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; Grapheme_Base # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E100..1E12C ; Grapheme_Base # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E137..1E13D ; Grapheme_Base # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E140..1E149 ; Grapheme_Base # Nd [10] NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG PUACHUE HMONG DIGIT NINE
> @@ -12242,6 +12397,9 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1E2C0..1E2EB ; Grapheme_Base # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> 1E2F0..1E2F9 ; Grapheme_Base # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> 1E2FF ; Grapheme_Base # Sc WANCHO NGUN SIGN
> +1E4D0..1E4EA ; Grapheme_Base # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; Grapheme_Base # Lm NAG MUNDARI SIGN OJOD
> +1E4F0..1E4F9 ; Grapheme_Base # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E7E0..1E7E6 ; Grapheme_Base # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; Grapheme_Base # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; Grapheme_Base # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -12310,10 +12468,10 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1F300..1F3FA ; Grapheme_Base # So [251] CYCLONE..AMPHORA
> 1F3FB..1F3FF ; Grapheme_Base # Sk [5] EMOJI MODIFIER FITZPATRICK TYPE-1-2..EMOJI MODIFIER FITZPATRICK TYPE-6
> 1F400..1F6D7 ; Grapheme_Base # So [728] RAT..ELEVATOR
> -1F6DD..1F6EC ; Grapheme_Base # So [16] PLAYGROUND SLIDE..AIRPLANE ARRIVING
> +1F6DC..1F6EC ; Grapheme_Base # So [17] WIRELESS..AIRPLANE ARRIVING
> 1F6F0..1F6FC ; Grapheme_Base # So [13] SATELLITE..ROLLER SKATE
> -1F700..1F773 ; Grapheme_Base # So [116] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ALCHEMICAL SYMBOL FOR HALF OUNCE
> -1F780..1F7D8 ; Grapheme_Base # So [89] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NEGATIVE CIRCLED SQUARE
> +1F700..1F776 ; Grapheme_Base # So [119] ALCHEMICAL SYMBOL FOR QUINTESSENCE..LUNAR ECLIPSE
> +1F77B..1F7D9 ; Grapheme_Base # So [95] HAUMEA..NINE POINTED WHITE STAR
> 1F7E0..1F7EB ; Grapheme_Base # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE
> 1F7F0 ; Grapheme_Base # So HEAVY EQUALS SIGN
> 1F800..1F80B ; Grapheme_Base # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD
> @@ -12324,27 +12482,26 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
> 1F8B0..1F8B1 ; Grapheme_Base # So [2] ARROW POINTING UPWARDS THEN NORTH WEST..ARROW POINTING RIGHTWARDS THEN CURVING SOUTH WEST
> 1F900..1FA53 ; Grapheme_Base # So [340] CIRCLED CROSS FORMEE WITH FOUR DOTS..BLACK CHESS KNIGHT-BISHOP
> 1FA60..1FA6D ; Grapheme_Base # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
> -1FA70..1FA74 ; Grapheme_Base # So [5] BALLET SHOES..THONG SANDAL
> -1FA78..1FA7C ; Grapheme_Base # So [5] DROP OF BLOOD..CRUTCH
> -1FA80..1FA86 ; Grapheme_Base # So [7] YO-YO..NESTING DOLLS
> -1FA90..1FAAC ; Grapheme_Base # So [29] RINGED PLANET..HAMSA
> -1FAB0..1FABA ; Grapheme_Base # So [11] FLY..NEST WITH EGGS
> -1FAC0..1FAC5 ; Grapheme_Base # So [6] ANATOMICAL HEART..PERSON WITH CROWN
> -1FAD0..1FAD9 ; Grapheme_Base # So [10] BLUEBERRIES..JAR
> -1FAE0..1FAE7 ; Grapheme_Base # So [8] MELTING FACE..BUBBLES
> -1FAF0..1FAF6 ; Grapheme_Base # So [7] HAND WITH INDEX FINGER AND THUMB CROSSED..HEART HANDS
> +1FA70..1FA7C ; Grapheme_Base # So [13] BALLET SHOES..CRUTCH
> +1FA80..1FA88 ; Grapheme_Base # So [9] YO-YO..FLUTE
> +1FA90..1FABD ; Grapheme_Base # So [46] RINGED PLANET..WING
> +1FABF..1FAC5 ; Grapheme_Base # So [7] GOOSE..PERSON WITH CROWN
> +1FACE..1FADB ; Grapheme_Base # So [14] MOOSE..PEA POD
> +1FAE0..1FAE8 ; Grapheme_Base # So [9] MELTING FACE..SHAKING FACE
> +1FAF0..1FAF8 ; Grapheme_Base # So [9] HAND WITH INDEX FINGER AND THUMB CROSSED..RIGHTWARDS PUSHING HAND
> 1FB00..1FB92 ; Grapheme_Base # So [147] BLOCK SEXTANT-1..UPPER HALF INVERSE MEDIUM SHADE AND LOWER HALF BLOCK
> 1FB94..1FBCA ; Grapheme_Base # So [55] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..WHITE UP-POINTING CHEVRON
> 1FBF0..1FBF9 ; Grapheme_Base # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
> 20000..2A6DF ; Grapheme_Base # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; Grapheme_Base # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; Grapheme_Base # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; Grapheme_Base # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; Grapheme_Base # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; Grapheme_Base # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; Grapheme_Base # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; Grapheme_Base # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; Grapheme_Base # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
>
> -# Total code points: 142539
> +# Total code points: 146986
>
> # ================================================
>
> @@ -12410,7 +12567,9 @@ ABED ; Grapheme_Link # Mn MEETEI MAYEK APUN IYEK
> 11C3F ; Grapheme_Link # Mn BHAIKSUKI SIGN VIRAMA
> 11D44..11D45 ; Grapheme_Link # Mn [2] MASARAM GONDI SIGN HALANTA..MASARAM GONDI VIRAMA
> 11D97 ; Grapheme_Link # Mn GUNJALA GONDI VIRAMA
> +11F41 ; Grapheme_Link # Mc KAWI SIGN KILLER
> +11F42 ; Grapheme_Link # Mn KAWI CONJOINER
>
> -# Total code points: 63
> +# Total code points: 65
>
> # EOF
> diff --git a/data/EastAsianWidth.txt b/data/EastAsianWidth.txt
> index e04f705..38b7076 100644
> --- a/data/EastAsianWidth.txt
> +++ b/data/EastAsianWidth.txt
> @@ -1,6 +1,6 @@
> -# EastAsianWidth-14.0.0.txt
> -# Date: 2021-07-06, 09:58:53 GMT [KW, LI]
> -# © 2021 Unicode®, Inc.
> +# EastAsianWidth-15.0.0.txt
> +# Date: 2022-05-24, 17:40:20 GMT [KW, LI]
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> # For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> @@ -534,6 +534,7 @@
> 0CE2..0CE3;N # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> 0CE6..0CEF;N # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
> 0CF1..0CF2;N # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3;N # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01;N # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03;N # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C;N # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> @@ -595,7 +596,7 @@
> 0EBD;N # Lo LAO SEMIVOWEL SIGN NYO
> 0EC0..0EC4;N # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
> 0EC6;N # Lm LAO KO LA
> -0EC8..0ECD;N # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE;N # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0ED0..0ED9;N # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
> 0EDC..0EDF;N # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
> 0F00;N # Lo TIBETAN SYLLABLE OM
> @@ -1946,6 +1947,7 @@ FFFD;A # So REPLACEMENT CHARACTER
> 10EAB..10EAC;N # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> 10EAD;N # Pd YEZIDI HYPHENATION MARK
> 10EB0..10EB1;N # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
> +10EFD..10EFF;N # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F00..10F1C;N # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
> 10F1D..10F26;N # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
> 10F27;N # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
> @@ -2028,6 +2030,8 @@ FFFD;A # So REPLACEMENT CHARACTER
> 11236..11237;N # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 11238..1123D;N # Po [6] KHOJKI DANDA..KHOJKI ABBREVIATION SIGN
> 1123E;N # Mn KHOJKI SIGN SUKUN
> +1123F..11240;N # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> +11241;N # Mn KHOJKI VOWEL SIGN VOCALIC R
> 11280..11286;N # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288;N # Lo MULTANI LETTER GHA
> 1128A..1128D;N # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -2190,6 +2194,7 @@ FFFD;A # So REPLACEMENT CHARACTER
> 11A9E..11AA2;N # Po [5] SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPLE FLAME..SOYOMBO TERMINAL MARK-2
> 11AB0..11ABF;N # Lo [16] CANADIAN SYLLABICS NATTILIK HI..CANADIAN SYLLABICS SPA
> 11AC0..11AF8;N # Lo [57] PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL STOP FINAL
> +11B00..11B09;N # Po [10] DEVANAGARI HEAD MARK..DEVANAGARI SIGN MINDU
> 11C00..11C08;N # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
> 11C0A..11C2E;N # Lo [37] BHAIKSUKI LETTER E..BHAIKSUKI LETTER HA
> 11C2F;N # Mc BHAIKSUKI VOWEL SIGN AA
> @@ -2235,6 +2240,19 @@ FFFD;A # So REPLACEMENT CHARACTER
> 11EF3..11EF4;N # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6;N # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> 11EF7..11EF8;N # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> +11F00..11F01;N # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F02;N # Lo KAWI SIGN REPHA
> +11F03;N # Mc KAWI SIGN VISARGA
> +11F04..11F10;N # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33;N # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35;N # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A;N # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F;N # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40;N # Mn KAWI VOWEL SIGN EU
> +11F41;N # Mc KAWI SIGN KILLER
> +11F42;N # Mn KAWI CONJOINER
> +11F43..11F4F;N # Po [13] KAWI DANDA..KAWI PUNCTUATION CLOSING SPIRAL
> +11F50..11F59;N # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 11FB0;N # Lo LISU LETTER YHA
> 11FC0..11FD4;N # No [21] TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH..TAMIL FRACTION DOWNSCALING FACTOR KIIZH
> 11FD5..11FDC;N # So [8] TAMIL SIGN NEL..TAMIL SIGN MUKKURUNI
> @@ -2247,8 +2265,11 @@ FFFD;A # So REPLACEMENT CHARACTER
> 12480..12543;N # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0;N # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> 12FF1..12FF2;N # Po [2] CYPRO-MINOAN SIGN CM301..CYPRO-MINOAN SIGN CM302
> -13000..1342E;N # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> -13430..13438;N # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
> +13000..1342F;N # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13430..1343F;N # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> +13440;N # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13441..13446;N # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> +13447..13455;N # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 14400..14646;N # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38;N # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E;N # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -2293,7 +2314,9 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1AFFD..1AFFE;W # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B0FF;W # Lo [256] KATAKANA LETTER ARCHAIC E..HENTAIGANA LETTER RE-2
> 1B100..1B122;W # Lo [35] HENTAIGANA LETTER RE-3..KATAKANA LETTER ARCHAIC WU
> +1B132;W # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152;W # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155;W # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167;W # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB;W # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A;N # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -2324,6 +2347,7 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1D200..1D241;N # So [66] GREEK VOCAL NOTATION SYMBOL-1..GREEK INSTRUMENTAL NOTATION SYMBOL-54
> 1D242..1D244;N # Mn [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
> 1D245;N # So GREEK MUSICAL LEIMMA
> +1D2C0..1D2D3;N # No [20] KAKTOVIK NUMERAL ZERO..KAKTOVIK NUMERAL NINETEEN
> 1D2E0..1D2F3;N # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
> 1D300..1D356;N # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
> 1D360..1D378;N # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
> @@ -2383,11 +2407,14 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1DF00..1DF09;N # Ll [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A;N # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E;N # Ll [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A;N # Ll [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> 1E000..1E006;N # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> 1E008..1E018;N # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> 1E01B..1E021;N # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024;N # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A;N # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D;N # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F;N # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E100..1E12C;N # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E130..1E136;N # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E137..1E13D;N # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> @@ -2400,6 +2427,10 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1E2EC..1E2EF;N # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> 1E2F0..1E2F9;N # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> 1E2FF;N # Sc WANCHO NGUN SIGN
> +1E4D0..1E4EA;N # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB;N # Lm NAG MUNDARI SIGN OJOD
> +1E4EC..1E4EF;N # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> +1E4F0..1E4F9;N # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E7E0..1E7E6;N # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB;N # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE;N # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -2528,13 +2559,14 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1F6D0..1F6D2;W # So [3] PLACE OF WORSHIP..SHOPPING TROLLEY
> 1F6D3..1F6D4;N # So [2] STUPA..PAGODA
> 1F6D5..1F6D7;W # So [3] HINDU TEMPLE..ELEVATOR
> -1F6DD..1F6DF;W # So [3] PLAYGROUND SLIDE..RING BUOY
> +1F6DC..1F6DF;W # So [4] WIRELESS..RING BUOY
> 1F6E0..1F6EA;N # So [11] HAMMER AND WRENCH..NORTHEAST-POINTING AIRPLANE
> 1F6EB..1F6EC;W # So [2] AIRPLANE DEPARTURE..AIRPLANE ARRIVING
> 1F6F0..1F6F3;N # So [4] SATELLITE..PASSENGER SHIP
> 1F6F4..1F6FC;W # So [9] SCOOTER..ROLLER SKATE
> -1F700..1F773;N # So [116] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ALCHEMICAL SYMBOL FOR HALF OUNCE
> -1F780..1F7D8;N # So [89] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NEGATIVE CIRCLED SQUARE
> +1F700..1F776;N # So [119] ALCHEMICAL SYMBOL FOR QUINTESSENCE..LUNAR ECLIPSE
> +1F77B..1F77F;N # So [5] HAUMEA..ORCUS
> +1F780..1F7D9;N # So [90] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NINE POINTED WHITE STAR
> 1F7E0..1F7EB;W # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE
> 1F7F0;W # So HEAVY EQUALS SIGN
> 1F800..1F80B;N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD
> @@ -2551,22 +2583,20 @@ FFFD;A # So REPLACEMENT CHARACTER
> 1F947..1F9FF;W # So [185] FIRST PLACE MEDAL..NAZAR AMULET
> 1FA00..1FA53;N # So [84] NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP
> 1FA60..1FA6D;N # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
> -1FA70..1FA74;W # So [5] BALLET SHOES..THONG SANDAL
> -1FA78..1FA7C;W # So [5] DROP OF BLOOD..CRUTCH
> -1FA80..1FA86;W # So [7] YO-YO..NESTING DOLLS
> -1FA90..1FAAC;W # So [29] RINGED PLANET..HAMSA
> -1FAB0..1FABA;W # So [11] FLY..NEST WITH EGGS
> -1FAC0..1FAC5;W # So [6] ANATOMICAL HEART..PERSON WITH CROWN
> -1FAD0..1FAD9;W # So [10] BLUEBERRIES..JAR
> -1FAE0..1FAE7;W # So [8] MELTING FACE..BUBBLES
> -1FAF0..1FAF6;W # So [7] HAND WITH INDEX FINGER AND THUMB CROSSED..HEART HANDS
> +1FA70..1FA7C;W # So [13] BALLET SHOES..CRUTCH
> +1FA80..1FA88;W # So [9] YO-YO..FLUTE
> +1FA90..1FABD;W # So [46] RINGED PLANET..WING
> +1FABF..1FAC5;W # So [7] GOOSE..PERSON WITH CROWN
> +1FACE..1FADB;W # So [14] MOOSE..PEA POD
> +1FAE0..1FAE8;W # So [9] MELTING FACE..SHAKING FACE
> +1FAF0..1FAF8;W # So [9] HAND WITH INDEX FINGER AND THUMB CROSSED..RIGHTWARDS PUSHING HAND
> 1FB00..1FB92;N # So [147] BLOCK SEXTANT-1..UPPER HALF INVERSE MEDIUM SHADE AND LOWER HALF BLOCK
> 1FB94..1FBCA;N # So [55] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..WHITE UP-POINTING CHEVRON
> 1FBF0..1FBF9;N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
> 20000..2A6DF;W # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> 2A6E0..2A6FF;W # Cn [32] <reserved-2A6E0>..<reserved-2A6FF>
> -2A700..2B738;W # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> -2B739..2B73F;W # Cn [7] <reserved-2B739>..<reserved-2B73F>
> +2A700..2B739;W # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> +2B73A..2B73F;W # Cn [6] <reserved-2B73A>..<reserved-2B73F>
> 2B740..2B81D;W # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B81E..2B81F;W # Cn [2] <reserved-2B81E>..<reserved-2B81F>
> 2B820..2CEA1;W # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> @@ -2577,7 +2607,9 @@ FFFD;A # So REPLACEMENT CHARACTER
> 2FA1E..2FA1F;W # Cn [2] <reserved-2FA1E>..<reserved-2FA1F>
> 2FA20..2FFFD;W # Cn [1502] <reserved-2FA20>..<reserved-2FFFD>
> 30000..3134A;W # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> -3134B..3FFFD;W # Cn [60595] <reserved-3134B>..<reserved-3FFFD>
> +3134B..3134F;W # Cn [5] <reserved-3134B>..<reserved-3134F>
> +31350..323AF;W # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> +323B0..3FFFD;W # Cn [56398] <reserved-323B0>..<reserved-3FFFD>
> E0001;N # Cf LANGUAGE TAG
> E0020..E007F;N # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF;A # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
> diff --git a/data/GraphemeBreakProperty.txt b/data/GraphemeBreakProperty.txt
> index dd25690..a12b5ee 100644
> --- a/data/GraphemeBreakProperty.txt
> +++ b/data/GraphemeBreakProperty.txt
> @@ -1,11 +1,11 @@
> -# GraphemeBreakProperty-14.0.0.txt
> -# Date: 2021-08-12, 23:13:02 GMT
> -# © 2021 Unicode®, Inc.
> +# GraphemeBreakProperty-15.0.0.txt
> +# Date: 2022-04-27, 17:07:38 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
>
> # ================================================
>
> @@ -32,8 +32,9 @@
> 11A3A ; Prepend # Lo ZANABAZAR SQUARE CLUSTER-INITIAL LETTER RA
> 11A84..11A89 ; Prepend # Lo [6] SOYOMBO SIGN JIHVAMULIYA..SOYOMBO CLUSTER-INITIAL LETTER SA
> 11D46 ; Prepend # Lo MASARAM GONDI REPHA
> +11F02 ; Prepend # Lo KAWI SIGN REPHA
>
> -# Total code points: 26
> +# Total code points: 27
>
> # ================================================
>
> @@ -67,7 +68,7 @@
> FEFF ; Control # Cf ZERO WIDTH NO-BREAK SPACE
> FFF0..FFF8 ; Control # Cn [9] <reserved-FFF0>..<reserved-FFF8>
> FFF9..FFFB ; Control # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
> -13430..13438 ; Control # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
> +13430..1343F ; Control # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> 1BCA0..1BCA3 ; Control # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
> 1D173..1D17A ; Control # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
> E0000 ; Control # Cn <reserved-E0000>
> @@ -76,7 +77,7 @@ E0002..E001F ; Control # Cn [30] <reserved-E0002>..<reserved-E001F>
> E0080..E00FF ; Control # Cn [128] <reserved-E0080>..<reserved-E00FF>
> E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
>
> -# Total code points: 3886
> +# Total code points: 3893
>
> # ================================================
>
> @@ -185,7 +186,7 @@ E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
> 0E47..0E4E ; Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
> 0EB1 ; Extend # Mn LAO VOWEL SIGN MAI KAN
> 0EB4..0EBC ; Extend # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
> -0EC8..0ECD ; Extend # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; Extend # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0F18..0F19 ; Extend # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> 0F35 ; Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> 0F37 ; Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> @@ -324,6 +325,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 10AE5..10AE6 ; Extend # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> 10D24..10D27 ; Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> 10EAB..10EAC ; Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> +10EFD..10EFF ; Extend # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F46..10F50 ; Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> 10F82..10F85 ; Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> 11001 ; Extend # Mn BRAHMI SIGN ANUSVARA
> @@ -346,6 +348,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11234 ; Extend # Mn KHOJKI SIGN ANUSVARA
> 11236..11237 ; Extend # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; Extend # Mn KHOJKI SIGN SUKUN
> +11241 ; Extend # Mn KHOJKI VOWEL SIGN VOCALIC R
> 112DF ; Extend # Mn KHUDAWADI SIGN ANUSVARA
> 112E3..112EA ; Extend # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
> 11300..11301 ; Extend # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
> @@ -413,6 +416,12 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11D95 ; Extend # Mn GUNJALA GONDI SIGN ANUSVARA
> 11D97 ; Extend # Mn GUNJALA GONDI VIRAMA
> 11EF3..11EF4 ; Extend # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> +11F00..11F01 ; Extend # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F36..11F3A ; Extend # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F40 ; Extend # Mn KAWI VOWEL SIGN EU
> +11F42 ; Extend # Mn KAWI CONJOINER
> +13440 ; Extend # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13447..13455 ; Extend # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 16AF0..16AF4 ; Extend # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> 16B30..16B36 ; Extend # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> 16F4F ; Extend # Mn MIAO SIGN CONSONANT MODIFIER BAR
> @@ -439,16 +448,18 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 1E01B..1E021 ; Extend # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Extend # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Extend # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E08F ; Extend # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E130..1E136 ; Extend # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E2AE ; Extend # Mn TOTO SIGN RISING TONE
> 1E2EC..1E2EF ; Extend # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> +1E4EC..1E4EF ; Extend # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> 1E8D0..1E8D6 ; Extend # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> 1E944..1E94A ; Extend # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
> 1F3FB..1F3FF ; Extend # Sk [5] EMOJI MODIFIER FITZPATRICK TYPE-1-2..EMOJI MODIFIER FITZPATRICK TYPE-6
> E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 2095
> +# Total code points: 2130
>
> # ================================================
>
> @@ -489,6 +500,7 @@ E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
> 0CC3..0CC4 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
> 0CC7..0CC8 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
> 0CCA..0CCB ; SpacingMark # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
> +0CF3 ; SpacingMark # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D02..0D03 ; SpacingMark # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D3F..0D40 ; SpacingMark # Mc [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
> 0D46..0D48 ; SpacingMark # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
> @@ -614,12 +626,16 @@ ABEC ; SpacingMark # Mc MEETEI MAYEK LUM IYEK
> 11D93..11D94 ; SpacingMark # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
> 11D96 ; SpacingMark # Mc GUNJALA GONDI SIGN VISARGA
> 11EF5..11EF6 ; SpacingMark # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F03 ; SpacingMark # Mc KAWI SIGN VISARGA
> +11F34..11F35 ; SpacingMark # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F3E..11F3F ; SpacingMark # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F41 ; SpacingMark # Mc KAWI SIGN KILLER
> 16F51..16F87 ; SpacingMark # Mc [55] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN UI
> 16FF0..16FF1 ; SpacingMark # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
> 1D166 ; SpacingMark # Mc MUSICAL SYMBOL COMBINING SPRECHGESANG STEM
> 1D16D ; SpacingMark # Mc MUSICAL SYMBOL COMBINING AUGMENTATION DOT
>
> -# Total code points: 388
> +# Total code points: 395
>
> # ================================================
>
> diff --git a/data/GraphemeBreakTest.txt b/data/GraphemeBreakTest.txt
> index eff2fd3..3c73f97 100644
> --- a/data/GraphemeBreakTest.txt
> +++ b/data/GraphemeBreakTest.txt
> @@ -1,11 +1,11 @@
> -# GraphemeBreakTest-14.0.0.txt
> -# Date: 2021-03-08, 06:22:32 GMT
> -# © 2021 Unicode®, Inc.
> +# GraphemeBreakTest-15.0.0.txt
> +# Date: 2022-02-26, 00:38:37 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
> #
> # Default Grapheme_Cluster_Break Test
> #
> diff --git a/data/LineBreak.txt b/data/LineBreak.txt
> index aa5985b..8243333 100644
> --- a/data/LineBreak.txt
> +++ b/data/LineBreak.txt
> @@ -1,6 +1,6 @@
> -# LineBreak-14.0.0.txt
> -# Date: 2021-07-06, 09:58:55 GMT [KW, LI]
> -# © 2021 Unicode®, Inc.
> +# LineBreak-15.0.0.txt
> +# Date: 2022-07-28, 09:20:42 GMT [KW, LI]
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> # For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> @@ -481,6 +481,7 @@
> 0CE2..0CE3;CM # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> 0CE6..0CEF;NU # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
> 0CF1..0CF2;AL # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
> +0CF3;CM # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01;CM # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03;CM # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D04..0D0C;AL # Lo [9] MALAYALAM LETTER VEDIC ANUSVARA..MALAYALAM LETTER VOCALIC L
> @@ -542,7 +543,7 @@
> 0EBD;SA # Lo LAO SEMIVOWEL SIGN NYO
> 0EC0..0EC4;SA # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
> 0EC6;SA # Lm LAO KO LA
> -0EC8..0ECD;SA # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE;SA # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0ED0..0ED9;NU # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
> 0EDC..0EDF;SA # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
> 0F00;AL # Lo TIBETAN SYLLABLE OM
> @@ -855,7 +856,11 @@
> 1D79..1D7F;AL # Ll [7] LATIN SMALL LETTER INSULAR G..LATIN SMALL LETTER UPSILON WITH STROKE
> 1D80..1D9A;AL # Ll [27] LATIN SMALL LETTER B WITH PALATAL HOOK..LATIN SMALL LETTER EZH WITH RETROFLEX HOOK
> 1D9B..1DBF;AL # Lm [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
> -1DC0..1DFF;CM # Mn [64] COMBINING DOTTED GRAVE ACCENT..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
> +1DC0..1DCC;CM # Mn [13] COMBINING DOTTED GRAVE ACCENT..COMBINING MACRON-BREVE
> +1DCD;GL # Mn COMBINING DOUBLE CIRCUMFLEX ABOVE
> +1DCE..1DFB;CM # Mn [46] COMBINING OGONEK ABOVE..COMBINING DELETION MARK
> +1DFC;GL # Mn COMBINING DOUBLE INVERTED BREVE BELOW
> +1DFD..1DFF;CM # Mn [3] COMBINING ALMOST EQUAL TO BELOW..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
> 1E00..1EFF;AL # L& [256] LATIN CAPITAL LETTER A WITH RING BELOW..LATIN SMALL LETTER Y WITH LOOP
> 1F00..1F15;AL # L& [22] GREEK SMALL LETTER ALPHA WITH PSILI..GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA
> 1F18..1F1D;AL # Lu [6] GREEK CAPITAL LETTER EPSILON WITH PSILI..GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA
> @@ -931,7 +936,7 @@
> 2054;AL # Pc INVERTED UNDERTIE
> 2055;AL # Po FLOWER PUNCTUATION MARK
> 2056;BA # Po THREE DOT PUNCTUATION
> -2057;AL # Po QUADRUPLE PRIME
> +2057;PO # Po QUADRUPLE PRIME
> 2058..205B;BA # Po [4] FOUR DOT PUNCTUATION..FOUR DOT MARK
> 205C;AL # Po DOTTED CROSS
> 205D..205E;BA # Po [2] TRICOLON..VERTICAL FOUR DOTS
> @@ -2793,6 +2798,7 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 10EAB..10EAC;CM # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> 10EAD;BA # Pd YEZIDI HYPHENATION MARK
> 10EB0..10EB1;AL # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
> +10EFD..10EFF;CM # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F00..10F1C;AL # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
> 10F1D..10F26;AL # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
> 10F27;AL # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
> @@ -2882,6 +2888,8 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1123B..1123C;BA # Po [2] KHOJKI SECTION MARK..KHOJKI DOUBLE SECTION MARK
> 1123D;AL # Po KHOJKI ABBREVIATION SIGN
> 1123E;CM # Mn KHOJKI SIGN SUKUN
> +1123F..11240;AL # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> +11241;CM # Mn KHOJKI VOWEL SIGN VOCALIC R
> 11280..11286;AL # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288;AL # Lo MULTANI LETTER GHA
> 1128A..1128D;AL # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -3055,6 +3063,7 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 11AA1..11AA2;BA # Po [2] SOYOMBO TERMINAL MARK-1..SOYOMBO TERMINAL MARK-2
> 11AB0..11ABF;AL # Lo [16] CANADIAN SYLLABICS NATTILIK HI..CANADIAN SYLLABICS SPA
> 11AC0..11AF8;AL # Lo [57] PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL STOP FINAL
> +11B00..11B09;BB # Po [10] DEVANAGARI HEAD MARK..DEVANAGARI SIGN MINDU
> 11C00..11C08;AL # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
> 11C0A..11C2E;AL # Lo [37] BHAIKSUKI LETTER E..BHAIKSUKI LETTER HA
> 11C2F;CM # Mc BHAIKSUKI VOWEL SIGN AA
> @@ -3101,6 +3110,20 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 11EF3..11EF4;CM # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6;CM # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> 11EF7..11EF8;AL # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> +11F00..11F01;CM # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F02;AL # Lo KAWI SIGN REPHA
> +11F03;CM # Mc KAWI SIGN VISARGA
> +11F04..11F10;AL # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33;AL # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> +11F34..11F35;CM # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A;CM # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F;CM # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40;CM # Mn KAWI VOWEL SIGN EU
> +11F41;CM # Mc KAWI SIGN KILLER
> +11F42;CM # Mn KAWI CONJOINER
> +11F43..11F44;BA # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
> +11F45..11F4F;ID # Po [11] KAWI PUNCTUATION SECTION MARKER..KAWI PUNCTUATION CLOSING SPIRAL
> +11F50..11F59;NU # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 11FB0;AL # Lo LISU LETTER YHA
> 11FC0..11FD4;AL # No [21] TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH..TAMIL FRACTION DOWNSCALING FACTOR KIIZH
> 11FD5..11FDC;AL # So [8] TAMIL SIGN NEL..TAMIL SIGN MUKKURUNI
> @@ -3126,10 +3149,18 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1328A..13378;AL # Lo [239] EGYPTIAN HIEROGLYPH O037..EGYPTIAN HIEROGLYPH V011
> 13379;OP # Lo EGYPTIAN HIEROGLYPH V011A
> 1337A..1337B;CL # Lo [2] EGYPTIAN HIEROGLYPH V011B..EGYPTIAN HIEROGLYPH V011C
> -1337C..1342E;AL # Lo [179] EGYPTIAN HIEROGLYPH V012..EGYPTIAN HIEROGLYPH AA032
> +1337C..1342F;AL # Lo [180] EGYPTIAN HIEROGLYPH V012..EGYPTIAN HIEROGLYPH V011D
> 13430..13436;GL # Cf [7] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH OVERLAY MIDDLE
> 13437;OP # Cf EGYPTIAN HIEROGLYPH BEGIN SEGMENT
> 13438;CL # Cf EGYPTIAN HIEROGLYPH END SEGMENT
> +13439..1343B;GL # Cf [3] EGYPTIAN HIEROGLYPH INSERT AT MIDDLE..EGYPTIAN HIEROGLYPH INSERT AT BOTTOM
> +1343C;OP # Cf EGYPTIAN HIEROGLYPH BEGIN ENCLOSURE
> +1343D;CL # Cf EGYPTIAN HIEROGLYPH END ENCLOSURE
> +1343E;OP # Cf EGYPTIAN HIEROGLYPH BEGIN WALLED ENCLOSURE
> +1343F;CL # Cf EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> +13440;CM # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13441..13446;AL # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> +13447..13455;CM # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 14400..145CD;AL # Lo [462] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A409
> 145CE;OP # Lo ANATOLIAN HIEROGLYPH A410 BEGIN LOGOGRAM MARK
> 145CF;CL # Lo ANATOLIAN HIEROGLYPH A410A END LOGOGRAM MARK
> @@ -3179,7 +3210,9 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1AFFD..1AFFE;AL # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B0FF;ID # Lo [256] KATAKANA LETTER ARCHAIC E..HENTAIGANA LETTER RE-2
> 1B100..1B122;ID # Lo [35] HENTAIGANA LETTER RE-3..KATAKANA LETTER ARCHAIC WU
> +1B132;CJ # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152;CJ # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155;CJ # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167;CJ # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB;ID # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A;AL # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -3210,6 +3243,7 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1D200..1D241;AL # So [66] GREEK VOCAL NOTATION SYMBOL-1..GREEK INSTRUMENTAL NOTATION SYMBOL-54
> 1D242..1D244;CM # Mn [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
> 1D245;AL # So GREEK MUSICAL LEIMMA
> +1D2C0..1D2D3;AL # No [20] KAKTOVIK NUMERAL ZERO..KAKTOVIK NUMERAL NINETEEN
> 1D2E0..1D2F3;AL # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
> 1D300..1D356;AL # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
> 1D360..1D378;AL # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
> @@ -3270,11 +3304,14 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1DF00..1DF09;AL # Ll [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A;AL # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E;AL # Ll [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A;AL # Ll [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> 1E000..1E006;CM # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> 1E008..1E018;CM # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> 1E01B..1E021;CM # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024;CM # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A;CM # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E030..1E06D;AL # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> +1E08F;CM # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E100..1E12C;AL # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E130..1E136;CM # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E137..1E13D;AL # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> @@ -3287,6 +3324,10 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1E2EC..1E2EF;CM # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> 1E2F0..1E2F9;NU # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> 1E2FF;PR # Sc WANCHO NGUN SIGN
> +1E4D0..1E4EA;AL # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB;AL # Lm NAG MUNDARI SIGN OJOD
> +1E4EC..1E4EF;CM # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> +1E4F0..1E4F9;NU # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E7E0..1E7E6;AL # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB;AL # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE;AL # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -3454,16 +3495,18 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1F6C1..1F6CB;ID # So [11] BATHTUB..COUCH AND LAMP
> 1F6CC;EB # So SLEEPING ACCOMMODATION
> 1F6CD..1F6D7;ID # So [11] SHOPPING BAGS..ELEVATOR
> -1F6D8..1F6DC;ID # Cn [5] <reserved-1F6D8>..<reserved-1F6DC>
> -1F6DD..1F6EC;ID # So [16] PLAYGROUND SLIDE..AIRPLANE ARRIVING
> +1F6D8..1F6DB;ID # Cn [4] <reserved-1F6D8>..<reserved-1F6DB>
> +1F6DC..1F6EC;ID # So [17] WIRELESS..AIRPLANE ARRIVING
> 1F6ED..1F6EF;ID # Cn [3] <reserved-1F6ED>..<reserved-1F6EF>
> 1F6F0..1F6FC;ID # So [13] SATELLITE..ROLLER SKATE
> 1F6FD..1F6FF;ID # Cn [3] <reserved-1F6FD>..<reserved-1F6FF>
> 1F700..1F773;AL # So [116] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ALCHEMICAL SYMBOL FOR HALF OUNCE
> -1F774..1F77F;ID # Cn [12] <reserved-1F774>..<reserved-1F77F>
> +1F774..1F776;ID # So [3] LOT OF FORTUNE..LUNAR ECLIPSE
> +1F777..1F77A;ID # Cn [4] <reserved-1F777>..<reserved-1F77A>
> +1F77B..1F77F;ID # So [5] HAUMEA..ORCUS
> 1F780..1F7D4;AL # So [85] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..HEAVY TWELVE POINTED PINWHEEL STAR
> -1F7D5..1F7D8;ID # So [4] CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE
> -1F7D9..1F7DF;ID # Cn [7] <reserved-1F7D9>..<reserved-1F7DF>
> +1F7D5..1F7D9;ID # So [5] CIRCLED TRIANGLE..NINE POINTED WHITE STAR
> +1F7DA..1F7DF;ID # Cn [6] <reserved-1F7DA>..<reserved-1F7DF>
> 1F7E0..1F7EB;ID # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE
> 1F7EC..1F7EF;ID # Cn [4] <reserved-1F7EC>..<reserved-1F7EF>
> 1F7F0;ID # So HEAVY EQUALS SIGN
> @@ -3509,33 +3552,29 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 1FA54..1FA5F;ID # Cn [12] <reserved-1FA54>..<reserved-1FA5F>
> 1FA60..1FA6D;ID # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
> 1FA6E..1FA6F;ID # Cn [2] <reserved-1FA6E>..<reserved-1FA6F>
> -1FA70..1FA74;ID # So [5] BALLET SHOES..THONG SANDAL
> -1FA75..1FA77;ID # Cn [3] <reserved-1FA75>..<reserved-1FA77>
> -1FA78..1FA7C;ID # So [5] DROP OF BLOOD..CRUTCH
> +1FA70..1FA7C;ID # So [13] BALLET SHOES..CRUTCH
> 1FA7D..1FA7F;ID # Cn [3] <reserved-1FA7D>..<reserved-1FA7F>
> -1FA80..1FA86;ID # So [7] YO-YO..NESTING DOLLS
> -1FA87..1FA8F;ID # Cn [9] <reserved-1FA87>..<reserved-1FA8F>
> -1FA90..1FAAC;ID # So [29] RINGED PLANET..HAMSA
> -1FAAD..1FAAF;ID # Cn [3] <reserved-1FAAD>..<reserved-1FAAF>
> -1FAB0..1FABA;ID # So [11] FLY..NEST WITH EGGS
> -1FABB..1FABF;ID # Cn [5] <reserved-1FABB>..<reserved-1FABF>
> -1FAC0..1FAC2;ID # So [3] ANATOMICAL HEART..PEOPLE HUGGING
> +1FA80..1FA88;ID # So [9] YO-YO..FLUTE
> +1FA89..1FA8F;ID # Cn [7] <reserved-1FA89>..<reserved-1FA8F>
> +1FA90..1FABD;ID # So [46] RINGED PLANET..WING
> +1FABE;ID # Cn <reserved-1FABE>
> +1FABF..1FAC2;ID # So [4] GOOSE..PEOPLE HUGGING
> 1FAC3..1FAC5;EB # So [3] PREGNANT MAN..PERSON WITH CROWN
> -1FAC6..1FACF;ID # Cn [10] <reserved-1FAC6>..<reserved-1FACF>
> -1FAD0..1FAD9;ID # So [10] BLUEBERRIES..JAR
> -1FADA..1FADF;ID # Cn [6] <reserved-1FADA>..<reserved-1FADF>
> -1FAE0..1FAE7;ID # So [8] MELTING FACE..BUBBLES
> -1FAE8..1FAEF;ID # Cn [8] <reserved-1FAE8>..<reserved-1FAEF>
> -1FAF0..1FAF6;EB # So [7] HAND WITH INDEX FINGER AND THUMB CROSSED..HEART HANDS
> -1FAF7..1FAFF;ID # Cn [9] <reserved-1FAF7>..<reserved-1FAFF>
> +1FAC6..1FACD;ID # Cn [8] <reserved-1FAC6>..<reserved-1FACD>
> +1FACE..1FADB;ID # So [14] MOOSE..PEA POD
> +1FADC..1FADF;ID # Cn [4] <reserved-1FADC>..<reserved-1FADF>
> +1FAE0..1FAE8;ID # So [9] MELTING FACE..SHAKING FACE
> +1FAE9..1FAEF;ID # Cn [7] <reserved-1FAE9>..<reserved-1FAEF>
> +1FAF0..1FAF8;EB # So [9] HAND WITH INDEX FINGER AND THUMB CROSSED..RIGHTWARDS PUSHING HAND
> +1FAF9..1FAFF;ID # Cn [7] <reserved-1FAF9>..<reserved-1FAFF>
> 1FB00..1FB92;AL # So [147] BLOCK SEXTANT-1..UPPER HALF INVERSE MEDIUM SHADE AND LOWER HALF BLOCK
> 1FB94..1FBCA;AL # So [55] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..WHITE UP-POINTING CHEVRON
> 1FBF0..1FBF9;NU # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
> 1FC00..1FFFD;ID # Cn [1022] <reserved-1FC00>..<reserved-1FFFD>
> 20000..2A6DF;ID # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> 2A6E0..2A6FF;ID # Cn [32] <reserved-2A6E0>..<reserved-2A6FF>
> -2A700..2B738;ID # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> -2B739..2B73F;ID # Cn [7] <reserved-2B739>..<reserved-2B73F>
> +2A700..2B739;ID # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> +2B73A..2B73F;ID # Cn [6] <reserved-2B73A>..<reserved-2B73F>
> 2B740..2B81D;ID # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B81E..2B81F;ID # Cn [2] <reserved-2B81E>..<reserved-2B81F>
> 2B820..2CEA1;ID # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> @@ -3546,7 +3585,9 @@ FFFD;AI # So REPLACEMENT CHARACTER
> 2FA1E..2FA1F;ID # Cn [2] <reserved-2FA1E>..<reserved-2FA1F>
> 2FA20..2FFFD;ID # Cn [1502] <reserved-2FA20>..<reserved-2FFFD>
> 30000..3134A;ID # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> -3134B..3FFFD;ID # Cn [60595] <reserved-3134B>..<reserved-3FFFD>
> +3134B..3134F;ID # Cn [5] <reserved-3134B>..<reserved-3134F>
> +31350..323AF;ID # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> +323B0..3FFFD;ID # Cn [56398] <reserved-323B0>..<reserved-3FFFD>
> E0001;CM # Cf LANGUAGE TAG
> E0020..E007F;CM # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF;CM # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
> diff --git a/data/LineBreakTest.txt b/data/LineBreakTest.txt
> index 8d1cef0..3122a2e 100644
> --- a/data/LineBreakTest.txt
> +++ b/data/LineBreakTest.txt
> @@ -1,11 +1,11 @@
> -# LineBreakTest-14.0.0.txt
> -# Date: 2021-08-20, 21:08:45 GMT
> -# © 2021 Unicode®, Inc.
> +# LineBreakTest-15.0.0.txt
> +# Date: 2022-02-26, 00:38:39 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
> #
> # Default Line_Break Test
> #
> diff --git a/data/SentenceBreakProperty.txt b/data/SentenceBreakProperty.txt
> index 4b12b85..66fbf01 100644
> --- a/data/SentenceBreakProperty.txt
> +++ b/data/SentenceBreakProperty.txt
> @@ -1,11 +1,11 @@
> -# SentenceBreakProperty-14.0.0.txt
> -# Date: 2021-08-12, 23:13:21 GMT
> -# © 2021 Unicode®, Inc.
> +# SentenceBreakProperty-15.0.0.txt
> +# Date: 2022-08-05, 22:17:35 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
>
> # ================================================
>
> @@ -144,6 +144,7 @@
> 0CCC..0CCD ; Extend # Mn [2] KANNADA VOWEL SIGN AU..KANNADA SIGN VIRAMA
> 0CD5..0CD6 ; Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
> 0CE2..0CE3 ; Extend # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> +0CF3 ; Extend # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01 ; Extend # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03 ; Extend # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D3B..0D3C ; Extend # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
> @@ -167,7 +168,7 @@
> 0E47..0E4E ; Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
> 0EB1 ; Extend # Mn LAO VOWEL SIGN MAI KAN
> 0EB4..0EBC ; Extend # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
> -0EC8..0ECD ; Extend # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; Extend # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0F18..0F19 ; Extend # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> 0F35 ; Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> 0F37 ; Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> @@ -371,6 +372,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 10AE5..10AE6 ; Extend # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> 10D24..10D27 ; Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> 10EAB..10EAC ; Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> +10EFD..10EFF ; Extend # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F46..10F50 ; Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> 10F82..10F85 ; Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> 11000 ; Extend # Mc BRAHMI SIGN CANDRABINDU
> @@ -407,6 +409,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11235 ; Extend # Mc KHOJKI SIGN VIRAMA
> 11236..11237 ; Extend # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; Extend # Mn KHOJKI SIGN SUKUN
> +11241 ; Extend # Mn KHOJKI VOWEL SIGN VOCALIC R
> 112DF ; Extend # Mn KHUDAWADI SIGN ANUSVARA
> 112E0..112E2 ; Extend # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
> 112E3..112EA ; Extend # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
> @@ -516,6 +519,16 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11D97 ; Extend # Mn GUNJALA GONDI VIRAMA
> 11EF3..11EF4 ; Extend # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6 ; Extend # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F00..11F01 ; Extend # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F03 ; Extend # Mc KAWI SIGN VISARGA
> +11F34..11F35 ; Extend # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A ; Extend # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F ; Extend # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40 ; Extend # Mn KAWI VOWEL SIGN EU
> +11F41 ; Extend # Mc KAWI SIGN KILLER
> +11F42 ; Extend # Mn KAWI CONJOINER
> +13440 ; Extend # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13447..13455 ; Extend # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 16AF0..16AF4 ; Extend # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> 16B30..16B36 ; Extend # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> 16F4F ; Extend # Mn MIAO SIGN CONSONANT MODIFIER BAR
> @@ -544,15 +557,17 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 1E01B..1E021 ; Extend # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Extend # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Extend # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E08F ; Extend # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E130..1E136 ; Extend # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E2AE ; Extend # Mn TOTO SIGN RISING TONE
> 1E2EC..1E2EF ; Extend # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> +1E4EC..1E4EF ; Extend # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> 1E8D0..1E8D6 ; Extend # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> 1E944..1E94A ; Extend # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
> E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 2508
> +# Total code points: 2550
>
> # ================================================
>
> @@ -581,12 +596,12 @@ FEFF ; Format # Cf ZERO WIDTH NO-BREAK SPACE
> FFF9..FFFB ; Format # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
> 110BD ; Format # Cf KAITHI NUMBER SIGN
> 110CD ; Format # Cf KAITHI NUMBER SIGN ABOVE
> -13430..13438 ; Format # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
> +13430..1343F ; Format # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> 1BCA0..1BCA3 ; Format # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
> 1D173..1D17A ; Format # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
> E0001 ; Format # Cf LANGUAGE TAG
>
> -# Total code points: 65
> +# Total code points: 72
>
> # ================================================
>
> @@ -880,6 +895,7 @@ E0001 ; Format # Cf LANGUAGE TAG
> 052D ; Lower # L& CYRILLIC SMALL LETTER DCHE
> 052F ; Lower # L& CYRILLIC SMALL LETTER EL WITH DESCENDER
> 0560..0588 ; Lower # L& [41] ARMENIAN SMALL LETTER TURNED AYB..ARMENIAN SMALL LETTER YI WITH STROKE
> +10FC ; Lower # Lm MODIFIER LETTER GEORGIAN NAR
> 13F8..13FD ; Lower # L& [6] CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETTER MV
> 1C80..1C88 ; Lower # L& [9] CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SMALL LETTER UNBLENDED UK
> 1D00..1D2B ; Lower # L& [44] LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTER SMALL CAPITAL EL
> @@ -1228,12 +1244,14 @@ A7D3 ; Lower # L& LATIN SMALL LETTER DOUBLE THORN
> A7D5 ; Lower # L& LATIN SMALL LETTER DOUBLE WYNN
> A7D7 ; Lower # L& LATIN SMALL LETTER MIDDLE SCOTS S
> A7D9 ; Lower # L& LATIN SMALL LETTER SIGMOID S
> +A7F2..A7F4 ; Lower # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
> A7F6 ; Lower # L& LATIN SMALL LETTER REVERSED HALF H
> A7F8..A7F9 ; Lower # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
> A7FA ; Lower # L& LATIN LETTER SMALL CAPITAL TURNED M
> AB30..AB5A ; Lower # L& [43] LATIN SMALL LETTER BARRED ALPHA..LATIN SMALL LETTER Y WITH SHORT RIGHT LEG
> AB5C..AB5F ; Lower # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
> AB60..AB68 ; Lower # L& [9] LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LETTER TURNED R WITH MIDDLE TILDE
> +AB69 ; Lower # Lm MODIFIER LETTER SMALL TURNED W
> AB70..ABBF ; Lower # L& [80] CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTER YA
> FB00..FB06 ; Lower # L& [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
> FB13..FB17 ; Lower # L& [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
> @@ -1281,9 +1299,11 @@ FF41..FF5A ; Lower # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
> 1D7CB ; Lower # L& MATHEMATICAL BOLD SMALL DIGAMMA
> 1DF00..1DF09 ; Lower # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0B..1DF1E ; Lower # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; Lower # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; Lower # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E922..1E943 ; Lower # L& [34] ADLAM SMALL LETTER ALIF..ADLAM SMALL LETTER SHA
>
> -# Total code points: 2424
> +# Total code points: 2497
>
> # ================================================
>
> @@ -2102,7 +2122,6 @@ FF21..FF3A ; Upper # L& [26] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH LAT
> 1075..1081 ; OLetter # Lo [13] MYANMAR LETTER SHAN KA..MYANMAR LETTER SHAN HA
> 108E ; OLetter # Lo MYANMAR LETTER RUMAI PALAUNG FA
> 10D0..10FA ; OLetter # L& [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
> -10FC ; OLetter # Lm MODIFIER LETTER GEORGIAN NAR
> 10FD..10FF ; OLetter # L& [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
> 1100..1248 ; OLetter # Lo [329] HANGUL CHOSEONG KIYEOK..ETHIOPIC SYLLABLE QWA
> 124A..124D ; OLetter # Lo [4] ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
> @@ -2215,7 +2234,6 @@ A6E6..A6EF ; OLetter # Nl [10] BAMUM LETTER MO..BAMUM LETTER KOGHOM
> A717..A71F ; OLetter # Lm [9] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
> A788 ; OLetter # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
> A78F ; OLetter # Lo LATIN LETTER SINOLOGICAL DOT
> -A7F2..A7F4 ; OLetter # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
> A7F7 ; OLetter # Lo LATIN EPIGRAPHIC LETTER SIDEWAYS I
> A7FB..A801 ; OLetter # Lo [7] LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI NAGRI LETTER I
> A803..A805 ; OLetter # Lo [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O
> @@ -2258,7 +2276,6 @@ AB09..AB0E ; OLetter # Lo [6] ETHIOPIC SYLLABLE DDHU..ETHIOPIC SYLLABLE DDH
> AB11..AB16 ; OLetter # Lo [6] ETHIOPIC SYLLABLE DZU..ETHIOPIC SYLLABLE DZO
> AB20..AB26 ; OLetter # Lo [7] ETHIOPIC SYLLABLE CCHHA..ETHIOPIC SYLLABLE CCHHO
> AB28..AB2E ; OLetter # Lo [7] ETHIOPIC SYLLABLE BBA..ETHIOPIC SYLLABLE BBO
> -AB69 ; OLetter # Lm MODIFIER LETTER SMALL TURNED W
> ABC0..ABE2 ; OLetter # Lo [35] MEETEI MAYEK LETTER KOK..MEETEI MAYEK LETTER I LONSUM
> AC00..D7A3 ; OLetter # Lo [11172] HANGUL SYLLABLE GA..HANGUL SYLLABLE HIH
> D7B0..D7C6 ; OLetter # Lo [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
> @@ -2366,6 +2383,7 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 111DC ; OLetter # Lo SHARADA HEADSTROKE
> 11200..11211 ; OLetter # Lo [18] KHOJKI LETTER A..KHOJKI LETTER JJA
> 11213..1122B ; OLetter # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
> +1123F..11240 ; OLetter # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> 11280..11286 ; OLetter # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; OLetter # Lo MULTANI LETTER GHA
> 1128A..1128D ; OLetter # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -2427,12 +2445,16 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 11D6A..11D89 ; OLetter # Lo [32] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER SA
> 11D98 ; OLetter # Lo GUNJALA GONDI OM
> 11EE0..11EF2 ; OLetter # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> +11F02 ; OLetter # Lo KAWI SIGN REPHA
> +11F04..11F10 ; OLetter # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; OLetter # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> 11FB0 ; OLetter # Lo LISU LETTER YHA
> 12000..12399 ; OLetter # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; OLetter # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; OLetter # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; OLetter # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; OLetter # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; OLetter # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; OLetter # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; OLetter # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; OLetter # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; OLetter # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -2454,7 +2476,9 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1AFF5..1AFFB ; OLetter # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> 1AFFD..1AFFE ; OLetter # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000..1B122 ; OLetter # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
> +1B132 ; OLetter # Lo HIRAGANA LETTER SMALL KO
> 1B150..1B152 ; OLetter # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
> +1B155 ; OLetter # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; OLetter # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
> 1B170..1B2FB ; OLetter # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> 1BC00..1BC6A ; OLetter # Lo [107] DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M
> @@ -2467,6 +2491,8 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1E14E ; OLetter # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
> 1E290..1E2AD ; OLetter # Lo [30] TOTO LETTER PA..TOTO LETTER A
> 1E2C0..1E2EB ; OLetter # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> +1E4D0..1E4EA ; OLetter # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; OLetter # Lm NAG MUNDARI SIGN OJOD
> 1E7E0..1E7E6 ; OLetter # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; OLetter # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; OLetter # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -2507,14 +2533,15 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1EEA5..1EEA9 ; OLetter # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
> 1EEAB..1EEBB ; OLetter # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> 20000..2A6DF ; OLetter # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B738 ; OLetter # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
> +2A700..2B739 ; OLetter # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> 2B740..2B81D ; OLetter # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> 2B820..2CEA1 ; OLetter # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> 2CEB0..2EBE0 ; OLetter # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> 2F800..2FA1D ; OLetter # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> 30000..3134A ; OLetter # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> +31350..323AF ; OLetter # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
>
> -# Total code points: 127761
> +# Total code points: 132036
>
> # ================================================
>
> @@ -2573,16 +2600,18 @@ FF10..FF19 ; Numeric # Nd [10] FULLWIDTH DIGIT ZERO..FULLWIDTH DIGIT NINE
> 11C50..11C59 ; Numeric # Nd [10] BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE
> 11D50..11D59 ; Numeric # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
> 11DA0..11DA9 ; Numeric # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
> +11F50..11F59 ; Numeric # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 16A60..16A69 ; Numeric # Nd [10] MRO DIGIT ZERO..MRO DIGIT NINE
> 16AC0..16AC9 ; Numeric # Nd [10] TANGSA DIGIT ZERO..TANGSA DIGIT NINE
> 16B50..16B59 ; Numeric # Nd [10] PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT NINE
> 1D7CE..1D7FF ; Numeric # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
> 1E140..1E149 ; Numeric # Nd [10] NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG PUACHUE HMONG DIGIT NINE
> 1E2F0..1E2F9 ; Numeric # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> +1E4F0..1E4F9 ; Numeric # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E950..1E959 ; Numeric # Nd [10] ADLAM DIGIT ZERO..ADLAM DIGIT NINE
> 1FBF0..1FBF9 ; Numeric # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
>
> -# Total code points: 662
> +# Total code points: 682
>
> # ================================================
>
> @@ -2664,6 +2693,7 @@ FF61 ; STerm # Po HALFWIDTH IDEOGRAPHIC FULL STOP
> 11A9B..11A9C ; STerm # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
> 11C41..11C42 ; STerm # Po [2] BHAIKSUKI DANDA..BHAIKSUKI DOUBLE DANDA
> 11EF7..11EF8 ; STerm # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> +11F43..11F44 ; STerm # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
> 16A6E..16A6F ; STerm # Po [2] MRO DANDA..MRO DOUBLE DANDA
> 16AF5 ; STerm # Po BASSA VAH FULL STOP
> 16B37..16B38 ; STerm # Po [2] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN VOS TSHAB CEEB
> @@ -2672,7 +2702,7 @@ FF61 ; STerm # Po HALFWIDTH IDEOGRAPHIC FULL STOP
> 1BC9F ; STerm # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
> 1DA88 ; STerm # Po SIGNWRITING FULL STOP
>
> -# Total code points: 149
> +# Total code points: 151
>
> # ================================================
>
> diff --git a/data/SentenceBreakTest.txt b/data/SentenceBreakTest.txt
> index 61ea42c..be53fe9 100644
> --- a/data/SentenceBreakTest.txt
> +++ b/data/SentenceBreakTest.txt
> @@ -1,11 +1,11 @@
> -# SentenceBreakTest-14.0.0.txt
> -# Date: 2021-03-08, 06:22:40 GMT
> -# © 2021 Unicode®, Inc.
> +# SentenceBreakTest-15.0.0.txt
> +# Date: 2022-02-26, 00:39:00 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
> #
> # Default Sentence_Break Test
> #
> diff --git a/data/SpecialCasing.txt b/data/SpecialCasing.txt
> index 1c2e968..08d04fa 100644
> --- a/data/SpecialCasing.txt
> +++ b/data/SpecialCasing.txt
> @@ -1,11 +1,11 @@
> -# SpecialCasing-14.0.0.txt
> -# Date: 2021-03-08, 19:35:55 GMT
> -# © 2021 Unicode®, Inc.
> +# SpecialCasing-15.0.0.txt
> +# Date: 2022-02-02, 23:35:52 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
> #
> # Special Casing
> #
> diff --git a/data/UnicodeData.txt b/data/UnicodeData.txt
> index b5abef7..ea963a7 100644
> --- a/data/UnicodeData.txt
> +++ b/data/UnicodeData.txt
> @@ -2975,6 +2975,7 @@
> 0CEF;KANNADA DIGIT NINE;Nd;0;L;;9;9;9;N;;;;;
> 0CF1;KANNADA SIGN JIHVAMULIYA;Lo;0;L;;;;;N;;;;;
> 0CF2;KANNADA SIGN UPADHMANIYA;Lo;0;L;;;;;N;;;;;
> +0CF3;KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT;Mc;0;L;;;;;N;;;;;
> 0D00;MALAYALAM SIGN COMBINING ANUSVARA ABOVE;Mn;0;NSM;;;;;N;;;;;
> 0D01;MALAYALAM SIGN CANDRABINDU;Mn;0;NSM;;;;;N;;;;;
> 0D02;MALAYALAM SIGN ANUSVARA;Mc;0;L;;;;;N;;;;;
> @@ -3339,6 +3340,7 @@
> 0ECB;LAO TONE MAI CATAWA;Mn;122;NSM;;;;;N;;;;;
> 0ECC;LAO CANCELLATION MARK;Mn;0;NSM;;;;;N;;;;;
> 0ECD;LAO NIGGAHITA;Mn;0;NSM;;;;;N;;;;;
> +0ECE;LAO YAMAKKAN;Mn;0;NSM;;;;;N;;;;;
> 0ED0;LAO DIGIT ZERO;Nd;0;L;;0;0;0;N;;;;;
> 0ED1;LAO DIGIT ONE;Nd;0;L;;1;1;1;N;;;;;
> 0ED2;LAO DIGIT TWO;Nd;0;L;;2;2;2;N;;;;;
> @@ -19393,6 +19395,9 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 10EAD;YEZIDI HYPHENATION MARK;Pd;0;R;;;;;N;;;;;
> 10EB0;YEZIDI LETTER LAM WITH DOT ABOVE;Lo;0;R;;;;;N;;;;;
> 10EB1;YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE;Lo;0;R;;;;;N;;;;;
> +10EFD;ARABIC SMALL LOW WORD SAKTA;Mn;220;NSM;;;;;N;;;;;
> +10EFE;ARABIC SMALL LOW WORD QASR;Mn;220;NSM;;;;;N;;;;;
> +10EFF;ARABIC SMALL LOW WORD MADDA;Mn;220;NSM;;;;;N;;;;;
> 10F00;OLD SOGDIAN LETTER ALEPH;Lo;0;R;;;;;N;;;;;
> 10F01;OLD SOGDIAN LETTER FINAL ALEPH;Lo;0;R;;;;;N;;;;;
> 10F02;OLD SOGDIAN LETTER BETH;Lo;0;R;;;;;N;;;;;
> @@ -20058,6 +20063,9 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1123C;KHOJKI DOUBLE SECTION MARK;Po;0;L;;;;;N;;;;;
> 1123D;KHOJKI ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
> 1123E;KHOJKI SIGN SUKUN;Mn;0;NSM;;;;;N;;;;;
> +1123F;KHOJKI LETTER QA;Lo;0;L;;;;;N;;;;;
> +11240;KHOJKI LETTER SHORT I;Lo;0;L;;;;;N;;;;;
> +11241;KHOJKI VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
> 11280;MULTANI LETTER A;Lo;0;L;;;;;N;;;;;
> 11281;MULTANI LETTER I;Lo;0;L;;;;;N;;;;;
> 11282;MULTANI LETTER U;Lo;0;L;;;;;N;;;;;
> @@ -21256,6 +21264,16 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 11AF6;PAU CIN HAU LOW-FALLING TONE LONG FINAL;Lo;0;L;;;;;N;;;;;
> 11AF7;PAU CIN HAU LOW-FALLING TONE FINAL;Lo;0;L;;;;;N;;;;;
> 11AF8;PAU CIN HAU GLOTTAL STOP FINAL;Lo;0;L;;;;;N;;;;;
> +11B00;DEVANAGARI HEAD MARK;Po;0;L;;;;;N;;;;;
> +11B01;DEVANAGARI HEAD MARK WITH HEADSTROKE;Po;0;L;;;;;N;;;;;
> +11B02;DEVANAGARI SIGN BHALE;Po;0;L;;;;;N;;;;;
> +11B03;DEVANAGARI SIGN BHALE WITH HOOK;Po;0;L;;;;;N;;;;;
> +11B04;DEVANAGARI SIGN EXTENDED BHALE;Po;0;L;;;;;N;;;;;
> +11B05;DEVANAGARI SIGN EXTENDED BHALE WITH HOOK;Po;0;L;;;;;N;;;;;
> +11B06;DEVANAGARI SIGN WESTERN FIVE-LIKE BHALE;Po;0;L;;;;;N;;;;;
> +11B07;DEVANAGARI SIGN WESTERN NINE-LIKE BHALE;Po;0;L;;;;;N;;;;;
> +11B08;DEVANAGARI SIGN REVERSED NINE-LIKE BHALE;Po;0;L;;;;;N;;;;;
> +11B09;DEVANAGARI SIGN MINDU;Po;0;L;;;;;N;;;;;
> 11C00;BHAIKSUKI LETTER A;Lo;0;L;;;;;N;;;;;
> 11C01;BHAIKSUKI LETTER AA;Lo;0;L;;;;;N;;;;;
> 11C02;BHAIKSUKI LETTER I;Lo;0;L;;;;;N;;;;;
> @@ -21584,6 +21602,92 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 11EF6;MAKASAR VOWEL SIGN O;Mc;0;L;;;;;N;;;;;
> 11EF7;MAKASAR PASSIMBANG;Po;0;L;;;;;N;;;;;
> 11EF8;MAKASAR END OF SECTION;Po;0;L;;;;;N;;;;;
> +11F00;KAWI SIGN CANDRABINDU;Mn;0;NSM;;;;;N;;;;;
> +11F01;KAWI SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
> +11F02;KAWI SIGN REPHA;Lo;0;L;;;;;N;;;;;
> +11F03;KAWI SIGN VISARGA;Mc;0;L;;;;;N;;;;;
> +11F04;KAWI LETTER A;Lo;0;L;;;;;N;;;;;
> +11F05;KAWI LETTER AA;Lo;0;L;;;;;N;;;;;
> +11F06;KAWI LETTER I;Lo;0;L;;;;;N;;;;;
> +11F07;KAWI LETTER II;Lo;0;L;;;;;N;;;;;
> +11F08;KAWI LETTER U;Lo;0;L;;;;;N;;;;;
> +11F09;KAWI LETTER UU;Lo;0;L;;;;;N;;;;;
> +11F0A;KAWI LETTER VOCALIC R;Lo;0;L;;;;;N;;;;;
> +11F0B;KAWI LETTER VOCALIC RR;Lo;0;L;;;;;N;;;;;
> +11F0C;KAWI LETTER VOCALIC L;Lo;0;L;;;;;N;;;;;
> +11F0D;KAWI LETTER VOCALIC LL;Lo;0;L;;;;;N;;;;;
> +11F0E;KAWI LETTER E;Lo;0;L;;;;;N;;;;;
> +11F0F;KAWI LETTER AI;Lo;0;L;;;;;N;;;;;
> +11F10;KAWI LETTER O;Lo;0;L;;;;;N;;;;;
> +11F12;KAWI LETTER KA;Lo;0;L;;;;;N;;;;;
> +11F13;KAWI LETTER KHA;Lo;0;L;;;;;N;;;;;
> +11F14;KAWI LETTER GA;Lo;0;L;;;;;N;;;;;
> +11F15;KAWI LETTER GHA;Lo;0;L;;;;;N;;;;;
> +11F16;KAWI LETTER NGA;Lo;0;L;;;;;N;;;;;
> +11F17;KAWI LETTER CA;Lo;0;L;;;;;N;;;;;
> +11F18;KAWI LETTER CHA;Lo;0;L;;;;;N;;;;;
> +11F19;KAWI LETTER JA;Lo;0;L;;;;;N;;;;;
> +11F1A;KAWI LETTER JHA;Lo;0;L;;;;;N;;;;;
> +11F1B;KAWI LETTER NYA;Lo;0;L;;;;;N;;;;;
> +11F1C;KAWI LETTER TTA;Lo;0;L;;;;;N;;;;;
> +11F1D;KAWI LETTER TTHA;Lo;0;L;;;;;N;;;;;
> +11F1E;KAWI LETTER DDA;Lo;0;L;;;;;N;;;;;
> +11F1F;KAWI LETTER DDHA;Lo;0;L;;;;;N;;;;;
> +11F20;KAWI LETTER NNA;Lo;0;L;;;;;N;;;;;
> +11F21;KAWI LETTER TA;Lo;0;L;;;;;N;;;;;
> +11F22;KAWI LETTER THA;Lo;0;L;;;;;N;;;;;
> +11F23;KAWI LETTER DA;Lo;0;L;;;;;N;;;;;
> +11F24;KAWI LETTER DHA;Lo;0;L;;;;;N;;;;;
> +11F25;KAWI LETTER NA;Lo;0;L;;;;;N;;;;;
> +11F26;KAWI LETTER PA;Lo;0;L;;;;;N;;;;;
> +11F27;KAWI LETTER PHA;Lo;0;L;;;;;N;;;;;
> +11F28;KAWI LETTER BA;Lo;0;L;;;;;N;;;;;
> +11F29;KAWI LETTER BHA;Lo;0;L;;;;;N;;;;;
> +11F2A;KAWI LETTER MA;Lo;0;L;;;;;N;;;;;
> +11F2B;KAWI LETTER YA;Lo;0;L;;;;;N;;;;;
> +11F2C;KAWI LETTER RA;Lo;0;L;;;;;N;;;;;
> +11F2D;KAWI LETTER LA;Lo;0;L;;;;;N;;;;;
> +11F2E;KAWI LETTER WA;Lo;0;L;;;;;N;;;;;
> +11F2F;KAWI LETTER SHA;Lo;0;L;;;;;N;;;;;
> +11F30;KAWI LETTER SSA;Lo;0;L;;;;;N;;;;;
> +11F31;KAWI LETTER SA;Lo;0;L;;;;;N;;;;;
> +11F32;KAWI LETTER HA;Lo;0;L;;;;;N;;;;;
> +11F33;KAWI LETTER JNYA;Lo;0;L;;;;;N;;;;;
> +11F34;KAWI VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
> +11F35;KAWI VOWEL SIGN ALTERNATE AA;Mc;0;L;;;;;N;;;;;
> +11F36;KAWI VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
> +11F37;KAWI VOWEL SIGN II;Mn;0;NSM;;;;;N;;;;;
> +11F38;KAWI VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
> +11F39;KAWI VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
> +11F3A;KAWI VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
> +11F3E;KAWI VOWEL SIGN E;Mc;0;L;;;;;N;;;;;
> +11F3F;KAWI VOWEL SIGN AI;Mc;0;L;;;;;N;;;;;
> +11F40;KAWI VOWEL SIGN EU;Mn;0;NSM;;;;;N;;;;;
> +11F41;KAWI SIGN KILLER;Mc;9;L;;;;;N;;;;;
> +11F42;KAWI CONJOINER;Mn;9;NSM;;;;;N;;;;;
> +11F43;KAWI DANDA;Po;0;L;;;;;N;;;;;
> +11F44;KAWI DOUBLE DANDA;Po;0;L;;;;;N;;;;;
> +11F45;KAWI PUNCTUATION SECTION MARKER;Po;0;L;;;;;N;;;;;
> +11F46;KAWI PUNCTUATION ALTERNATE SECTION MARKER;Po;0;L;;;;;N;;;;;
> +11F47;KAWI PUNCTUATION FLOWER;Po;0;L;;;;;N;;;;;
> +11F48;KAWI PUNCTUATION SPACE FILLER;Po;0;L;;;;;N;;;;;
> +11F49;KAWI PUNCTUATION DOT;Po;0;L;;;;;N;;;;;
> +11F4A;KAWI PUNCTUATION DOUBLE DOT;Po;0;L;;;;;N;;;;;
> +11F4B;KAWI PUNCTUATION TRIPLE DOT;Po;0;L;;;;;N;;;;;
> +11F4C;KAWI PUNCTUATION CIRCLE;Po;0;L;;;;;N;;;;;
> +11F4D;KAWI PUNCTUATION FILLED CIRCLE;Po;0;L;;;;;N;;;;;
> +11F4E;KAWI PUNCTUATION SPIRAL;Po;0;L;;;;;N;;;;;
> +11F4F;KAWI PUNCTUATION CLOSING SPIRAL;Po;0;L;;;;;N;;;;;
> +11F50;KAWI DIGIT ZERO;Nd;0;L;;0;0;0;N;;;;;
> +11F51;KAWI DIGIT ONE;Nd;0;L;;1;1;1;N;;;;;
> +11F52;KAWI DIGIT TWO;Nd;0;L;;2;2;2;N;;;;;
> +11F53;KAWI DIGIT THREE;Nd;0;L;;3;3;3;N;;;;;
> +11F54;KAWI DIGIT FOUR;Nd;0;L;;4;4;4;N;;;;;
> +11F55;KAWI DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;;
> +11F56;KAWI DIGIT SIX;Nd;0;L;;6;6;6;N;;;;;
> +11F57;KAWI DIGIT SEVEN;Nd;0;L;;7;7;7;N;;;;;
> +11F58;KAWI DIGIT EIGHT;Nd;0;L;;8;8;8;N;;;;;
> +11F59;KAWI DIGIT NINE;Nd;0;L;;9;9;9;N;;;;;
> 11FB0;LISU LETTER YHA;Lo;0;L;;;;;N;;;;;
> 11FC0;TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH;No;0;L;;;;1/320;N;;;;;
> 11FC1;TAMIL FRACTION ONE ONE-HUNDRED-AND-SIXTIETH;No;0;L;;;;1/160;N;;;;;
> @@ -24040,6 +24144,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1342C;EGYPTIAN HIEROGLYPH AA030;Lo;0;L;;;;;N;;;;;
> 1342D;EGYPTIAN HIEROGLYPH AA031;Lo;0;L;;;;;N;;;;;
> 1342E;EGYPTIAN HIEROGLYPH AA032;Lo;0;L;;;;;N;;;;;
> +1342F;EGYPTIAN HIEROGLYPH V011D;Lo;0;L;;;;;N;;;;;
> 13430;EGYPTIAN HIEROGLYPH VERTICAL JOINER;Cf;0;L;;;;;N;;;;;
> 13431;EGYPTIAN HIEROGLYPH HORIZONTAL JOINER;Cf;0;L;;;;;N;;;;;
> 13432;EGYPTIAN HIEROGLYPH INSERT AT TOP START;Cf;0;L;;;;;N;;;;;
> @@ -24049,6 +24154,35 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 13436;EGYPTIAN HIEROGLYPH OVERLAY MIDDLE;Cf;0;L;;;;;N;;;;;
> 13437;EGYPTIAN HIEROGLYPH BEGIN SEGMENT;Cf;0;L;;;;;N;;;;;
> 13438;EGYPTIAN HIEROGLYPH END SEGMENT;Cf;0;L;;;;;N;;;;;
> +13439;EGYPTIAN HIEROGLYPH INSERT AT MIDDLE;Cf;0;L;;;;;N;;;;;
> +1343A;EGYPTIAN HIEROGLYPH INSERT AT TOP;Cf;0;L;;;;;N;;;;;
> +1343B;EGYPTIAN HIEROGLYPH INSERT AT BOTTOM;Cf;0;L;;;;;N;;;;;
> +1343C;EGYPTIAN HIEROGLYPH BEGIN ENCLOSURE;Cf;0;L;;;;;N;;;;;
> +1343D;EGYPTIAN HIEROGLYPH END ENCLOSURE;Cf;0;L;;;;;N;;;;;
> +1343E;EGYPTIAN HIEROGLYPH BEGIN WALLED ENCLOSURE;Cf;0;L;;;;;N;;;;;
> +1343F;EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE;Cf;0;L;;;;;N;;;;;
> +13440;EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY;Mn;0;NSM;;;;;N;;;;;
> +13441;EGYPTIAN HIEROGLYPH FULL BLANK;Lo;0;L;;;;;N;;;;;
> +13442;EGYPTIAN HIEROGLYPH HALF BLANK;Lo;0;L;;;;;N;;;;;
> +13443;EGYPTIAN HIEROGLYPH LOST SIGN;Lo;0;L;;;;;N;;;;;
> +13444;EGYPTIAN HIEROGLYPH HALF LOST SIGN;Lo;0;L;;;;;N;;;;;
> +13445;EGYPTIAN HIEROGLYPH TALL LOST SIGN;Lo;0;L;;;;;N;;;;;
> +13446;EGYPTIAN HIEROGLYPH WIDE LOST SIGN;Lo;0;L;;;;;N;;;;;
> +13447;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START;Mn;0;NSM;;;;;N;;;;;
> +13448;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT BOTTOM START;Mn;0;NSM;;;;;N;;;;;
> +13449;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT START;Mn;0;NSM;;;;;N;;;;;
> +1344A;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP END;Mn;0;NSM;;;;;N;;;;;
> +1344B;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP;Mn;0;NSM;;;;;N;;;;;
> +1344C;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT BOTTOM START AND TOP END;Mn;0;NSM;;;;;N;;;;;
> +1344D;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT START AND TOP;Mn;0;NSM;;;;;N;;;;;
> +1344E;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT BOTTOM END;Mn;0;NSM;;;;;N;;;;;
> +1344F;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START AND BOTTOM END;Mn;0;NSM;;;;;N;;;;;
> +13450;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT BOTTOM;Mn;0;NSM;;;;;N;;;;;
> +13451;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT START AND BOTTOM;Mn;0;NSM;;;;;N;;;;;
> +13452;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT END;Mn;0;NSM;;;;;N;;;;;
> +13453;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP AND END;Mn;0;NSM;;;;;N;;;;;
> +13454;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT BOTTOM AND END;Mn;0;NSM;;;;;N;;;;;
> +13455;EGYPTIAN HIEROGLYPH MODIFIER DAMAGED;Mn;0;NSM;;;;;N;;;;;
> 14400;ANATOLIAN HIEROGLYPH A001;Lo;0;L;;;;;N;;;;;
> 14401;ANATOLIAN HIEROGLYPH A002;Lo;0;L;;;;;N;;;;;
> 14402;ANATOLIAN HIEROGLYPH A003;Lo;0;L;;;;;N;;;;;
> @@ -27289,9 +27423,11 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1B120;KATAKANA LETTER ARCHAIC YI;Lo;0;L;;;;;N;;;;;
> 1B121;KATAKANA LETTER ARCHAIC YE;Lo;0;L;;;;;N;;;;;
> 1B122;KATAKANA LETTER ARCHAIC WU;Lo;0;L;;;;;N;;;;;
> +1B132;HIRAGANA LETTER SMALL KO;Lo;0;L;;;;;N;;;;;
> 1B150;HIRAGANA LETTER SMALL WI;Lo;0;L;;;;;N;;;;;
> 1B151;HIRAGANA LETTER SMALL WE;Lo;0;L;;;;;N;;;;;
> 1B152;HIRAGANA LETTER SMALL WO;Lo;0;L;;;;;N;;;;;
> +1B155;KATAKANA LETTER SMALL KO;Lo;0;L;;;;;N;;;;;
> 1B164;KATAKANA LETTER SMALL WI;Lo;0;L;;;;;N;;;;;
> 1B165;KATAKANA LETTER SMALL WE;Lo;0;L;;;;;N;;;;;
> 1B166;KATAKANA LETTER SMALL WO;Lo;0;L;;;;;N;;;;;
> @@ -28573,6 +28709,26 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1D243;COMBINING GREEK MUSICAL TETRASEME;Mn;230;NSM;;;;;N;;;;;
> 1D244;COMBINING GREEK MUSICAL PENTASEME;Mn;230;NSM;;;;;N;;;;;
> 1D245;GREEK MUSICAL LEIMMA;So;0;ON;;;;;N;;;;;
> +1D2C0;KAKTOVIK NUMERAL ZERO;No;0;L;;;;0;N;;;;;
> +1D2C1;KAKTOVIK NUMERAL ONE;No;0;L;;;;1;N;;;;;
> +1D2C2;KAKTOVIK NUMERAL TWO;No;0;L;;;;2;N;;;;;
> +1D2C3;KAKTOVIK NUMERAL THREE;No;0;L;;;;3;N;;;;;
> +1D2C4;KAKTOVIK NUMERAL FOUR;No;0;L;;;;4;N;;;;;
> +1D2C5;KAKTOVIK NUMERAL FIVE;No;0;L;;;;5;N;;;;;
> +1D2C6;KAKTOVIK NUMERAL SIX;No;0;L;;;;6;N;;;;;
> +1D2C7;KAKTOVIK NUMERAL SEVEN;No;0;L;;;;7;N;;;;;
> +1D2C8;KAKTOVIK NUMERAL EIGHT;No;0;L;;;;8;N;;;;;
> +1D2C9;KAKTOVIK NUMERAL NINE;No;0;L;;;;9;N;;;;;
> +1D2CA;KAKTOVIK NUMERAL TEN;No;0;L;;;;10;N;;;;;
> +1D2CB;KAKTOVIK NUMERAL ELEVEN;No;0;L;;;;11;N;;;;;
> +1D2CC;KAKTOVIK NUMERAL TWELVE;No;0;L;;;;12;N;;;;;
> +1D2CD;KAKTOVIK NUMERAL THIRTEEN;No;0;L;;;;13;N;;;;;
> +1D2CE;KAKTOVIK NUMERAL FOURTEEN;No;0;L;;;;14;N;;;;;
> +1D2CF;KAKTOVIK NUMERAL FIFTEEN;No;0;L;;;;15;N;;;;;
> +1D2D0;KAKTOVIK NUMERAL SIXTEEN;No;0;L;;;;16;N;;;;;
> +1D2D1;KAKTOVIK NUMERAL SEVENTEEN;No;0;L;;;;17;N;;;;;
> +1D2D2;KAKTOVIK NUMERAL EIGHTEEN;No;0;L;;;;18;N;;;;;
> +1D2D3;KAKTOVIK NUMERAL NINETEEN;No;0;L;;;;19;N;;;;;
> 1D2E0;MAYAN NUMERAL ZERO;No;0;L;;;;0;N;;;;;
> 1D2E1;MAYAN NUMERAL ONE;No;0;L;;;;1;N;;;;;
> 1D2E2;MAYAN NUMERAL TWO;No;0;L;;;;2;N;;;;;
> @@ -30404,6 +30560,12 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1DF1C;LATIN SMALL LETTER TESH DIGRAPH WITH RETROFLEX HOOK;Ll;0;L;;;;;N;;;;;
> 1DF1D;LATIN SMALL LETTER C WITH RETROFLEX HOOK;Ll;0;L;;;;;N;;;;;
> 1DF1E;LATIN SMALL LETTER S WITH CURL;Ll;0;L;;;;;N;;;;;
> +1DF25;LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> +1DF26;LATIN SMALL LETTER L WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> +1DF27;LATIN SMALL LETTER N WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> +1DF28;LATIN SMALL LETTER R WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> +1DF29;LATIN SMALL LETTER S WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> +1DF2A;LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK;Ll;0;L;;;;;N;;;;;
> 1E000;COMBINING GLAGOLITIC LETTER AZU;Mn;230;NSM;;;;;N;;;;;
> 1E001;COMBINING GLAGOLITIC LETTER BUKY;Mn;230;NSM;;;;;N;;;;;
> 1E002;COMBINING GLAGOLITIC LETTER VEDE;Mn;230;NSM;;;;;N;;;;;
> @@ -30442,6 +30604,69 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1E028;COMBINING GLAGOLITIC LETTER BIG YUS;Mn;230;NSM;;;;;N;;;;;
> 1E029;COMBINING GLAGOLITIC LETTER IOTATED BIG YUS;Mn;230;NSM;;;;;N;;;;;
> 1E02A;COMBINING GLAGOLITIC LETTER FITA;Mn;230;NSM;;;;;N;;;;;
> +1E030;MODIFIER LETTER CYRILLIC SMALL A;Lm;0;L;<super> 0430;;;;N;;;;;
> +1E031;MODIFIER LETTER CYRILLIC SMALL BE;Lm;0;L;<super> 0431;;;;N;;;;;
> +1E032;MODIFIER LETTER CYRILLIC SMALL VE;Lm;0;L;<super> 0432;;;;N;;;;;
> +1E033;MODIFIER LETTER CYRILLIC SMALL GHE;Lm;0;L;<super> 0433;;;;N;;;;;
> +1E034;MODIFIER LETTER CYRILLIC SMALL DE;Lm;0;L;<super> 0434;;;;N;;;;;
> +1E035;MODIFIER LETTER CYRILLIC SMALL IE;Lm;0;L;<super> 0435;;;;N;;;;;
> +1E036;MODIFIER LETTER CYRILLIC SMALL ZHE;Lm;0;L;<super> 0436;;;;N;;;;;
> +1E037;MODIFIER LETTER CYRILLIC SMALL ZE;Lm;0;L;<super> 0437;;;;N;;;;;
> +1E038;MODIFIER LETTER CYRILLIC SMALL I;Lm;0;L;<super> 0438;;;;N;;;;;
> +1E039;MODIFIER LETTER CYRILLIC SMALL KA;Lm;0;L;<super> 043A;;;;N;;;;;
> +1E03A;MODIFIER LETTER CYRILLIC SMALL EL;Lm;0;L;<super> 043B;;;;N;;;;;
> +1E03B;MODIFIER LETTER CYRILLIC SMALL EM;Lm;0;L;<super> 043C;;;;N;;;;;
> +1E03C;MODIFIER LETTER CYRILLIC SMALL O;Lm;0;L;<super> 043E;;;;N;;;;;
> +1E03D;MODIFIER LETTER CYRILLIC SMALL PE;Lm;0;L;<super> 043F;;;;N;;;;;
> +1E03E;MODIFIER LETTER CYRILLIC SMALL ER;Lm;0;L;<super> 0440;;;;N;;;;;
> +1E03F;MODIFIER LETTER CYRILLIC SMALL ES;Lm;0;L;<super> 0441;;;;N;;;;;
> +1E040;MODIFIER LETTER CYRILLIC SMALL TE;Lm;0;L;<super> 0442;;;;N;;;;;
> +1E041;MODIFIER LETTER CYRILLIC SMALL U;Lm;0;L;<super> 0443;;;;N;;;;;
> +1E042;MODIFIER LETTER CYRILLIC SMALL EF;Lm;0;L;<super> 0444;;;;N;;;;;
> +1E043;MODIFIER LETTER CYRILLIC SMALL HA;Lm;0;L;<super> 0445;;;;N;;;;;
> +1E044;MODIFIER LETTER CYRILLIC SMALL TSE;Lm;0;L;<super> 0446;;;;N;;;;;
> +1E045;MODIFIER LETTER CYRILLIC SMALL CHE;Lm;0;L;<super> 0447;;;;N;;;;;
> +1E046;MODIFIER LETTER CYRILLIC SMALL SHA;Lm;0;L;<super> 0448;;;;N;;;;;
> +1E047;MODIFIER LETTER CYRILLIC SMALL YERU;Lm;0;L;<super> 044B;;;;N;;;;;
> +1E048;MODIFIER LETTER CYRILLIC SMALL E;Lm;0;L;<super> 044D;;;;N;;;;;
> +1E049;MODIFIER LETTER CYRILLIC SMALL YU;Lm;0;L;<super> 044E;;;;N;;;;;
> +1E04A;MODIFIER LETTER CYRILLIC SMALL DZZE;Lm;0;L;<super> A689;;;;N;;;;;
> +1E04B;MODIFIER LETTER CYRILLIC SMALL SCHWA;Lm;0;L;<super> 04D9;;;;N;;;;;
> +1E04C;MODIFIER LETTER CYRILLIC SMALL BYELORUSSIAN-UKRAINIAN I;Lm;0;L;<super> 0456;;;;N;;;;;
> +1E04D;MODIFIER LETTER CYRILLIC SMALL JE;Lm;0;L;<super> 0458;;;;N;;;;;
> +1E04E;MODIFIER LETTER CYRILLIC SMALL BARRED O;Lm;0;L;<super> 04E9;;;;N;;;;;
> +1E04F;MODIFIER LETTER CYRILLIC SMALL STRAIGHT U;Lm;0;L;<super> 04AF;;;;N;;;;;
> +1E050;MODIFIER LETTER CYRILLIC SMALL PALOCHKA;Lm;0;L;<super> 04CF;;;;N;;;;;
> +1E051;CYRILLIC SUBSCRIPT SMALL LETTER A;Lm;0;L;<sub> 0430;;;;N;;;;;
> +1E052;CYRILLIC SUBSCRIPT SMALL LETTER BE;Lm;0;L;<sub> 0431;;;;N;;;;;
> +1E053;CYRILLIC SUBSCRIPT SMALL LETTER VE;Lm;0;L;<sub> 0432;;;;N;;;;;
> +1E054;CYRILLIC SUBSCRIPT SMALL LETTER GHE;Lm;0;L;<sub> 0433;;;;N;;;;;
> +1E055;CYRILLIC SUBSCRIPT SMALL LETTER DE;Lm;0;L;<sub> 0434;;;;N;;;;;
> +1E056;CYRILLIC SUBSCRIPT SMALL LETTER IE;Lm;0;L;<sub> 0435;;;;N;;;;;
> +1E057;CYRILLIC SUBSCRIPT SMALL LETTER ZHE;Lm;0;L;<sub> 0436;;;;N;;;;;
> +1E058;CYRILLIC SUBSCRIPT SMALL LETTER ZE;Lm;0;L;<sub> 0437;;;;N;;;;;
> +1E059;CYRILLIC SUBSCRIPT SMALL LETTER I;Lm;0;L;<sub> 0438;;;;N;;;;;
> +1E05A;CYRILLIC SUBSCRIPT SMALL LETTER KA;Lm;0;L;<sub> 043A;;;;N;;;;;
> +1E05B;CYRILLIC SUBSCRIPT SMALL LETTER EL;Lm;0;L;<sub> 043B;;;;N;;;;;
> +1E05C;CYRILLIC SUBSCRIPT SMALL LETTER O;Lm;0;L;<sub> 043E;;;;N;;;;;
> +1E05D;CYRILLIC SUBSCRIPT SMALL LETTER PE;Lm;0;L;<sub> 043F;;;;N;;;;;
> +1E05E;CYRILLIC SUBSCRIPT SMALL LETTER ES;Lm;0;L;<sub> 0441;;;;N;;;;;
> +1E05F;CYRILLIC SUBSCRIPT SMALL LETTER U;Lm;0;L;<sub> 0443;;;;N;;;;;
> +1E060;CYRILLIC SUBSCRIPT SMALL LETTER EF;Lm;0;L;<sub> 0444;;;;N;;;;;
> +1E061;CYRILLIC SUBSCRIPT SMALL LETTER HA;Lm;0;L;<sub> 0445;;;;N;;;;;
> +1E062;CYRILLIC SUBSCRIPT SMALL LETTER TSE;Lm;0;L;<sub> 0446;;;;N;;;;;
> +1E063;CYRILLIC SUBSCRIPT SMALL LETTER CHE;Lm;0;L;<sub> 0447;;;;N;;;;;
> +1E064;CYRILLIC SUBSCRIPT SMALL LETTER SHA;Lm;0;L;<sub> 0448;;;;N;;;;;
> +1E065;CYRILLIC SUBSCRIPT SMALL LETTER HARD SIGN;Lm;0;L;<sub> 044A;;;;N;;;;;
> +1E066;CYRILLIC SUBSCRIPT SMALL LETTER YERU;Lm;0;L;<sub> 044B;;;;N;;;;;
> +1E067;CYRILLIC SUBSCRIPT SMALL LETTER GHE WITH UPTURN;Lm;0;L;<sub> 0491;;;;N;;;;;
> +1E068;CYRILLIC SUBSCRIPT SMALL LETTER BYELORUSSIAN-UKRAINIAN I;Lm;0;L;<sub> 0456;;;;N;;;;;
> +1E069;CYRILLIC SUBSCRIPT SMALL LETTER DZE;Lm;0;L;<sub> 0455;;;;N;;;;;
> +1E06A;CYRILLIC SUBSCRIPT SMALL LETTER DZHE;Lm;0;L;<sub> 045F;;;;N;;;;;
> +1E06B;MODIFIER LETTER CYRILLIC SMALL ES WITH DESCENDER;Lm;0;L;<super> 04AB;;;;N;;;;;
> +1E06C;MODIFIER LETTER CYRILLIC SMALL YERU WITH BACK YER;Lm;0;L;<super> A651;;;;N;;;;;
> +1E06D;MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE;Lm;0;L;<super> 04B1;;;;N;;;;;
> +1E08F;COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I;Mn;230;NSM;;;;;N;;;;;
> 1E100;NYIAKENG PUACHUE HMONG LETTER MA;Lo;0;L;;;;;N;;;;;
> 1E101;NYIAKENG PUACHUE HMONG LETTER TSA;Lo;0;L;;;;;N;;;;;
> 1E102;NYIAKENG PUACHUE HMONG LETTER NTA;Lo;0;L;;;;;N;;;;;
> @@ -30603,6 +30828,48 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1E2F8;WANCHO DIGIT EIGHT;Nd;0;L;;8;8;8;N;;;;;
> 1E2F9;WANCHO DIGIT NINE;Nd;0;L;;9;9;9;N;;;;;
> 1E2FF;WANCHO NGUN SIGN;Sc;0;ET;;;;;N;;;;;
> +1E4D0;NAG MUNDARI LETTER O;Lo;0;L;;;;;N;;;;;
> +1E4D1;NAG MUNDARI LETTER OP;Lo;0;L;;;;;N;;;;;
> +1E4D2;NAG MUNDARI LETTER OL;Lo;0;L;;;;;N;;;;;
> +1E4D3;NAG MUNDARI LETTER OY;Lo;0;L;;;;;N;;;;;
> +1E4D4;NAG MUNDARI LETTER ONG;Lo;0;L;;;;;N;;;;;
> +1E4D5;NAG MUNDARI LETTER A;Lo;0;L;;;;;N;;;;;
> +1E4D6;NAG MUNDARI LETTER AJ;Lo;0;L;;;;;N;;;;;
> +1E4D7;NAG MUNDARI LETTER AB;Lo;0;L;;;;;N;;;;;
> +1E4D8;NAG MUNDARI LETTER ANY;Lo;0;L;;;;;N;;;;;
> +1E4D9;NAG MUNDARI LETTER AH;Lo;0;L;;;;;N;;;;;
> +1E4DA;NAG MUNDARI LETTER I;Lo;0;L;;;;;N;;;;;
> +1E4DB;NAG MUNDARI LETTER IS;Lo;0;L;;;;;N;;;;;
> +1E4DC;NAG MUNDARI LETTER IDD;Lo;0;L;;;;;N;;;;;
> +1E4DD;NAG MUNDARI LETTER IT;Lo;0;L;;;;;N;;;;;
> +1E4DE;NAG MUNDARI LETTER IH;Lo;0;L;;;;;N;;;;;
> +1E4DF;NAG MUNDARI LETTER U;Lo;0;L;;;;;N;;;;;
> +1E4E0;NAG MUNDARI LETTER UC;Lo;0;L;;;;;N;;;;;
> +1E4E1;NAG MUNDARI LETTER UD;Lo;0;L;;;;;N;;;;;
> +1E4E2;NAG MUNDARI LETTER UK;Lo;0;L;;;;;N;;;;;
> +1E4E3;NAG MUNDARI LETTER UR;Lo;0;L;;;;;N;;;;;
> +1E4E4;NAG MUNDARI LETTER E;Lo;0;L;;;;;N;;;;;
> +1E4E5;NAG MUNDARI LETTER ENN;Lo;0;L;;;;;N;;;;;
> +1E4E6;NAG MUNDARI LETTER EG;Lo;0;L;;;;;N;;;;;
> +1E4E7;NAG MUNDARI LETTER EM;Lo;0;L;;;;;N;;;;;
> +1E4E8;NAG MUNDARI LETTER EN;Lo;0;L;;;;;N;;;;;
> +1E4E9;NAG MUNDARI LETTER ETT;Lo;0;L;;;;;N;;;;;
> +1E4EA;NAG MUNDARI LETTER ELL;Lo;0;L;;;;;N;;;;;
> +1E4EB;NAG MUNDARI SIGN OJOD;Lm;0;L;;;;;N;;;;;
> +1E4EC;NAG MUNDARI SIGN MUHOR;Mn;232;NSM;;;;;N;;;;;
> +1E4ED;NAG MUNDARI SIGN TOYOR;Mn;232;NSM;;;;;N;;;;;
> +1E4EE;NAG MUNDARI SIGN IKIR;Mn;220;NSM;;;;;N;;;;;
> +1E4EF;NAG MUNDARI SIGN SUTUH;Mn;230;NSM;;;;;N;;;;;
> +1E4F0;NAG MUNDARI DIGIT ZERO;Nd;0;L;;0;0;0;N;;;;;
> +1E4F1;NAG MUNDARI DIGIT ONE;Nd;0;L;;1;1;1;N;;;;;
> +1E4F2;NAG MUNDARI DIGIT TWO;Nd;0;L;;2;2;2;N;;;;;
> +1E4F3;NAG MUNDARI DIGIT THREE;Nd;0;L;;3;3;3;N;;;;;
> +1E4F4;NAG MUNDARI DIGIT FOUR;Nd;0;L;;4;4;4;N;;;;;
> +1E4F5;NAG MUNDARI DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;;
> +1E4F6;NAG MUNDARI DIGIT SIX;Nd;0;L;;6;6;6;N;;;;;
> +1E4F7;NAG MUNDARI DIGIT SEVEN;Nd;0;L;;7;7;7;N;;;;;
> +1E4F8;NAG MUNDARI DIGIT EIGHT;Nd;0;L;;8;8;8;N;;;;;
> +1E4F9;NAG MUNDARI DIGIT NINE;Nd;0;L;;9;9;9;N;;;;;
> 1E7E0;ETHIOPIC SYLLABLE HHYA;Lo;0;L;;;;;N;;;;;
> 1E7E1;ETHIOPIC SYLLABLE HHYU;Lo;0;L;;;;;N;;;;;
> 1E7E2;ETHIOPIC SYLLABLE HHYI;Lo;0;L;;;;;N;;;;;
> @@ -32678,6 +32945,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1F6D5;HINDU TEMPLE;So;0;ON;;;;;N;;;;;
> 1F6D6;HUT;So;0;ON;;;;;N;;;;;
> 1F6D7;ELEVATOR;So;0;ON;;;;;N;;;;;
> +1F6DC;WIRELESS;So;0;ON;;;;;N;;;;;
> 1F6DD;PLAYGROUND SLIDE;So;0;ON;;;;;N;;;;;
> 1F6DE;WHEEL;So;0;ON;;;;;N;;;;;
> 1F6DF;RING BUOY;So;0;ON;;;;;N;;;;;
> @@ -32823,6 +33091,14 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1F771;ALCHEMICAL SYMBOL FOR MONTH;So;0;ON;;;;;N;;;;;
> 1F772;ALCHEMICAL SYMBOL FOR HALF DRAM;So;0;ON;;;;;N;;;;;
> 1F773;ALCHEMICAL SYMBOL FOR HALF OUNCE;So;0;ON;;;;;N;;;;;
> +1F774;LOT OF FORTUNE;So;0;ON;;;;;N;;;;;
> +1F775;OCCULTATION;So;0;ON;;;;;N;;;;;
> +1F776;LUNAR ECLIPSE;So;0;ON;;;;;N;;;;;
> +1F77B;HAUMEA;So;0;ON;;;;;N;;;;;
> +1F77C;MAKEMAKE;So;0;ON;;;;;N;;;;;
> +1F77D;GONGGONG;So;0;ON;;;;;N;;;;;
> +1F77E;QUAOAR;So;0;ON;;;;;N;;;;;
> +1F77F;ORCUS;So;0;ON;;;;;N;;;;;
> 1F780;BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE;So;0;ON;;;;;N;;;;;
> 1F781;BLACK UP-POINTING ISOSCELES RIGHT TRIANGLE;So;0;ON;;;;;N;;;;;
> 1F782;BLACK RIGHT-POINTING ISOSCELES RIGHT TRIANGLE;So;0;ON;;;;;N;;;;;
> @@ -32912,6 +33188,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1F7D6;NEGATIVE CIRCLED TRIANGLE;So;0;ON;;;;;N;;;;;
> 1F7D7;CIRCLED SQUARE;So;0;ON;;;;;N;;;;;
> 1F7D8;NEGATIVE CIRCLED SQUARE;So;0;ON;;;;;N;;;;;
> +1F7D9;NINE POINTED WHITE STAR;So;0;ON;;;;;N;;;;;
> 1F7E0;LARGE ORANGE CIRCLE;So;0;ON;;;;;N;;;;;
> 1F7E1;LARGE YELLOW CIRCLE;So;0;ON;;;;;N;;;;;
> 1F7E2;LARGE GREEN CIRCLE;So;0;ON;;;;;N;;;;;
> @@ -33434,6 +33711,9 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FA72;BRIEFS;So;0;ON;;;;;N;;;;;
> 1FA73;SHORTS;So;0;ON;;;;;N;;;;;
> 1FA74;THONG SANDAL;So;0;ON;;;;;N;;;;;
> +1FA75;LIGHT BLUE HEART;So;0;ON;;;;;N;;;;;
> +1FA76;GREY HEART;So;0;ON;;;;;N;;;;;
> +1FA77;PINK HEART;So;0;ON;;;;;N;;;;;
> 1FA78;DROP OF BLOOD;So;0;ON;;;;;N;;;;;
> 1FA79;ADHESIVE BANDAGE;So;0;ON;;;;;N;;;;;
> 1FA7A;STETHOSCOPE;So;0;ON;;;;;N;;;;;
> @@ -33446,6 +33726,8 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FA84;MAGIC WAND;So;0;ON;;;;;N;;;;;
> 1FA85;PINATA;So;0;ON;;;;;N;;;;;
> 1FA86;NESTING DOLLS;So;0;ON;;;;;N;;;;;
> +1FA87;MARACAS;So;0;ON;;;;;N;;;;;
> +1FA88;FLUTE;So;0;ON;;;;;N;;;;;
> 1FA90;RINGED PLANET;So;0;ON;;;;;N;;;;;
> 1FA91;CHAIR;So;0;ON;;;;;N;;;;;
> 1FA92;RAZOR;So;0;ON;;;;;N;;;;;
> @@ -33475,6 +33757,9 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FAAA;IDENTIFICATION CARD;So;0;ON;;;;;N;;;;;
> 1FAAB;LOW BATTERY;So;0;ON;;;;;N;;;;;
> 1FAAC;HAMSA;So;0;ON;;;;;N;;;;;
> +1FAAD;FOLDING HAND FAN;So;0;ON;;;;;N;;;;;
> +1FAAE;HAIR PICK;So;0;ON;;;;;N;;;;;
> +1FAAF;KHANDA;So;0;ON;;;;;N;;;;;
> 1FAB0;FLY;So;0;ON;;;;;N;;;;;
> 1FAB1;WORM;So;0;ON;;;;;N;;;;;
> 1FAB2;BEETLE;So;0;ON;;;;;N;;;;;
> @@ -33486,12 +33771,18 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FAB8;CORAL;So;0;ON;;;;;N;;;;;
> 1FAB9;EMPTY NEST;So;0;ON;;;;;N;;;;;
> 1FABA;NEST WITH EGGS;So;0;ON;;;;;N;;;;;
> +1FABB;HYACINTH;So;0;ON;;;;;N;;;;;
> +1FABC;JELLYFISH;So;0;ON;;;;;N;;;;;
> +1FABD;WING;So;0;ON;;;;;N;;;;;
> +1FABF;GOOSE;So;0;ON;;;;;N;;;;;
> 1FAC0;ANATOMICAL HEART;So;0;ON;;;;;N;;;;;
> 1FAC1;LUNGS;So;0;ON;;;;;N;;;;;
> 1FAC2;PEOPLE HUGGING;So;0;ON;;;;;N;;;;;
> 1FAC3;PREGNANT MAN;So;0;ON;;;;;N;;;;;
> 1FAC4;PREGNANT PERSON;So;0;ON;;;;;N;;;;;
> 1FAC5;PERSON WITH CROWN;So;0;ON;;;;;N;;;;;
> +1FACE;MOOSE;So;0;ON;;;;;N;;;;;
> +1FACF;DONKEY;So;0;ON;;;;;N;;;;;
> 1FAD0;BLUEBERRIES;So;0;ON;;;;;N;;;;;
> 1FAD1;BELL PEPPER;So;0;ON;;;;;N;;;;;
> 1FAD2;OLIVE;So;0;ON;;;;;N;;;;;
> @@ -33502,6 +33793,8 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FAD7;POURING LIQUID;So;0;ON;;;;;N;;;;;
> 1FAD8;BEANS;So;0;ON;;;;;N;;;;;
> 1FAD9;JAR;So;0;ON;;;;;N;;;;;
> +1FADA;GINGER ROOT;So;0;ON;;;;;N;;;;;
> +1FADB;PEA POD;So;0;ON;;;;;N;;;;;
> 1FAE0;MELTING FACE;So;0;ON;;;;;N;;;;;
> 1FAE1;SALUTING FACE;So;0;ON;;;;;N;;;;;
> 1FAE2;FACE WITH OPEN EYES AND HAND OVER MOUTH;So;0;ON;;;;;N;;;;;
> @@ -33510,6 +33803,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FAE5;DOTTED LINE FACE;So;0;ON;;;;;N;;;;;
> 1FAE6;BITING LIP;So;0;ON;;;;;N;;;;;
> 1FAE7;BUBBLES;So;0;ON;;;;;N;;;;;
> +1FAE8;SHAKING FACE;So;0;ON;;;;;N;;;;;
> 1FAF0;HAND WITH INDEX FINGER AND THUMB CROSSED;So;0;ON;;;;;N;;;;;
> 1FAF1;RIGHTWARDS HAND;So;0;ON;;;;;N;;;;;
> 1FAF2;LEFTWARDS HAND;So;0;ON;;;;;N;;;;;
> @@ -33517,6 +33811,8 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 1FAF4;PALM UP HAND;So;0;ON;;;;;N;;;;;
> 1FAF5;INDEX POINTING AT THE VIEWER;So;0;ON;;;;;N;;;;;
> 1FAF6;HEART HANDS;So;0;ON;;;;;N;;;;;
> +1FAF7;LEFTWARDS PUSHING HAND;So;0;ON;;;;;N;;;;;
> +1FAF8;RIGHTWARDS PUSHING HAND;So;0;ON;;;;;N;;;;;
> 1FB00;BLOCK SEXTANT-1;So;0;ON;;;;;N;;;;;
> 1FB01;BLOCK SEXTANT-2;So;0;ON;;;;;N;;;;;
> 1FB02;BLOCK SEXTANT-12;So;0;ON;;;;;N;;;;;
> @@ -33732,7 +34028,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 20000;<CJK Ideograph Extension B, First>;Lo;0;L;;;;;N;;;;;
> 2A6DF;<CJK Ideograph Extension B, Last>;Lo;0;L;;;;;N;;;;;
> 2A700;<CJK Ideograph Extension C, First>;Lo;0;L;;;;;N;;;;;
> -2B738;<CJK Ideograph Extension C, Last>;Lo;0;L;;;;;N;;;;;
> +2B739;<CJK Ideograph Extension C, Last>;Lo;0;L;;;;;N;;;;;
> 2B740;<CJK Ideograph Extension D, First>;Lo;0;L;;;;;N;;;;;
> 2B81D;<CJK Ideograph Extension D, Last>;Lo;0;L;;;;;N;;;;;
> 2B820;<CJK Ideograph Extension E, First>;Lo;0;L;;;;;N;;;;;
> @@ -34283,6 +34579,8 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
> 2FA1D;CJK COMPATIBILITY IDEOGRAPH-2FA1D;Lo;0;L;2A600;;;;N;;;;;
> 30000;<CJK Ideograph Extension G, First>;Lo;0;L;;;;;N;;;;;
> 3134A;<CJK Ideograph Extension G, Last>;Lo;0;L;;;;;N;;;;;
> +31350;<CJK Ideograph Extension H, First>;Lo;0;L;;;;;N;;;;;
> +323AF;<CJK Ideograph Extension H, Last>;Lo;0;L;;;;;N;;;;;
> E0001;LANGUAGE TAG;Cf;0;BN;;;;;N;;;;;
> E0020;TAG SPACE;Cf;0;BN;;;;;N;;;;;
> E0021;TAG EXCLAMATION MARK;Cf;0;BN;;;;;N;;;;;
> diff --git a/data/WordBreakProperty.txt b/data/WordBreakProperty.txt
> index 73cd069..6f868d2 100644
> --- a/data/WordBreakProperty.txt
> +++ b/data/WordBreakProperty.txt
> @@ -1,11 +1,11 @@
> -# WordBreakProperty-14.0.0.txt
> -# Date: 2021-07-10, 00:35:32 GMT
> -# © 2021 Unicode®, Inc.
> +# WordBreakProperty-15.0.0.txt
> +# Date: 2022-04-27, 02:41:26 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
>
> # ================================================
>
> @@ -180,6 +180,7 @@ FB46..FB4F ; Hebrew_Letter # Lo [10] HEBREW LETTER TSADI WITH DAGESH..HEBREW
> 0CCC..0CCD ; Extend # Mn [2] KANNADA VOWEL SIGN AU..KANNADA SIGN VIRAMA
> 0CD5..0CD6 ; Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
> 0CE2..0CE3 ; Extend # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> +0CF3 ; Extend # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> 0D00..0D01 ; Extend # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> 0D02..0D03 ; Extend # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> 0D3B..0D3C ; Extend # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
> @@ -203,7 +204,7 @@ FB46..FB4F ; Hebrew_Letter # Lo [10] HEBREW LETTER TSADI WITH DAGESH..HEBREW
> 0E47..0E4E ; Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
> 0EB1 ; Extend # Mn LAO VOWEL SIGN MAI KAN
> 0EB4..0EBC ; Extend # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
> -0EC8..0ECD ; Extend # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
> +0EC8..0ECE ; Extend # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
> 0F18..0F19 ; Extend # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> 0F35 ; Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> 0F37 ; Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> @@ -407,6 +408,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 10AE5..10AE6 ; Extend # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> 10D24..10D27 ; Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> 10EAB..10EAC ; Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> +10EFD..10EFF ; Extend # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> 10F46..10F50 ; Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> 10F82..10F85 ; Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> 11000 ; Extend # Mc BRAHMI SIGN CANDRABINDU
> @@ -443,6 +445,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11235 ; Extend # Mc KHOJKI SIGN VIRAMA
> 11236..11237 ; Extend # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
> 1123E ; Extend # Mn KHOJKI SIGN SUKUN
> +11241 ; Extend # Mn KHOJKI VOWEL SIGN VOCALIC R
> 112DF ; Extend # Mn KHUDAWADI SIGN ANUSVARA
> 112E0..112E2 ; Extend # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
> 112E3..112EA ; Extend # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
> @@ -552,6 +555,16 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 11D97 ; Extend # Mn GUNJALA GONDI VIRAMA
> 11EF3..11EF4 ; Extend # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> 11EF5..11EF6 ; Extend # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> +11F00..11F01 ; Extend # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> +11F03 ; Extend # Mc KAWI SIGN VISARGA
> +11F34..11F35 ; Extend # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> +11F36..11F3A ; Extend # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> +11F3E..11F3F ; Extend # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> +11F40 ; Extend # Mn KAWI VOWEL SIGN EU
> +11F41 ; Extend # Mc KAWI SIGN KILLER
> +11F42 ; Extend # Mn KAWI CONJOINER
> +13440 ; Extend # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
> +13447..13455 ; Extend # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> 16AF0..16AF4 ; Extend # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> 16B30..16B36 ; Extend # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> 16F4F ; Extend # Mn MIAO SIGN CONSONANT MODIFIER BAR
> @@ -580,16 +593,18 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
> 1E01B..1E021 ; Extend # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> 1E023..1E024 ; Extend # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> 1E026..1E02A ; Extend # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> +1E08F ; Extend # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> 1E130..1E136 ; Extend # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> 1E2AE ; Extend # Mn TOTO SIGN RISING TONE
> 1E2EC..1E2EF ; Extend # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> +1E4EC..1E4EF ; Extend # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
> 1E8D0..1E8D6 ; Extend # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> 1E944..1E94A ; Extend # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
> 1F3FB..1F3FF ; Extend # Sk [5] EMOJI MODIFIER FITZPATRICK TYPE-1-2..EMOJI MODIFIER FITZPATRICK TYPE-6
> E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
> E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
>
> -# Total code points: 2512
> +# Total code points: 2554
>
> # ================================================
>
> @@ -615,12 +630,12 @@ FEFF ; Format # Cf ZERO WIDTH NO-BREAK SPACE
> FFF9..FFFB ; Format # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
> 110BD ; Format # Cf KAITHI NUMBER SIGN
> 110CD ; Format # Cf KAITHI NUMBER SIGN ABOVE
> -13430..13438 ; Format # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
> +13430..1343F ; Format # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
> 1BCA0..1BCA3 ; Format # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
> 1D173..1D17A ; Format # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
> E0001 ; Format # Cf LANGUAGE TAG
>
> -# Total code points: 64
> +# Total code points: 71
>
> # ================================================
>
> @@ -641,9 +656,10 @@ FF71..FF9D ; Katakana # Lo [45] HALFWIDTH KATAKANA LETTER A..HALFWIDTH KATAK
> 1AFFD..1AFFE ; Katakana # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> 1B000 ; Katakana # Lo KATAKANA LETTER ARCHAIC E
> 1B120..1B122 ; Katakana # Lo [3] KATAKANA LETTER ARCHAIC YI..KATAKANA LETTER ARCHAIC WU
> +1B155 ; Katakana # Lo KATAKANA LETTER SMALL KO
> 1B164..1B167 ; Katakana # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
>
> -# Total code points: 330
> +# Total code points: 331
>
> # ================================================
>
> @@ -1127,6 +1143,7 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 111DC ; ALetter # Lo SHARADA HEADSTROKE
> 11200..11211 ; ALetter # Lo [18] KHOJKI LETTER A..KHOJKI LETTER JJA
> 11213..1122B ; ALetter # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
> +1123F..11240 ; ALetter # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
> 11280..11286 ; ALetter # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
> 11288 ; ALetter # Lo MULTANI LETTER GHA
> 1128A..1128D ; ALetter # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
> @@ -1187,12 +1204,16 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 11D6A..11D89 ; ALetter # Lo [32] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER SA
> 11D98 ; ALetter # Lo GUNJALA GONDI OM
> 11EE0..11EF2 ; ALetter # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
> +11F02 ; ALetter # Lo KAWI SIGN REPHA
> +11F04..11F10 ; ALetter # Lo [13] KAWI LETTER A..KAWI LETTER O
> +11F12..11F33 ; ALetter # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
> 11FB0 ; ALetter # Lo LISU LETTER YHA
> 12000..12399 ; ALetter # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
> 12400..1246E ; ALetter # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
> 12480..12543 ; ALetter # Lo [196] CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM SIGN ZU5 TIMES THREE DISH TENU
> 12F90..12FF0 ; ALetter # Lo [97] CYPRO-MINOAN SIGN CM001..CYPRO-MINOAN SIGN CM114
> -13000..1342E ; ALetter # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
> +13000..1342F ; ALetter # Lo [1072] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH V011D
> +13441..13446 ; ALetter # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
> 14400..14646 ; ALetter # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
> 16800..16A38 ; ALetter # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
> 16A40..16A5E ; ALetter # Lo [31] MRO LETTER TA..MRO LETTER TEK
> @@ -1245,11 +1266,15 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1DF00..1DF09 ; ALetter # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
> 1DF0A ; ALetter # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
> 1DF0B..1DF1E ; ALetter # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
> +1DF25..1DF2A ; ALetter # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
> +1E030..1E06D ; ALetter # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> 1E100..1E12C ; ALetter # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
> 1E137..1E13D ; ALetter # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> 1E14E ; ALetter # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
> 1E290..1E2AD ; ALetter # Lo [30] TOTO LETTER PA..TOTO LETTER A
> 1E2C0..1E2EB ; ALetter # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
> +1E4D0..1E4EA ; ALetter # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
> +1E4EB ; ALetter # Lm NAG MUNDARI SIGN OJOD
> 1E7E0..1E7E6 ; ALetter # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
> 1E7E8..1E7EB ; ALetter # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
> 1E7ED..1E7EE ; ALetter # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
> @@ -1294,7 +1319,7 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
> 1F150..1F169 ; ALetter # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
> 1F170..1F189 ; ALetter # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
>
> -# Total code points: 29336
> +# Total code points: 29489
>
> # ================================================
>
> @@ -1398,16 +1423,18 @@ FF10..FF19 ; Numeric # Nd [10] FULLWIDTH DIGIT ZERO..FULLWIDTH DIGIT NINE
> 11C50..11C59 ; Numeric # Nd [10] BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE
> 11D50..11D59 ; Numeric # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
> 11DA0..11DA9 ; Numeric # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
> +11F50..11F59 ; Numeric # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
> 16A60..16A69 ; Numeric # Nd [10] MRO DIGIT ZERO..MRO DIGIT NINE
> 16AC0..16AC9 ; Numeric # Nd [10] TANGSA DIGIT ZERO..TANGSA DIGIT NINE
> 16B50..16B59 ; Numeric # Nd [10] PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT NINE
> 1D7CE..1D7FF ; Numeric # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
> 1E140..1E149 ; Numeric # Nd [10] NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG PUACHUE HMONG DIGIT NINE
> 1E2F0..1E2F9 ; Numeric # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
> +1E4F0..1E4F9 ; Numeric # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
> 1E950..1E959 ; Numeric # Nd [10] ADLAM DIGIT ZERO..ADLAM DIGIT NINE
> 1FBF0..1FBF9 ; Numeric # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
>
> -# Total code points: 661
> +# Total code points: 681
>
> # ================================================
>
> diff --git a/data/WordBreakTest.txt b/data/WordBreakTest.txt
> index 1d1435b..27f64bf 100644
> --- a/data/WordBreakTest.txt
> +++ b/data/WordBreakTest.txt
> @@ -1,11 +1,11 @@
> -# WordBreakTest-14.0.0.txt
> -# Date: 2021-03-08, 06:22:40 GMT
> -# © 2021 Unicode®, Inc.
> +# WordBreakTest-15.0.0.txt
> +# Date: 2022-02-26, 00:39:00 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Unicode Character Database
> -# For documentation, see http://www.unicode.org/reports/tr44/
> +# For documentation, see https://www.unicode.org/reports/tr44/
> #
> # Default Word_Break Test
> #
> diff --git a/data/emoji-data.txt b/data/emoji-data.txt
> index 7806c7a..999a436 100644
> --- a/data/emoji-data.txt
> +++ b/data/emoji-data.txt
> @@ -1,13 +1,13 @@
> -# emoji-data-14.0.0.txt
> -# Date: 2021-08-26, 17:22:22 GMT
> -# © 2021 Unicode®, Inc.
> +# emoji-data.txt
> +# Date: 2022-08-02, 00:26:10 GMT
> +# © 2022 Unicode®, Inc.
> # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see http://www.unicode.org/terms_of_use.html
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> #
> # Emoji Data for UTS #51
> -# Used with Emoji Version 14.0 and subsequent minor revisions (if any)
> +# Used with Emoji Version 15.0 and subsequent minor revisions (if any)
> #
> -# For documentation and usage, see http://www.unicode.org/reports/tr51
> +# For documentation and usage, see https://www.unicode.org/reports/tr51
> #
> # Format:
> # <codepoint(s)> ; <property> # <comments>
> @@ -19,8 +19,7 @@
>
> # ================================================
>
> -# All omitted code points have Emoji=No
> -# @missing: 0000..10FFFF ; Emoji ; No
> +# All omitted code points have Emoji=No
>
> 0023 ; Emoji # E0.0 [1] (#️) hash sign
> 002A ; Emoji # E0.0 [1] (*️) asterisk
> @@ -341,6 +340,7 @@
> 1F6D1..1F6D2 ; Emoji # E3.0 [2] (🛑..🛒) stop sign..shopping cart
> 1F6D5 ; Emoji # E12.0 [1] (🛕) hindu temple
> 1F6D6..1F6D7 ; Emoji # E13.0 [2] (🛖..🛗) hut..elevator
> +1F6DC ; Emoji # E15.0 [1] (🛜) wireless
> 1F6DD..1F6DF ; Emoji # E14.0 [3] (🛝..🛟) playground slide..ring buoy
> 1F6E0..1F6E5 ; Emoji # E0.7 [6] (🛠️..🛥️) hammer and wrench..motor boat
> 1F6E9 ; Emoji # E0.7 [1] (🛩️) small airplane
> @@ -401,28 +401,36 @@
> 1F9E7..1F9FF ; Emoji # E11.0 [25] (🧧..🧿) red envelope..nazar amulet
> 1FA70..1FA73 ; Emoji # E12.0 [4] (🩰..🩳) ballet shoes..shorts
> 1FA74 ; Emoji # E13.0 [1] (🩴) thong sandal
> +1FA75..1FA77 ; Emoji # E15.0 [3] (🩵..🩷) light blue heart..pink heart
> 1FA78..1FA7A ; Emoji # E12.0 [3] (🩸..🩺) drop of blood..stethoscope
> 1FA7B..1FA7C ; Emoji # E14.0 [2] (🩻..🩼) x-ray..crutch
> 1FA80..1FA82 ; Emoji # E12.0 [3] (🪀..🪂) yo-yo..parachute
> 1FA83..1FA86 ; Emoji # E13.0 [4] (🪃..🪆) boomerang..nesting dolls
> +1FA87..1FA88 ; Emoji # E15.0 [2] (🪇..🪈) maracas..flute
> 1FA90..1FA95 ; Emoji # E12.0 [6] (🪐..🪕) ringed planet..banjo
> 1FA96..1FAA8 ; Emoji # E13.0 [19] (🪖..🪨) military helmet..rock
> 1FAA9..1FAAC ; Emoji # E14.0 [4] (🪩..🪬) mirror ball..hamsa
> +1FAAD..1FAAF ; Emoji # E15.0 [3] (🪭..🪯) folding hand fan..khanda
> 1FAB0..1FAB6 ; Emoji # E13.0 [7] (🪰..🪶) fly..feather
> 1FAB7..1FABA ; Emoji # E14.0 [4] (🪷..🪺) lotus..nest with eggs
> +1FABB..1FABD ; Emoji # E15.0 [3] (🪻..🪽) hyacinth..wing
> +1FABF ; Emoji # E15.0 [1] (🪿) goose
> 1FAC0..1FAC2 ; Emoji # E13.0 [3] (🫀..🫂) anatomical heart..people hugging
> 1FAC3..1FAC5 ; Emoji # E14.0 [3] (🫃..🫅) pregnant man..person with crown
> +1FACE..1FACF ; Emoji # E15.0 [2] (🫎..🫏) moose..donkey
> 1FAD0..1FAD6 ; Emoji # E13.0 [7] (🫐..🫖) blueberries..teapot
> 1FAD7..1FAD9 ; Emoji # E14.0 [3] (🫗..🫙) pouring liquid..jar
> +1FADA..1FADB ; Emoji # E15.0 [2] (🫚..🫛) ginger root..pea pod
> 1FAE0..1FAE7 ; Emoji # E14.0 [8] (🫠..🫧) melting face..bubbles
> +1FAE8 ; Emoji # E15.0 [1] (🫨) shaking face
> 1FAF0..1FAF6 ; Emoji # E14.0 [7] (🫰..🫶) hand with index finger and thumb crossed..heart hands
> +1FAF7..1FAF8 ; Emoji # E15.0 [2] (🫷..🫸) leftwards pushing hand..rightwards pushing hand
>
> -# Total elements: 1404
> +# Total elements: 1424
>
> # ================================================
>
> -# All omitted code points have Emoji_Presentation=No
> -# @missing: 0000..10FFFF ; Emoji_Presentation ; No
> +# All omitted code points have Emoji_Presentation=No
>
> 231A..231B ; Emoji_Presentation # E0.6 [2] (⌚..⌛) watch..hourglass done
> 23E9..23EC ; Emoji_Presentation # E0.6 [4] (⏩..⏬) fast-forward button..fast down button
> @@ -625,6 +633,7 @@
> 1F6D1..1F6D2 ; Emoji_Presentation # E3.0 [2] (🛑..🛒) stop sign..shopping cart
> 1F6D5 ; Emoji_Presentation # E12.0 [1] (🛕) hindu temple
> 1F6D6..1F6D7 ; Emoji_Presentation # E13.0 [2] (🛖..🛗) hut..elevator
> +1F6DC ; Emoji_Presentation # E15.0 [1] (🛜) wireless
> 1F6DD..1F6DF ; Emoji_Presentation # E14.0 [3] (🛝..🛟) playground slide..ring buoy
> 1F6EB..1F6EC ; Emoji_Presentation # E1.0 [2] (🛫..🛬) airplane departure..airplane arrival
> 1F6F4..1F6F6 ; Emoji_Presentation # E3.0 [3] (🛴..🛶) kick scooter..canoe
> @@ -681,28 +690,36 @@
> 1F9E7..1F9FF ; Emoji_Presentation # E11.0 [25] (🧧..🧿) red envelope..nazar amulet
> 1FA70..1FA73 ; Emoji_Presentation # E12.0 [4] (🩰..🩳) ballet shoes..shorts
> 1FA74 ; Emoji_Presentation # E13.0 [1] (🩴) thong sandal
> +1FA75..1FA77 ; Emoji_Presentation # E15.0 [3] (🩵..🩷) light blue heart..pink heart
> 1FA78..1FA7A ; Emoji_Presentation # E12.0 [3] (🩸..🩺) drop of blood..stethoscope
> 1FA7B..1FA7C ; Emoji_Presentation # E14.0 [2] (🩻..🩼) x-ray..crutch
> 1FA80..1FA82 ; Emoji_Presentation # E12.0 [3] (🪀..🪂) yo-yo..parachute
> 1FA83..1FA86 ; Emoji_Presentation # E13.0 [4] (🪃..🪆) boomerang..nesting dolls
> +1FA87..1FA88 ; Emoji_Presentation # E15.0 [2] (🪇..🪈) maracas..flute
> 1FA90..1FA95 ; Emoji_Presentation # E12.0 [6] (🪐..🪕) ringed planet..banjo
> 1FA96..1FAA8 ; Emoji_Presentation # E13.0 [19] (🪖..🪨) military helmet..rock
> 1FAA9..1FAAC ; Emoji_Presentation # E14.0 [4] (🪩..🪬) mirror ball..hamsa
> +1FAAD..1FAAF ; Emoji_Presentation # E15.0 [3] (🪭..🪯) folding hand fan..khanda
> 1FAB0..1FAB6 ; Emoji_Presentation # E13.0 [7] (🪰..🪶) fly..feather
> 1FAB7..1FABA ; Emoji_Presentation # E14.0 [4] (🪷..🪺) lotus..nest with eggs
> +1FABB..1FABD ; Emoji_Presentation # E15.0 [3] (🪻..🪽) hyacinth..wing
> +1FABF ; Emoji_Presentation # E15.0 [1] (🪿) goose
> 1FAC0..1FAC2 ; Emoji_Presentation # E13.0 [3] (🫀..🫂) anatomical heart..people hugging
> 1FAC3..1FAC5 ; Emoji_Presentation # E14.0 [3] (🫃..🫅) pregnant man..person with crown
> +1FACE..1FACF ; Emoji_Presentation # E15.0 [2] (🫎..🫏) moose..donkey
> 1FAD0..1FAD6 ; Emoji_Presentation # E13.0 [7] (🫐..🫖) blueberries..teapot
> 1FAD7..1FAD9 ; Emoji_Presentation # E14.0 [3] (🫗..🫙) pouring liquid..jar
> +1FADA..1FADB ; Emoji_Presentation # E15.0 [2] (🫚..🫛) ginger root..pea pod
> 1FAE0..1FAE7 ; Emoji_Presentation # E14.0 [8] (🫠..🫧) melting face..bubbles
> +1FAE8 ; Emoji_Presentation # E15.0 [1] (🫨) shaking face
> 1FAF0..1FAF6 ; Emoji_Presentation # E14.0 [7] (🫰..🫶) hand with index finger and thumb crossed..heart hands
> +1FAF7..1FAF8 ; Emoji_Presentation # E15.0 [2] (🫷..🫸) leftwards pushing hand..rightwards pushing hand
>
> -# Total elements: 1185
> +# Total elements: 1205
>
> # ================================================
>
> -# All omitted code points have Emoji_Modifier=No
> -# @missing: 0000..10FFFF ; Emoji_Modifier ; No
> +# All omitted code points have Emoji_Modifier=No
>
> 1F3FB..1F3FF ; Emoji_Modifier # E1.0 [5] (🏻..🏿) light skin tone..dark skin tone
>
> @@ -710,8 +727,7 @@
>
> # ================================================
>
> -# All omitted code points have Emoji_Modifier_Base=No
> -# @missing: 0000..10FFFF ; Emoji_Modifier_Base ; No
> +# All omitted code points have Emoji_Modifier_Base=No
>
> 261D ; Emoji_Modifier_Base # E0.6 [1] (☝️) index pointing up
> 26F9 ; Emoji_Modifier_Base # E0.7 [1] (⛹️) person bouncing ball
> @@ -762,13 +778,13 @@
> 1F9D1..1F9DD ; Emoji_Modifier_Base # E5.0 [13] (🧑..🧝) person..elf
> 1FAC3..1FAC5 ; Emoji_Modifier_Base # E14.0 [3] (🫃..🫅) pregnant man..person with crown
> 1FAF0..1FAF6 ; Emoji_Modifier_Base # E14.0 [7] (🫰..🫶) hand with index finger and thumb crossed..heart hands
> +1FAF7..1FAF8 ; Emoji_Modifier_Base # E15.0 [2] (🫷..🫸) leftwards pushing hand..rightwards pushing hand
>
> -# Total elements: 132
> +# Total elements: 134
>
> # ================================================
>
> -# All omitted code points have Emoji_Component=No
> -# @missing: 0000..10FFFF ; Emoji_Component ; No
> +# All omitted code points have Emoji_Component=No
>
> 0023 ; Emoji_Component # E0.0 [1] (#️) hash sign
> 002A ; Emoji_Component # E0.0 [1] (*️) asterisk
> @@ -785,8 +801,7 @@ E0020..E007F ; Emoji_Component # E0.0 [96] (󠀠..󠁿) tag space..c
>
> # ================================================
>
> -# All omitted code points have Extended_Pictographic=No
> -# @missing: 0000..10FFFF ; Extended_Pictographic ; No
> +# All omitted code points have Extended_Pictographic=No
>
> 00A9 ; Extended_Pictographic# E0.6 [1] (©️) copyright
> 00AE ; Extended_Pictographic# E0.6 [1] (®️) registered
> @@ -1190,7 +1205,8 @@ E0020..E007F ; Emoji_Component # E0.0 [96] (󠀠..󠁿) tag space..c
> 1F6D3..1F6D4 ; Extended_Pictographic# E0.0 [2] (🛓..🛔) STUPA..PAGODA
> 1F6D5 ; Extended_Pictographic# E12.0 [1] (🛕) hindu temple
> 1F6D6..1F6D7 ; Extended_Pictographic# E13.0 [2] (🛖..🛗) hut..elevator
> -1F6D8..1F6DC ; Extended_Pictographic# E0.0 [5] (🛘..🛜) <reserved-1F6D8>..<reserved-1F6DC>
> +1F6D8..1F6DB ; Extended_Pictographic# E0.0 [4] (🛘..🛛) <reserved-1F6D8>..<reserved-1F6DB>
> +1F6DC ; Extended_Pictographic# E15.0 [1] (🛜) wireless
> 1F6DD..1F6DF ; Extended_Pictographic# E14.0 [3] (🛝..🛟) playground slide..ring buoy
> 1F6E0..1F6E5 ; Extended_Pictographic# E0.7 [6] (🛠️..🛥️) hammer and wrench..motor boat
> 1F6E6..1F6E8 ; Extended_Pictographic# E0.0 [3] (🛦..🛨) UP-POINTING MILITARY AIRPLANE..UP-POINTING SMALL AIRPLANE
> @@ -1207,7 +1223,7 @@ E0020..E007F ; Emoji_Component # E0.0 [96] (󠀠..󠁿) tag space..c
> 1F6FA ; Extended_Pictographic# E12.0 [1] (🛺) auto rickshaw
> 1F6FB..1F6FC ; Extended_Pictographic# E13.0 [2] (🛻..🛼) pickup truck..roller skate
> 1F6FD..1F6FF ; Extended_Pictographic# E0.0 [3] (🛽..🛿) <reserved-1F6FD>..<reserved-1F6FF>
> -1F774..1F77F ; Extended_Pictographic# E0.0 [12] (🝴..🝿) <reserved-1F774>..<reserved-1F77F>
> +1F774..1F77F ; Extended_Pictographic# E0.0 [12] (🝴..🝿) LOT OF FORTUNE..ORCUS
> 1F7D5..1F7DF ; Extended_Pictographic# E0.0 [11] (🟕..🟟) CIRCLED TRIANGLE..<reserved-1F7DF>
> 1F7E0..1F7EB ; Extended_Pictographic# E12.0 [12] (🟠..🟫) orange circle..brown square
> 1F7EC..1F7EF ; Extended_Pictographic# E0.0 [4] (🟬..🟯) <reserved-1F7EC>..<reserved-1F7EF>
> @@ -1266,30 +1282,37 @@ E0020..E007F ; Emoji_Component # E0.0 [96] (󠀠..󠁿) tag space..c
> 1FA00..1FA6F ; Extended_Pictographic# E0.0 [112] (🨀..🩯) NEUTRAL CHESS KING..<reserved-1FA6F>
> 1FA70..1FA73 ; Extended_Pictographic# E12.0 [4] (🩰..🩳) ballet shoes..shorts
> 1FA74 ; Extended_Pictographic# E13.0 [1] (🩴) thong sandal
> -1FA75..1FA77 ; Extended_Pictographic# E0.0 [3] (🩵..🩷) <reserved-1FA75>..<reserved-1FA77>
> +1FA75..1FA77 ; Extended_Pictographic# E15.0 [3] (🩵..🩷) light blue heart..pink heart
> 1FA78..1FA7A ; Extended_Pictographic# E12.0 [3] (🩸..🩺) drop of blood..stethoscope
> 1FA7B..1FA7C ; Extended_Pictographic# E14.0 [2] (🩻..🩼) x-ray..crutch
> 1FA7D..1FA7F ; Extended_Pictographic# E0.0 [3] (🩽..🩿) <reserved-1FA7D>..<reserved-1FA7F>
> 1FA80..1FA82 ; Extended_Pictographic# E12.0 [3] (🪀..🪂) yo-yo..parachute
> 1FA83..1FA86 ; Extended_Pictographic# E13.0 [4] (🪃..🪆) boomerang..nesting dolls
> -1FA87..1FA8F ; Extended_Pictographic# E0.0 [9] (🪇..🪏) <reserved-1FA87>..<reserved-1FA8F>
> +1FA87..1FA88 ; Extended_Pictographic# E15.0 [2] (🪇..🪈) maracas..flute
> +1FA89..1FA8F ; Extended_Pictographic# E0.0 [7] (🪉..🪏) <reserved-1FA89>..<reserved-1FA8F>
> 1FA90..1FA95 ; Extended_Pictographic# E12.0 [6] (🪐..🪕) ringed planet..banjo
> 1FA96..1FAA8 ; Extended_Pictographic# E13.0 [19] (🪖..🪨) military helmet..rock
> 1FAA9..1FAAC ; Extended_Pictographic# E14.0 [4] (🪩..🪬) mirror ball..hamsa
> -1FAAD..1FAAF ; Extended_Pictographic# E0.0 [3] (🪭..🪯) <reserved-1FAAD>..<reserved-1FAAF>
> +1FAAD..1FAAF ; Extended_Pictographic# E15.0 [3] (🪭..🪯) folding hand fan..khanda
> 1FAB0..1FAB6 ; Extended_Pictographic# E13.0 [7] (🪰..🪶) fly..feather
> 1FAB7..1FABA ; Extended_Pictographic# E14.0 [4] (🪷..🪺) lotus..nest with eggs
> -1FABB..1FABF ; Extended_Pictographic# E0.0 [5] (🪻..🪿) <reserved-1FABB>..<reserved-1FABF>
> +1FABB..1FABD ; Extended_Pictographic# E15.0 [3] (🪻..🪽) hyacinth..wing
> +1FABE ; Extended_Pictographic# E0.0 [1] (🪾) <reserved-1FABE>
> +1FABF ; Extended_Pictographic# E15.0 [1] (🪿) goose
> 1FAC0..1FAC2 ; Extended_Pictographic# E13.0 [3] (🫀..🫂) anatomical heart..people hugging
> 1FAC3..1FAC5 ; Extended_Pictographic# E14.0 [3] (🫃..🫅) pregnant man..person with crown
> -1FAC6..1FACF ; Extended_Pictographic# E0.0 [10] (🫆..🫏) <reserved-1FAC6>..<reserved-1FACF>
> +1FAC6..1FACD ; Extended_Pictographic# E0.0 [8] (🫆..🫍) <reserved-1FAC6>..<reserved-1FACD>
> +1FACE..1FACF ; Extended_Pictographic# E15.0 [2] (🫎..🫏) moose..donkey
> 1FAD0..1FAD6 ; Extended_Pictographic# E13.0 [7] (🫐..🫖) blueberries..teapot
> 1FAD7..1FAD9 ; Extended_Pictographic# E14.0 [3] (🫗..🫙) pouring liquid..jar
> -1FADA..1FADF ; Extended_Pictographic# E0.0 [6] (🫚..🫟) <reserved-1FADA>..<reserved-1FADF>
> +1FADA..1FADB ; Extended_Pictographic# E15.0 [2] (🫚..🫛) ginger root..pea pod
> +1FADC..1FADF ; Extended_Pictographic# E0.0 [4] (🫜..🫟) <reserved-1FADC>..<reserved-1FADF>
> 1FAE0..1FAE7 ; Extended_Pictographic# E14.0 [8] (🫠..🫧) melting face..bubbles
> -1FAE8..1FAEF ; Extended_Pictographic# E0.0 [8] (🫨..🫯) <reserved-1FAE8>..<reserved-1FAEF>
> +1FAE8 ; Extended_Pictographic# E15.0 [1] (🫨) shaking face
> +1FAE9..1FAEF ; Extended_Pictographic# E0.0 [7] (🫩..🫯) <reserved-1FAE9>..<reserved-1FAEF>
> 1FAF0..1FAF6 ; Extended_Pictographic# E14.0 [7] (🫰..🫶) hand with index finger and thumb crossed..heart hands
> -1FAF7..1FAFF ; Extended_Pictographic# E0.0 [9] (🫷..🫿) <reserved-1FAF7>..<reserved-1FAFF>
> +1FAF7..1FAF8 ; Extended_Pictographic# E15.0 [2] (🫷..🫸) leftwards pushing hand..rightwards pushing hand
> +1FAF9..1FAFF ; Extended_Pictographic# E0.0 [7] (🫹..🫿) <reserved-1FAF9>..<reserved-1FAFF>
> 1FC00..1FFFD ; Extended_Pictographic# E0.0[1022] (🰀..🿽) <reserved-1FC00>..<reserved-1FFFD>
>
> # Total elements: 3537
>

-- 
Kind regards,
Hiltjo
Received on Thu Sep 15 2022 - 09:44:11 CEST
