1
1
mirror of https://github.com/mrabarnett/mrab-regex.git synced 2025-10-05 20:02:39 +02:00
Files
mrab-regex/docs/UnicodeProperties.rst

1552 lines
29 KiB
ReStructuredText
Raw Permalink Normal View History

The following is a list of the 94 properties which are supported by this module:
Alphabetic [Alpha]
No [F, False, N]
Yes [T, True, Y]
Alphanumeric [AlNum]
No [F, False, N]
Yes [T, True, Y]
Any
No [F, False, N]
Yes [T, True, Y]
ASCII_Hex_Digit [AHex]
No [F, False, N]
Yes [T, True, Y]
Bidi_Class [bc]
2012-11-20 03:20:20 +00:00
Arabic_Letter [AL]
Arabic_Number [AN]
Boundary_Neutral [BN]
Common_Separator [CS]
European_Number [EN]
European_Separator [ES]
European_Terminator [ET]
2013-10-12 02:35:02 +01:00
First_Strong_Isolate [FSI]
Left_To_Right [L]
2012-11-20 03:20:20 +00:00
Left_To_Right_Embedding [LRE]
2013-10-12 02:35:02 +01:00
Left_To_Right_Isolate [LRI]
2012-11-20 03:20:20 +00:00
Left_To_Right_Override [LRO]
Nonspacing_Mark [NSM]
Other_Neutral [ON]
Paragraph_Separator [B]
Pop_Directional_Format [PDF]
2013-10-12 02:35:02 +01:00
Pop_Directional_Isolate [PDI]
2012-11-20 03:20:20 +00:00
Right_To_Left [R]
Right_To_Left_Embedding [RLE]
2013-10-12 02:35:02 +01:00
Right_To_Left_Isolate [RLI]
2012-11-20 03:20:20 +00:00
Right_To_Left_Override [RLO]
Segment_Separator [S]
White_Space [WS]
Bidi_Control [Bidi_C]
No [F, False, N]
Yes [T, True, Y]
Bidi_Mirrored [Bidi_M]
No [F, False, N]
Yes [T, True, Y]
Blank
No [F, False, N]
Yes [T, True, Y]
Block [blk]
2016-06-25 00:05:10 +01:00
Adlam
Aegean_Numbers
Ahom
Alchemical_Symbols [Alchemical]
Alphabetic_Presentation_Forms [Alphabetic_PF]
Anatolian_Hieroglyphs
Ancient_Greek_Musical_Notation [Ancient_Greek_Music]
Ancient_Greek_Numbers
Ancient_Symbols
Arabic
Arabic_Extended_A [Arabic_Ext_A]
Arabic_Mathematical_Alphabetic_Symbols [Arabic_Math]
Arabic_Presentation_Forms_A [Arabic_PF_A]
Arabic_Presentation_Forms_B [Arabic_PF_B]
Arabic_Supplement [Arabic_Sup]
Armenian
Arrows
Avestan
Balinese
Bamum
Bamum_Supplement [Bamum_Sup]
Basic_Latin [ASCII]
2014-06-28 20:06:07 +01:00
Bassa_Vah
Batak
Bengali
2016-06-25 00:05:10 +01:00
Bhaiksuki
Block_Elements
Bopomofo
Bopomofo_Extended [Bopomofo_Ext]
Box_Drawing
Brahmi
Braille_Patterns [Braille]
Buginese
Buhid
Byzantine_Musical_Symbols [Byzantine_Music]
Carian
2014-06-28 20:06:07 +01:00
Caucasian_Albanian
Chakma
Cham
Cherokee
Cherokee_Supplement [Cherokee_Sup]
2018-06-05 20:49:44 +01:00
Chess_Symbols
CJK_Compatibility [CJK_Compat]
CJK_Compatibility_Forms [CJK_Compat_Forms]
CJK_Compatibility_Ideographs [CJK_Compat_Ideographs]
CJK_Compatibility_Ideographs_Supplement [CJK_Compat_Ideographs_Sup]
CJK_Radicals_Supplement [CJK_Radicals_Sup]
CJK_Strokes
CJK_Symbols_And_Punctuation [CJK_Symbols]
CJK_Unified_Ideographs [CJK]
CJK_Unified_Ideographs_Extension_A [CJK_Ext_A]
CJK_Unified_Ideographs_Extension_B [CJK_Ext_B]
CJK_Unified_Ideographs_Extension_C [CJK_Ext_C]
CJK_Unified_Ideographs_Extension_D [CJK_Ext_D]
CJK_Unified_Ideographs_Extension_E [CJK_Ext_E]
2017-06-23 01:13:51 +01:00
CJK_Unified_Ideographs_Extension_F [CJK_Ext_F]
Combining_Diacritical_Marks [Diacriticals]
2014-06-28 20:06:07 +01:00
Combining_Diacritical_Marks_Extended [Diacriticals_Ext]
Combining_Diacritical_Marks_For_Symbols [Combining_Marks_For_Symbols, Diacriticals_For_Symbols]
Combining_Diacritical_Marks_Supplement [Diacriticals_Sup]
Combining_Half_Marks [Half_Marks]
Common_Indic_Number_Forms [Indic_Number_Forms]
Control_Pictures
Coptic
2014-06-28 20:06:07 +01:00
Coptic_Epact_Numbers
Counting_Rod_Numerals [Counting_Rod]
Cuneiform
Cuneiform_Numbers_And_Punctuation [Cuneiform_Numbers]
Currency_Symbols
Cypriot_Syllabary
Cyrillic
Cyrillic_Extended_A [Cyrillic_Ext_A]
Cyrillic_Extended_B [Cyrillic_Ext_B]
2016-06-25 00:05:10 +01:00
Cyrillic_Extended_C [Cyrillic_Ext_C]
Cyrillic_Supplement [Cyrillic_Sup, Cyrillic_Supplementary]
Deseret
Devanagari
Devanagari_Extended [Devanagari_Ext]
Dingbats
2018-06-05 20:49:44 +01:00
Dogra
Domino_Tiles [Domino]
2014-06-28 20:06:07 +01:00
Duployan
Early_Dynastic_Cuneiform
Egyptian_Hieroglyphs
2019-06-02 02:32:45 +01:00
Egyptian_Hieroglyph_Format_Controls
2014-06-28 20:06:07 +01:00
Elbasan
2019-06-02 02:32:45 +01:00
Elymaic
Emoticons
Enclosed_Alphanumerics [Enclosed_Alphanum]
Enclosed_Alphanumeric_Supplement [Enclosed_Alphanum_Sup]
Enclosed_CJK_Letters_And_Months [Enclosed_CJK]
Enclosed_Ideographic_Supplement [Enclosed_Ideographic_Sup]
Ethiopic
Ethiopic_Extended [Ethiopic_Ext]
Ethiopic_Extended_A [Ethiopic_Ext_A]
Ethiopic_Supplement [Ethiopic_Sup]
General_Punctuation [Punctuation]
Geometric_Shapes
2014-06-28 20:06:07 +01:00
Geometric_Shapes_Extended [Geometric_Shapes_Ext]
Georgian
2018-06-05 20:49:44 +01:00
Georgian_Extended [Georgian_Ext]
Georgian_Supplement [Georgian_Sup]
Glagolitic
2016-06-25 00:05:10 +01:00
Glagolitic_Supplement [Glagolitic_Sup]
Gothic
2014-06-28 20:06:07 +01:00
Grantha
Greek_And_Coptic [Greek]
Greek_Extended [Greek_Ext]
Gujarati
2018-06-05 20:49:44 +01:00
Gunjala_Gondi
Gurmukhi
Halfwidth_And_Fullwidth_Forms [Half_And_Full_Forms]
Hangul_Compatibility_Jamo [Compat_Jamo]
Hangul_Jamo [Jamo]
Hangul_Jamo_Extended_A [Jamo_Ext_A]
Hangul_Jamo_Extended_B [Jamo_Ext_B]
Hangul_Syllables [Hangul]
2018-06-05 20:49:44 +01:00
Hanifi_Rohingya
Hanunoo
Hatran
Hebrew
High_Private_Use_Surrogates [High_PU_Surrogates]
High_Surrogates
Hiragana
Ideographic_Description_Characters [IDC]
2016-06-25 00:05:10 +01:00
Ideographic_Symbols_And_Punctuation [Ideographic_Symbols]
Imperial_Aramaic
2018-06-05 20:49:44 +01:00
Indic_Siyaq_Numbers
Inscriptional_Pahlavi
Inscriptional_Parthian
IPA_Extensions [IPA_Ext]
Javanese
Kaithi
2017-06-23 01:13:51 +01:00
Kana_Extended_A [Kana_Ext_A]
Kana_Supplement [Kana_Sup]
Kanbun
Kangxi_Radicals [Kangxi]
Kannada
Katakana
Katakana_Phonetic_Extensions [Katakana_Ext]
Kayah_Li
Kharoshthi
Khmer
Khmer_Symbols
2014-06-28 20:06:07 +01:00
Khojki
Khudawadi
Lao
Latin_1_Supplement [Latin_1, Latin_1_Sup]
Latin_Extended_A [Latin_Ext_A]
Latin_Extended_Additional [Latin_Ext_Additional]
Latin_Extended_B [Latin_Ext_B]
Latin_Extended_C [Latin_Ext_C]
Latin_Extended_D [Latin_Ext_D]
2014-06-28 20:06:07 +01:00
Latin_Extended_E [Latin_Ext_E]
Lepcha
Letterlike_Symbols
Limbu
2014-06-28 20:06:07 +01:00
Linear_A
Linear_B_Ideograms
Linear_B_Syllabary
Lisu
Low_Surrogates
Lycian
Lydian
2014-06-28 20:06:07 +01:00
Mahajani
Mahjong_Tiles [Mahjong]
2018-06-05 20:49:44 +01:00
Makasar
Malayalam
Mandaic
2014-06-28 20:06:07 +01:00
Manichaean
2016-06-25 00:05:10 +01:00
Marchen
2017-06-23 01:13:51 +01:00
Masaram_Gondi
Mathematical_Alphanumeric_Symbols [Math_Alphanum]
Mathematical_Operators [Math_Operators]
2018-06-05 20:49:44 +01:00
Mayan_Numerals
Medefaidrin
Meetei_Mayek
Meetei_Mayek_Extensions [Meetei_Mayek_Ext]
2014-06-28 20:06:07 +01:00
Mende_Kikakui
Meroitic_Cursive
Meroitic_Hieroglyphs
Miao
Miscellaneous_Mathematical_Symbols_A [Misc_Math_Symbols_A]
Miscellaneous_Mathematical_Symbols_B [Misc_Math_Symbols_B]
Miscellaneous_Symbols [Misc_Symbols]
Miscellaneous_Symbols_And_Arrows [Misc_Arrows]
Miscellaneous_Symbols_And_Pictographs [Misc_Pictographs]
Miscellaneous_Technical [Misc_Technical]
2014-06-28 20:06:07 +01:00
Modi
Modifier_Tone_Letters
Mongolian
2016-06-25 00:05:10 +01:00
Mongolian_Supplement [Mongolian_Sup]
2014-06-28 20:06:07 +01:00
Mro
Multani
Musical_Symbols [Music]
Myanmar
Myanmar_Extended_A [Myanmar_Ext_A]
2014-06-28 20:06:07 +01:00
Myanmar_Extended_B [Myanmar_Ext_B]
Nabataean
2019-06-02 02:32:45 +01:00
Nandinagari
2016-06-25 00:05:10 +01:00
Newa
New_Tai_Lue
NKo
No_Block [NB]
Number_Forms
2017-06-23 01:13:51 +01:00
Nushu
2019-06-02 02:32:45 +01:00
Nyiakeng_Puachue_Hmong
Ogham
Old_Hungarian
Old_Italic
2014-06-28 20:06:07 +01:00
Old_North_Arabian
Old_Permic
Old_Persian
2018-06-05 20:49:44 +01:00
Old_Sogdian
Old_South_Arabian
Old_Turkic
Ol_Chiki
Optical_Character_Recognition [OCR]
Oriya
2014-06-28 20:06:07 +01:00
Ornamental_Dingbats
2016-06-25 00:05:10 +01:00
Osage
Osmanya
2019-06-02 02:32:45 +01:00
Ottoman_Siyaq_Numbers
2014-06-28 20:06:07 +01:00
Pahawh_Hmong
Palmyrene
Pau_Cin_Hau
Phags_Pa
Phaistos_Disc [Phaistos]
Phoenician
Phonetic_Extensions [Phonetic_Ext]
Phonetic_Extensions_Supplement [Phonetic_Ext_Sup]
Playing_Cards
Private_Use_Area [Private_Use, PUA]
2014-06-28 20:06:07 +01:00
Psalter_Pahlavi
Rejang
Rumi_Numeral_Symbols [Rumi]
Runic
Samaritan
Saurashtra
Sharada
Shavian
2014-06-28 20:06:07 +01:00
Shorthand_Format_Controls
Siddham
Sinhala
2014-06-28 20:06:07 +01:00
Sinhala_Archaic_Numbers
Small_Form_Variants [Small_Forms]
2019-06-02 02:32:45 +01:00
Small_Kana_Extension [Small_Kana_Ext]
2018-06-05 20:49:44 +01:00
Sogdian
Sora_Sompeng
2017-06-23 01:13:51 +01:00
Soyombo
Spacing_Modifier_Letters [Modifier_Letters]
Specials
Sundanese
Sundanese_Supplement [Sundanese_Sup]
Superscripts_And_Subscripts [Super_And_Sub]
Supplemental_Arrows_A [Sup_Arrows_A]
Supplemental_Arrows_B [Sup_Arrows_B]
2014-06-28 20:06:07 +01:00
Supplemental_Arrows_C [Sup_Arrows_C]
Supplemental_Mathematical_Operators [Sup_Math_Operators]
Supplemental_Punctuation [Sup_Punctuation]
Supplemental_Symbols_And_Pictographs [Sup_Symbols_And_Pictographs]
Supplementary_Private_Use_Area_A [Sup_PUA_A]
Supplementary_Private_Use_Area_B [Sup_PUA_B]
Sutton_SignWriting
Syloti_Nagri
2019-06-02 02:32:45 +01:00
Symbols_And_Pictographs_Extended_A [Symbols_And_Pictographs_Ext_A]
Syriac
2017-06-23 01:13:51 +01:00
Syriac_Supplement [Syriac_Sup]
Tagalog
Tagbanwa
Tags
Tai_Le
Tai_Tham
Tai_Viet
Tai_Xuan_Jing_Symbols [Tai_Xuan_Jing]
Takri
Tamil
2019-06-02 02:32:45 +01:00
Tamil_Supplement [Tamil_Sup]
2016-06-25 00:05:10 +01:00
Tangut
Tangut_Components
Telugu
Thaana
Thai
Tibetan
Tifinagh
2014-06-28 20:06:07 +01:00
Tirhuta
Transport_And_Map_Symbols [Transport_And_Map]
Ugaritic
Unified_Canadian_Aboriginal_Syllabics [Canadian_Syllabics, UCAS]
Unified_Canadian_Aboriginal_Syllabics_Extended [UCAS_Ext]
Vai
Variation_Selectors [VS]
Variation_Selectors_Supplement [VS_Sup]
Vedic_Extensions [Vedic_Ext]
Vertical_Forms
2019-06-02 02:32:45 +01:00
Wancho
2014-06-28 20:06:07 +01:00
Warang_Citi
Yijing_Hexagram_Symbols [Yijing]
Yi_Radicals
Yi_Syllables
2017-06-23 01:13:51 +01:00
Zanabazar_Square
Canonical_Combining_Class [ccc]
2012-11-20 03:20:20 +00:00
Above [230, A]
Above_Left [228, AL]
Above_Right [232, AR]
Attached_Above [214, ATA]
Attached_Above_Right [216, ATAR]
Attached_Below [202, ATB]
Attached_Below_Left [200, ATBL]
2012-11-20 03:20:20 +00:00
Below [220, B]
Below_Left [218, BL]
Below_Right [222, BR]
CCC10 [10]
CCC103 [103]
CCC107 [107]
CCC11 [11]
CCC118 [118]
CCC12 [12]
CCC122 [122]
CCC129 [129]
CCC13 [13]
CCC130 [130]
CCC132 [132]
CCC133 [133]
2012-11-20 03:20:20 +00:00
CCC14 [14]
CCC15 [15]
CCC16 [16]
CCC17 [17]
CCC18 [18]
CCC19 [19]
CCC20 [20]
CCC21 [21]
CCC22 [22]
CCC23 [23]
CCC24 [24]
CCC25 [25]
CCC26 [26]
CCC27 [27]
CCC28 [28]
CCC29 [29]
CCC30 [30]
CCC31 [31]
CCC32 [32]
CCC33 [33]
CCC34 [34]
CCC35 [35]
CCC36 [36]
CCC84 [84]
CCC91 [91]
Double_Above [234, DA]
Double_Below [233, DB]
Iota_Subscript [240, IS]
Kana_Voicing [8, KV]
Left [224, L]
Not_Reordered [0, NR]
2012-11-20 03:20:20 +00:00
Nukta [7, NK]
Overlay [1, OV]
Right [226, R]
Virama [9, VR]
Cased
No [F, False, N]
Yes [T, True, Y]
Case_Ignorable [CI]
No [F, False, N]
Yes [T, True, Y]
Changes_When_Casefolded [CWCF]
No [F, False, N]
Yes [T, True, Y]
Changes_When_Casemapped [CWCM]
No [F, False, N]
Yes [T, True, Y]
Changes_When_Lowercased [CWL]
No [F, False, N]
Yes [T, True, Y]
Changes_When_Titlecased [CWT]
No [F, False, N]
Yes [T, True, Y]
Changes_When_Uppercased [CWU]
No [F, False, N]
Yes [T, True, Y]
Dash
No [F, False, N]
Yes [T, True, Y]
Decomposition_Type [dt]
Canonical [Can]
Circle [Enc]
Compat [Com]
Final [Fin]
Font
Fraction [Fra]
Initial [Init]
Isolated [Iso]
Medial [Med]
Narrow [Nar]
2012-11-20 03:20:20 +00:00
Nobreak [Nb]
None
Small [Sml]
Square [Sqr]
Sub
Super [Sup]
Vertical [Vert]
Wide
Default_Ignorable_Code_Point [DI]
No [F, False, N]
Yes [T, True, Y]
Deprecated [Dep]
No [F, False, N]
Yes [T, True, Y]
Diacritic [Dia]
No [F, False, N]
Yes [T, True, Y]
East_Asian_Width [ea]
2012-11-20 03:20:20 +00:00
Ambiguous [A]
Fullwidth [F]
Halfwidth [H]
Narrow [Na]
Neutral [N]
2012-11-20 03:20:20 +00:00
Wide [W]
Emoji
No
Yes
Emoji_Component
No
Yes
Emoji_Modifier
No
Yes
Emoji_Modifier_Base
No
Yes
Emoji_Presentation
No
Yes
Extended_Pictographic
No
Yes
Extender [Ext]
No [F, False, N]
Yes [T, True, Y]
General_Category [gc]
Assigned
Cased_Letter [LC]
2012-11-20 03:20:20 +00:00
Close_Punctuation [Pe]
Connector_Punctuation [Pc]
Control [Cc, cntrl]
Currency_Symbol [Sc]
Dash_Punctuation [Pd]
Decimal_Number [digit, Nd]
2012-11-20 03:20:20 +00:00
Enclosing_Mark [Me]
Final_Punctuation [Pf]
Format [Cf]
Initial_Punctuation [Pi]
Letter [L, L&]
Letter_Number [Nl]
Line_Separator [Zl]
Lowercase_Letter [Ll]
Mark [Combining_Mark, M, M&]
Math_Symbol [Sm]
Modifier_Letter [Lm]
Modifier_Symbol [Sk]
Nonspacing_Mark [Mn]
Number [N, N&]
Open_Punctuation [Ps]
Other [C, C&]
Other_Letter [Lo]
Other_Number [No]
Other_Punctuation [Po]
Other_Symbol [So]
Paragraph_Separator [Zp]
Private_Use [Co]
Punctuation [P, P&, punct]
Separator [Z, Z&]
Space_Separator [Zs]
Spacing_Mark [Mc]
Surrogate [Cs]
Symbol [S, S&]
Titlecase_Letter [Lt]
Unassigned [Cn]
Uppercase_Letter [Lu]
Graph
No [F, False, N]
Yes [T, True, Y]
Grapheme_Base [Gr_Base]
No [F, False, N]
Yes [T, True, Y]
Grapheme_Cluster_Break [GCB]
2012-11-20 03:20:20 +00:00
Control [CN]
CR
Extend [EX]
2016-06-25 00:05:10 +01:00
E_Base [EB]
E_Base_GAZ [EBG]
E_Modifier [EM]
Glue_After_Zwj [GAZ]
L
LF
LV
LVT
Other [XX]
Prepend [PP]
2012-11-20 03:20:20 +00:00
Regional_Indicator [RI]
SpacingMark [SM]
T
V
2016-06-25 00:05:10 +01:00
ZWJ
Grapheme_Extend [Gr_Ext]
No [F, False, N]
Yes [T, True, Y]
Grapheme_Link [Gr_Link]
No [F, False, N]
Yes [T, True, Y]
Hangul_Syllable_Type [hst]
Leading_Jamo [L]
2012-11-20 03:20:20 +00:00
LVT_Syllable [LVT]
LV_Syllable [LV]
Not_Applicable [NA]
Trailing_Jamo [T]
Vowel_Jamo [V]
Hex_Digit [Hex]
No [F, False, N]
Yes [T, True, Y]
Hyphen
No [F, False, N]
Yes [T, True, Y]
Ideographic [Ideo]
No [F, False, N]
Yes [T, True, Y]
IDS_Binary_Operator [IDSB]
No [F, False, N]
Yes [T, True, Y]
IDS_Trinary_Operator [IDST]
No [F, False, N]
Yes [T, True, Y]
ID_Continue [IDC]
No [F, False, N]
Yes [T, True, Y]
ID_Start [IDS]
No [F, False, N]
Yes [T, True, Y]
Indic_Positional_Category [InPC]
Bottom
2017-06-23 01:13:51 +01:00
Bottom_And_Left
Bottom_And_Right
Left
Left_And_Right
2012-11-20 03:20:20 +00:00
NA
Overstruck
Right
Top
Top_And_Bottom
Top_And_Bottom_And_Right
Top_And_Left
Top_And_Left_And_Right
Top_And_Right
Visual_Order_Left
Indic_Syllabic_Category [InSC]
Avagraha
Bindu
2014-06-28 20:06:07 +01:00
Brahmi_Joining_Number
Cantillation_Mark
Consonant
Consonant_Dead
Consonant_Final
Consonant_Head_Letter
2018-06-05 20:49:44 +01:00
Consonant_Initial_Postfixed
Consonant_Killer
Consonant_Medial
Consonant_Placeholder
2014-06-28 20:06:07 +01:00
Consonant_Preceding_Repha
Consonant_Prefixed
Consonant_Subjoined
2014-06-28 20:06:07 +01:00
Consonant_Succeeding_Repha
Consonant_With_Stacker
2014-06-28 20:06:07 +01:00
Gemination_Mark
Invisible_Stacker
Joiner
Modifying_Letter
2014-06-28 20:06:07 +01:00
Non_Joiner
Nukta
2014-06-28 20:06:07 +01:00
Number
Number_Joiner
Other
2014-06-28 20:06:07 +01:00
Pure_Killer
Register_Shifter
Syllable_Modifier
Tone_Letter
Tone_Mark
Virama
Visarga
Vowel
Vowel_Dependent
Vowel_Independent
Joining_Group [jg]
2016-06-25 00:05:10 +01:00
African_Feh
African_Noon
African_Qaf
Ain
Alaph
Alef
Beh
Beth
Burushaski_Yeh_Barree
Dal
Dalath_Rish
E
Farsi_Yeh
Fe
Feh
Final_Semkath
Gaf
Gamal
Hah
2012-11-20 03:20:20 +00:00
Hamza_On_Heh_Goal [Teh_Marbuta_Goal]
2018-06-05 20:49:44 +01:00
Hanifi_Rohingya_Kinna_Ya
Hanifi_Rohingya_Pa
He
Heh
Heh_Goal
Heth
Kaf
Kaph
Khaph
Knotted_Heh
Lam
Lamadh
2017-06-23 01:13:51 +01:00
Malayalam_Bha
Malayalam_Ja
Malayalam_Lla
Malayalam_Llla
Malayalam_Nga
Malayalam_Nna
Malayalam_Nnna
Malayalam_Nya
Malayalam_Ra
Malayalam_Ssa
Malayalam_Tta
2014-06-28 20:06:07 +01:00
Manichaean_Aleph
Manichaean_Ayin
Manichaean_Beth
Manichaean_Daleth
Manichaean_Dhamedh
Manichaean_Five
Manichaean_Gimel
Manichaean_Heth
Manichaean_Hundred
Manichaean_Kaph
Manichaean_Lamedh
Manichaean_Mem
Manichaean_Nun
Manichaean_One
Manichaean_Pe
Manichaean_Qoph
Manichaean_Resh
Manichaean_Sadhe
Manichaean_Samekh
Manichaean_Taw
Manichaean_Ten
Manichaean_Teth
Manichaean_Thamedh
Manichaean_Twenty
Manichaean_Waw
Manichaean_Yodh
Manichaean_Zayin
Meem
Mim
Noon
No_Joining_Group
Nun
Nya
Pe
Qaf
Qaph
Reh
Reversed_Pe
Rohingya_Yeh
Sad
Sadhe
Seen
Semkath
Shin
2014-06-28 20:06:07 +01:00
Straight_Waw
Swash_Kaf
Syriac_Waw
Tah
Taw
Teh_Marbuta
Teth
Waw
Yeh
Yeh_Barree
Yeh_With_Tail
Yudh
Yudh_He
Zain
Zhain
Joining_Type [jt]
2012-11-20 03:20:20 +00:00
Dual_Joining [D]
Join_Causing [C]
2013-10-12 02:35:02 +01:00
Left_Joining [L]
Non_Joining [U]
2012-11-20 03:20:20 +00:00
Right_Joining [R]
Transparent [T]
Join_Control [Join_C]
No [F, False, N]
Yes [T, True, Y]
Line_Break [lb]
2012-11-20 03:20:20 +00:00
Alphabetic [AL]
Ambiguous [AI]
Break_After [BA]
Break_Before [BB]
Break_Both [B2]
Break_Symbols [SY]
Carriage_Return [CR]
Close_Parenthesis [CP]
Close_Punctuation [CL]
Combining_Mark [CM]
Complex_Context [SA]
Conditional_Japanese_Starter [CJ]
Contingent_Break [CB]
Exclamation [EX]
2016-06-25 00:05:10 +01:00
E_Base [EB]
E_Modifier [EM]
2012-11-20 03:20:20 +00:00
Glue [GL]
H2
H3
2012-11-20 03:20:20 +00:00
Hebrew_Letter [HL]
Hyphen [HY]
Ideographic [ID]
Infix_Numeric [IS]
Inseparable [IN, Inseperable]
JL
2012-11-20 03:20:20 +00:00
JT
JV
2012-11-20 03:20:20 +00:00
Line_Feed [LF]
Mandatory_Break [BK]
Next_Line [NL]
Nonstarter [NS]
Numeric [NU]
Open_Punctuation [OP]
Postfix_Numeric [PO]
Prefix_Numeric [PR]
Quotation [QU]
Regional_Indicator [RI]
Space [SP]
Surrogate [SG]
Unknown [XX]
2012-11-20 03:20:20 +00:00
Word_Joiner [WJ]
2016-06-25 00:05:10 +01:00
ZWJ
2012-11-20 03:20:20 +00:00
ZWSpace [ZW]
Logical_Order_Exception [LOE]
No [F, False, N]
Yes [T, True, Y]
Lowercase [Lower]
No [F, False, N]
Yes [T, True, Y]
Math
No [F, False, N]
Yes [T, True, Y]
NFC_Quick_Check [NFC_QC]
Maybe [M]
No [N]
Yes [Y]
NFD_Quick_Check [NFD_QC]
No [N]
Yes [Y]
NFKC_Quick_Check [NFKC_QC]
Maybe [M]
No [N]
Yes [Y]
NFKD_Quick_Check [NFKD_QC]
No [N]
Yes [Y]
Noncharacter_Code_Point [NChar]
No [F, False, N]
Yes [T, True, Y]
Numeric_Type [nt]
Decimal [De]
2012-11-20 03:20:20 +00:00
Digit [Di]
None
2012-11-20 03:20:20 +00:00
Numeric [Nu]
Numeric_Value [nv]
-1/2
0
1
1/10
1/12
1/16
2016-06-25 00:05:10 +01:00
1/160
1/2
2016-06-25 00:05:10 +01:00
1/20
1/3
2019-06-02 02:32:45 +01:00
1/32
1/320
1/4
2016-06-25 00:05:10 +01:00
1/40
1/5
1/6
2019-06-02 02:32:45 +01:00
1/64
1/7
1/8
2019-06-02 02:32:45 +01:00
1/80
1/9
10
100
1000
10000
100000
2014-06-28 20:06:07 +01:00
1000000
2018-06-05 20:49:44 +01:00
10000000
100000000
2014-06-28 20:06:07 +01:00
10000000000
1000000000000
11
11/12
11/2
12
13
13/2
14
15
15/2
16
17
17/2
18
19
2
2/3
2/5
20
200
2000
20000
200000
2018-06-05 20:49:44 +01:00
20000000
21
2012-11-20 03:20:20 +00:00
216000
22
23
24
25
26
27
28
29
3
3/16
3/2
2016-06-25 00:05:10 +01:00
3/20
3/4
3/5
2019-06-02 02:32:45 +01:00
3/64
3/8
2016-06-25 00:05:10 +01:00
3/80
30
300
3000
30000
300000
31
32
33
34
35
36
37
38
39
4
4/5
40
400
4000
40000
400000
41
42
43
2012-11-20 03:20:20 +00:00
432000
44
45
46
47
48
49
5
5/12
5/2
5/6
5/8
50
500
5000
50000
500000
6
60
600
6000
60000
600000
7
7/12
7/2
7/8
70
700
7000
70000
700000
8
80
800
8000
80000
800000
9
9/2
90
900
9000
90000
900000
NaN
Other_Alphabetic [OAlpha]
No [F, False, N]
Yes [T, True, Y]
Other_Default_Ignorable_Code_Point [ODI]
No [F, False, N]
Yes [T, True, Y]
Other_Grapheme_Extend [OGr_Ext]
No [F, False, N]
Yes [T, True, Y]
Other_ID_Continue [OIDC]
No [F, False, N]
Yes [T, True, Y]
Other_ID_Start [OIDS]
No [F, False, N]
Yes [T, True, Y]
Other_Lowercase [OLower]
No [F, False, N]
Yes [T, True, Y]
Other_Math [OMath]
No [F, False, N]
Yes [T, True, Y]
Other_Uppercase [OUpper]
No [F, False, N]
Yes [T, True, Y]
Pattern_Syntax [Pat_Syn]
No [F, False, N]
Yes [T, True, Y]
Pattern_White_Space [Pat_WS]
No [F, False, N]
Yes [T, True, Y]
Posix_AlNum
No [F, False, N]
Yes [T, True, Y]
Posix_Digit
No [F, False, N]
Yes [T, True, Y]
Posix_Punct
No [F, False, N]
Yes [T, True, Y]
Posix_XDigit
No [F, False, N]
Yes [T, True, Y]
2016-06-25 00:05:10 +01:00
Prepended_Concatenation_Mark [PCM]
No [F, False, N]
Yes [T, True, Y]
Print
No [F, False, N]
Yes [T, True, Y]
Quotation_Mark [QMark]
No [F, False, N]
Yes [T, True, Y]
Radical
No [F, False, N]
Yes [T, True, Y]
2017-06-23 01:13:51 +01:00
Regional_Indicator [RI]
No [F, False, N]
Yes [T, True, Y]
Script [sc]
2016-06-25 00:05:10 +01:00
Adlam [Adlm]
Ahom
Anatolian_Hieroglyphs [Hluw]
Arabic [Arab]
Armenian [Armn]
Avestan [Avst]
Balinese [Bali]
Bamum [Bamu]
2014-06-28 20:06:07 +01:00
Bassa_Vah [Bass]
Batak [Batk]
Bengali [Beng]
2016-06-25 00:05:10 +01:00
Bhaiksuki [Bhks]
Bopomofo [Bopo]
Brahmi [Brah]
Braille [Brai]
Buginese [Bugi]
Buhid [Buhd]
Canadian_Aboriginal [Cans]
Carian [Cari]
2014-06-28 20:06:07 +01:00
Caucasian_Albanian [Aghb]
Chakma [Cakm]
Cham
Cherokee [Cher]
Common [Zyyy]
Coptic [Copt, Qaac]
Cuneiform [Xsux]
Cypriot [Cprt]
Cyrillic [Cyrl]
Deseret [Dsrt]
Devanagari [Deva]
2018-06-05 20:49:44 +01:00
Dogra [Dogr]
2014-06-28 20:06:07 +01:00
Duployan [Dupl]
Egyptian_Hieroglyphs [Egyp]
2014-06-28 20:06:07 +01:00
Elbasan [Elba]
2019-06-02 02:32:45 +01:00
Elymaic [Elym]
Ethiopic [Ethi]
Georgian [Geor]
Glagolitic [Glag]
Gothic [Goth]
2014-06-28 20:06:07 +01:00
Grantha [Gran]
Greek [Grek]
Gujarati [Gujr]
2018-06-05 20:49:44 +01:00
Gunjala_Gondi [Gong]
Gurmukhi [Guru]
Han [Hani]
Hangul [Hang]
2018-06-05 20:49:44 +01:00
Hanifi_Rohingya [Rohg]
Hanunoo [Hano]
Hatran [Hatr]
Hebrew [Hebr]
Hiragana [Hira]
Imperial_Aramaic [Armi]
Inherited [Qaai, Zinh]
Inscriptional_Pahlavi [Phli]
Inscriptional_Parthian [Prti]
Javanese [Java]
Kaithi [Kthi]
Kannada [Knda]
Katakana [Kana]
Katakana_Or_Hiragana [Hrkt]
Kayah_Li [Kali]
Kharoshthi [Khar]
Khmer [Khmr]
2014-06-28 20:06:07 +01:00
Khojki [Khoj]
Khudawadi [Sind]
Lao [Laoo]
Latin [Latn]
Lepcha [Lepc]
Limbu [Limb]
2014-06-28 20:06:07 +01:00
Linear_A [Lina]
Linear_B [Linb]
Lisu
Lycian [Lyci]
Lydian [Lydi]
2014-06-28 20:06:07 +01:00
Mahajani [Mahj]
2018-06-05 20:49:44 +01:00
Makasar [Maka]
Malayalam [Mlym]
Mandaic [Mand]
2014-06-28 20:06:07 +01:00
Manichaean [Mani]
2016-06-25 00:05:10 +01:00
Marchen [Marc]
2017-06-23 01:13:51 +01:00
Masaram_Gondi [Gonm]
2018-06-05 20:49:44 +01:00
Medefaidrin [Medf]
Meetei_Mayek [Mtei]
2014-06-28 20:06:07 +01:00
Mende_Kikakui [Mend]
Meroitic_Cursive [Merc]
Meroitic_Hieroglyphs [Mero]
Miao [Plrd]
2014-06-28 20:06:07 +01:00
Modi
Mongolian [Mong]
2014-06-28 20:06:07 +01:00
Mro [Mroo]
Multani [Mult]
Myanmar [Mymr]
2014-06-28 20:06:07 +01:00
Nabataean [Nbat]
2019-06-02 02:32:45 +01:00
Nandinagari [Nand]
2016-06-25 00:05:10 +01:00
Newa
New_Tai_Lue [Talu]
2012-11-20 03:20:20 +00:00
Nko [Nkoo]
2017-06-23 01:13:51 +01:00
Nushu [Nshu]
2019-06-02 02:32:45 +01:00
Nyiakeng_Puachue_Hmong [Hmnp]
Ogham [Ogam]
Old_Hungarian [Hung]
Old_Italic [Ital]
2014-06-28 20:06:07 +01:00
Old_North_Arabian [Narb]
Old_Permic [Perm]
Old_Persian [Xpeo]
2018-06-05 20:49:44 +01:00
Old_Sogdian [Sogo]
Old_South_Arabian [Sarb]
Old_Turkic [Orkh]
Ol_Chiki [Olck]
Oriya [Orya]
2016-06-25 00:05:10 +01:00
Osage [Osge]
Osmanya [Osma]
2014-06-28 20:06:07 +01:00
Pahawh_Hmong [Hmng]
Palmyrene [Palm]
Pau_Cin_Hau [Pauc]
Phags_Pa [Phag]
Phoenician [Phnx]
Psalter_Pahlavi [Phlp]
Rejang [Rjng]
Runic [Runr]
Samaritan [Samr]
Saurashtra [Saur]
Sharada [Shrd]
Shavian [Shaw]
Siddham [Sidd]
SignWriting [Sgnw]
Sinhala [Sinh]
Sogdian [Sogd]
Sora_Sompeng [Sora]
Soyombo [Soyo]
Sundanese [Sund]
Syloti_Nagri [Sylo]
Syriac [Syrc]
Tagalog [Tglg]
Tagbanwa [Tagb]
Tai_Le [Tale]
Tai_Tham [Lana]
Tai_Viet [Tavt]
Takri [Takr]
Tamil [Taml]
Tangut [Tang]
Telugu [Telu]
Thaana [Thaa]
Thai
Tibetan [Tibt]
Tifinagh [Tfng]
Tirhuta [Tirh]
Ugaritic [Ugar]
Unknown [Zzzz]
Vai [Vaii]
2019-06-02 02:32:45 +01:00
Wancho [Wcho]
Warang_Citi [Wara]
Yi [Yiii]
Zanabazar_Square [Zanb]
Script_Extensions [scx]
Adlam [Adlm]
Adlm Arab Mand Mani Phlp Rohg Sogd Syrc
Ahom
Anatolian_Hieroglyphs [Hluw]
Arab Copt
Arab Rohg
Arab Rohg Syrc Thaa
Arab Syrc
Arab Syrc Thaa
Arab Thaa
Arabic [Arab]
Armenian [Armn]
Armn Geor
Avestan [Avst]
Balinese [Bali]
Bamum [Bamu]
Bassa_Vah [Bass]
Batak [Batk]
Beng Cakm Sylo
Beng Deva
2019-06-02 02:32:45 +01:00
Beng Deva Dogr Gong Gonm Gran Gujr Guru Knda Limb Mahj Mlym Nand Orya Sind Sinh Sylo Takr Taml Telu Tirh
Beng Deva Dogr Gong Gonm Gran Gujr Guru Knda Mahj Mlym Nand Orya Sind Sinh Sylo Takr Taml Telu Tirh
Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Shrd Taml Telu Tirh
Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Taml Telu Tirh
Beng Deva Gran Knda
2019-06-02 02:32:45 +01:00
Beng Deva Gran Knda Nand Orya Telu Tirh
Bengali [Beng]
Bhaiksuki [Bhks]
Bopo Hang Hani Hira Kana
Bopo Hang Hani Hira Kana Yiii
Bopo Hani
Bopomofo [Bopo]
Brahmi [Brah]
Braille [Brai]
Bugi Java
Buginese [Bugi]
Buhd Hano Tagb Tglg
Buhid [Buhd]
Cakm Mymr Tale
Canadian_Aboriginal [Cans]
Carian [Cari]
Caucasian_Albanian [Aghb]
Chakma [Cakm]
Cham
Cherokee [Cher]
Common [Zyyy]
Coptic [Copt, Qaac]
Cprt Lina Linb
Cprt Linb
Cuneiform [Xsux]
Cypriot [Cprt]
Cyrillic [Cyrl]
Cyrl Glag
Cyrl Latn
Cyrl Perm
Deseret [Dsrt]
2019-06-02 02:32:45 +01:00
Deva Dogr Gujr Guru Khoj Knda Kthi Mahj Mlym Modi Nand Sind Takr Tirh
Deva Dogr Gujr Guru Khoj Knda Kthi Mahj Modi Nand Sind Takr Tirh
Deva Dogr Gujr Guru Khoj Kthi Mahj Modi Sind Takr Tirh
Deva Dogr Kthi Mahj
Deva Gran
Deva Gran Knda
Deva Gran Latn
Deva Knda Mlym Orya Taml Telu
2019-06-02 02:32:45 +01:00
Deva Nand
Deva Shrd
Deva Taml
Devanagari [Deva]
Dogra [Dogr]
Duployan [Dupl]
Egyptian_Hieroglyphs [Egyp]
Elbasan [Elba]
2019-06-02 02:32:45 +01:00
Elymaic [Elym]
Ethiopic [Ethi]
Geor Latn
Georgian [Geor]
Glagolitic [Glag]
Gothic [Goth]
Gran Taml
Grantha [Gran]
Greek [Grek]
Gujarati [Gujr]
Gujr Khoj
Gunjala_Gondi [Gong]
Gurmukhi [Guru]
Guru Mult
Han [Hani]
Hangul [Hang]
Hani Hira Kana
Hanifi_Rohingya [Rohg]
Hanunoo [Hano]
Hatran [Hatr]
Hebrew [Hebr]
Hira Kana
Hiragana [Hira]
Imperial_Aramaic [Armi]
Inherited [Qaai, Zinh]
Inscriptional_Pahlavi [Phli]
Inscriptional_Parthian [Prti]
Javanese [Java]
Kaithi [Kthi]
Kali Latn Mymr
Kannada [Knda]
Katakana [Kana]
Kayah_Li [Kali]
Kharoshthi [Khar]
Khmer [Khmr]
Khojki [Khoj]
Khudawadi [Sind]
2019-06-02 02:32:45 +01:00
Knda Nand
Lao [Laoo]
Latin [Latn]
2019-06-02 02:32:45 +01:00
Latn Mong
Lepcha [Lepc]
Limbu [Limb]
Linear_A [Lina]
Linear_B [Linb]
Lisu
Lycian [Lyci]
Lydian [Lydi]
Mahajani [Mahj]
Makasar [Maka]
Malayalam [Mlym]
Mandaic [Mand]
Manichaean [Mani]
Marchen [Marc]
Masaram_Gondi [Gonm]
Medefaidrin [Medf]
Meetei_Mayek [Mtei]
Mende_Kikakui [Mend]
Meroitic_Cursive [Merc]
Meroitic_Hieroglyphs [Mero]
Miao [Plrd]
Modi
Mong Phag
Mongolian [Mong]
Mro [Mroo]
Multani [Mult]
Myanmar [Mymr]
Nabataean [Nbat]
2019-06-02 02:32:45 +01:00
Nandinagari [Nand]
Newa
New_Tai_Lue [Talu]
Nko [Nkoo]
Nushu [Nshu]
2019-06-02 02:32:45 +01:00
Nyiakeng_Puachue_Hmong [Hmnp]
Ogham [Ogam]
Old_Hungarian [Hung]
Old_Italic [Ital]
Old_North_Arabian [Narb]
Old_Permic [Perm]
Old_Persian [Xpeo]
Old_Sogdian [Sogo]
Old_South_Arabian [Sarb]
Old_Turkic [Orkh]
Ol_Chiki [Olck]
Oriya [Orya]
Osage [Osge]
Osmanya [Osma]
Pahawh_Hmong [Hmng]
Palmyrene [Palm]
Pau_Cin_Hau [Pauc]
Phags_Pa [Phag]
Phoenician [Phnx]
2014-06-28 20:06:07 +01:00
Psalter_Pahlavi [Phlp]
Rejang [Rjng]
Runic [Runr]
Samaritan [Samr]
Saurashtra [Saur]
Sharada [Shrd]
Shavian [Shaw]
2014-06-28 20:06:07 +01:00
Siddham [Sidd]
SignWriting [Sgnw]
Sinhala [Sinh]
2018-06-05 20:49:44 +01:00
Sogdian [Sogd]
Sora_Sompeng [Sora]
2017-06-23 01:13:51 +01:00
Soyombo [Soyo]
Sundanese [Sund]
Syloti_Nagri [Sylo]
Syriac [Syrc]
Tagalog [Tglg]
Tagbanwa [Tagb]
Tai_Le [Tale]
Tai_Tham [Lana]
Tai_Viet [Tavt]
Takri [Takr]
Tamil [Taml]
2016-06-25 00:05:10 +01:00
Tangut [Tang]
Telugu [Telu]
Thaana [Thaa]
Thai
Tibetan [Tibt]
Tifinagh [Tfng]
2014-06-28 20:06:07 +01:00
Tirhuta [Tirh]
Ugaritic [Ugar]
Unknown [Zzzz]
Vai [Vaii]
2019-06-02 02:32:45 +01:00
Wancho [Wcho]
2014-06-28 20:06:07 +01:00
Warang_Citi [Wara]
2012-11-20 03:20:20 +00:00
Yi [Yiii]
2017-06-23 01:13:51 +01:00
Zanabazar_Square [Zanb]
Sentence_Break [SB]
ATerm [AT]
Close [CL]
CR
Extend [EX]
Format [FO]
LF
2012-11-20 03:20:20 +00:00
Lower [LO]
Numeric [NU]
OLetter [LE]
Other [XX]
2012-11-20 03:20:20 +00:00
SContinue [SC]
Sep [SE]
2012-11-20 03:20:20 +00:00
Sp
STerm [ST]
Upper [UP]
2016-06-25 00:05:10 +01:00
Sentence_Terminal [STerm]
No [F, False, N]
Yes [T, True, Y]
2016-06-25 00:05:10 +01:00
Soft_Dotted [SD]
No [F, False, N]
Yes [T, True, Y]
Terminal_Punctuation [Term]
No [F, False, N]
Yes [T, True, Y]
Unified_Ideograph [UIdeo]
No [F, False, N]
Yes [T, True, Y]
Uppercase [Upper]
No [F, False, N]
Yes [T, True, Y]
Variation_Selector [VS]
No [F, False, N]
Yes [T, True, Y]
White_Space [space, WSpace]
No [F, False, N]
Yes [T, True, Y]
Word
No [F, False, N]
Yes [T, True, Y]
Word_Break [WB]
ALetter [LE]
CR
2013-10-12 02:35:02 +01:00
Double_Quote [DQ]
Extend
ExtendNumLet [EX]
2016-06-25 00:05:10 +01:00
E_Base [EB]
E_Base_GAZ [EBG]
E_Modifier [EM]
Format [FO]
2016-06-25 00:05:10 +01:00
Glue_After_Zwj [GAZ]
2013-10-12 02:35:02 +01:00
Hebrew_Letter [HL]
Katakana [KA]
LF
MidLetter [ML]
2012-11-20 03:20:20 +00:00
MidNum [MN]
MidNumLet [MB]
2012-11-20 03:20:20 +00:00
Newline [NL]
Numeric [NU]
Other [XX]
2012-11-20 03:20:20 +00:00
Regional_Indicator [RI]
2013-10-12 02:35:02 +01:00
Single_Quote [SQ]
2018-06-05 20:49:44 +01:00
WSegSpace
2016-06-25 00:05:10 +01:00
ZWJ
2012-11-20 03:20:20 +00:00
XDigit
No [F, False, N]
Yes [T, True, Y]
XID_Continue [XIDC]
No [F, False, N]
Yes [T, True, Y]
XID_Start [XIDS]
No [F, False, N]
Yes [T, True, Y]