Version 5.1 of the
Unicode standard was released in
April 4, 2008. The previous version of the standard was
Unicode 5.0, and the next is
Unicode 5.2.
All the gory details can be found at http://www.unicode.org/versions/Unicode5.1.0/
The changes from 5.0 include the following.
New Code Blocks
17 new
code blocks were added in 5.1
U+1B80 to U+1BBF Sundanese 55/64
U+1C00 to U+1C4F Lepcha 74/80
U+1C50 to U+1C7F Ol Chiki 48/48
U+2DE0 to U+2DFF Cyrillic Extended A 32/32
U+A500 to U+A63F Vai 300/320
U+A640 to U+A69F Cyrillic Extended B 78/96
U+A880 to U+A8DF Saurashtra 81/96
U+A900 to U+A92F Kayah Li 48/48
U+A930 to U+A95F Rejang 37/48
U+AA00 to U+AA5F Cham 83/96
U+10190 to U+101CF Ancient Symbols 12/64
U+101D0 to U+101FF Phaistos Disc 46/48
U+10280 to U+1029F Lycian 29/32
U+102A0 to U+102DF Carian 49/64
U+10920 to U+1093F Lydian 27/32
U+1F000 to U+1F02F Mahjong Tiles 44/48
U+1F030 to U+1F09F Domino Tiles 100/112
New Characters
Excluding those in the new
code blocks, there were 481 new characters added in Unicode 5.1
Number of characters in each General Category :
Letter, Uppercase Lu : 67
Letter, Lowercase Ll : 80
Letter, Modifier Lm : 9
Letter, Other Lo : 79
Mark, Non-Spacing Mn : 65
Mark, Spacing Combining Mc : 22
Number, Decimal Digit Nd : 10
Number, Letter Nl : 4
Number, Other No : 13
Punctuation, Dash Pd : 1
Punctuation, Open Ps : 6
Punctuation, Close Pe : 6
Punctuation, Initial quote Pi : 1
Punctuation, Final quote Pf : 1
Punctuation, Other Po : 16
Symbol, Math Sm : 31
Symbol, Modifier Sk : 3
Symbol, Other So : 66
Other, Format Cf : 1
Number of characters in each Bidirectional Category :
Left To Right L :260
Right To Left Arabic AL : 24
European Number Terminator ET : 2
Non Spacing Mark NSM : 65
Boundary Neutral BN : 1
Other Neutral ON :129
The columns below should be interpreted as :
- The Unicode code for the character
- The character in question
- The Unicode name for the character
- The Unicode General Category for the character
- The Unicode Bidirectional Category for the character
If the characters below show up poorly, or not at all, see Unicode Support for possible solutions.
Greek and Coptic
Archaic letters
- U+0370 Ͱ Greek capital letter heta Lu L
- ref U+2C75 Ⱶ Latin capital letter half h (Latin Extended C)
- U+0371 ͱ Greek small letter heta Ll L
- ref U+2C76 ⱶ Latin small letter half h (Latin Extended C)
- U+0372 Ͳ Greek capital letter archaic sampi Lu L
- U+0373 ͳ Greek small letter archaic sampi Ll L
Archaic letters
- U+0376 Ͷ Greek capital letter pamphylian digamma Lu L
- U+0377 ͷ Greek small letter pamphylian digamma Ll L
Variant letterforms
- U+03CF Ϗ Greek capital kai symbol Lu L
- ref U+03D7 ϗ Greek kai symbol (Greek and Coptic)
Cyrillic
Historic miscellaneous
- U+0487 ҇ combining Cyrillic pokrytie Mn NSM
- * used only with letter titlos
- ref U+0311 ̑ combining inverted breve (Combining Diacritical Marks)
- ref U+A66F ꙯ combining Cyrillic vzmet (Cyrillic Extended B)
Cyrillic Supplement
Mordvin letters
- U+0514 Ԕ Cyrillic capital letter lha Lu L
- U+0515 ԕ Cyrillic small letter lha Ll L
- aka voiceless l
- U+0516 Ԗ Cyrillic capital letter rha Lu L
- U+0517 ԗ Cyrillic small letter rha Ll L
- aka voiceless r
- U+0518 Ԙ Cyrillic capital letter yae Lu L
- U+0519 ԙ Cyrillic small letter yae Ll L
Kurdish letters
- U+051A Ԛ Cyrillic capital letter qa Lu L
- U+051B ԛ Cyrillic small letter qa Ll L
- U+051C Ԝ Cyrillic capital letter we Lu L
- U+051D ԝ Cyrillic small letter we Ll L
Aleut letter
- U+051E Ԟ Cyrillic capital letter aleut ka Lu L
- U+051F ԟ Cyrillic small letter aleut ka Ll L
- * used for q in Aleut
Chuvash letters
These are obsolete letters formerly used in Jakovlev's Chuvash orthography.
- U+0520 Ԡ Cyrillic capital letter el with middle hook Lu L
- U+0521 ԡ Cyrillic small letter el with middle hook Ll L
- aka palatalized l
- U+0522 Ԣ Cyrillic capital letter en with middle hook Lu L
- U+0523 ԣ Cyrillic small letter en with middle hook Ll L
- aka palatalized n
Arabic
Radix symbols
- U+0606 ؆ Arabic indic cube root Sm ON
- ref U+221B ∛ cube root (Mathematical Operators)
- U+0607 ؇ Arabic indic fourth root Sm ON
- ref U+221C ∜ fourth root (Mathematical Operators)
Letterlike symbol
- U+0608 ؈ Arabic ray Sm AL
Punctuation
- U+0609 ؉ Arabic indic per mille sign Po ET
- ref U+2030 ‰ per mille sign (General Punctuation)
- U+060A ؊ Arabic indic per ten thousand sign Po ET
- ref U+2031 ‱ per ten thousand sign (General Punctuation)
Koranic annotation signs
- U+0616 ؖ Arabic small high ligature alef with lam with yeh Mn NSM
- U+0617 ؗ Arabic small high zain Mn NSM
- U+0618 ؘ Arabic small fatha Mn NSM
- * should not be confused with 064E FATHA
- U+0619 ؙ Arabic small damma Mn NSM
- * should not be confused with 064F DAMMA
- U+061A ؚ Arabic small kasra Mn NSM
- * should not be confused with 0650 KASRA
Additions for early Persian and Azerbaijani
- U+063B ػ Arabic letter keheh with two dots above Lo AL
- U+063C ؼ Arabic letter keheh with three dots below Lo AL
- U+063D ؽ Arabic letter farsi yeh with inverted v Lo AL
- * Azerbaijani
- U+063E ؾ Arabic letter farsi yeh with two dots above Lo AL
- U+063F ؿ Arabic letter farsi yeh with three dots above Lo AL
Arabic Supplement
Additions for Khowar
- U+076E ݮ Arabic letter hah with small arabic letter tah below Lo AL
- U+076F ݯ Arabic letter hah with small arabic letter tah and two dots Lo AL
- U+0770 ݰ Arabic letter seen with small arabic letter tah and two dots Lo AL
- U+0771 ݱ Arabic letter reh with small arabic letter tah and two dots Lo AL
Addition for Torwali
- U+0772 ݲ Arabic letter hah with small arabic letter tah above Lo AL
Additions for Burushaski
- U+0773 ݳ Arabic letter alef with extended arabic indic digit two above Lo AL
- U+0774 ݴ Arabic letter alef with extended arabic indic digit three above Lo AL
- U+0775 ݵ Arabic letter farsi yeh with extended arabic indic digit two above Lo AL
- U+0776 ݶ Arabic letter farsi yeh with extended arabic indic digit three above Lo AL
- U+0777 ݷ Arabic letter farsi yeh with extended arabic indic digit four below Lo AL
- U+0778 ݸ Arabic letter waw with extended arabic indic digit two above Lo AL
- U+0779 ݹ Arabic letter waw with extended arabic indic digit three above Lo AL
- U+077A ݺ Arabic letter yeh barree with extended arabic indic digit two above Lo AL
- U+077B ݻ Arabic letter yeh barree with extended arabic indic digit three above Lo AL
- U+077C ݼ Arabic letter hah with extended arabic indic digit four below Lo AL
- U+077D ݽ Arabic letter seen with extended arabic indic digit four above Lo AL
Additions for early Persian
- U+077E ݾ Arabic letter seen with inverted v Lo AL
- U+077F ݿ Arabic letter kaf with two dots above Lo AL
Devanagari
Devanagari-specific additions
- U+0971 ॱ Devanagari sign high spacing dot Lm L
Additional vowel for Marathi
- U+0972 ॲ Devanagari letter candra a Lo L
- * Marathi
Gurmukhi
Various signs
- U+0A51 ੑ Gurmukhi sign udaat Mn NSM
Gurmukhi-specific additions
- U+0A75 ੵ Gurmukhi sign yakash Mn NSM
Oriya
Dependent vowel signs
- U+0B44 ୄ Oriya vowel sign vocalic rr Mn NSM
Dependent vowels
- U+0B62 ୢ Oriya vowel sign vocalic l Mn NSM
- U+0B63 ୣ Oriya vowel sign vocalic ll Mn NSM
Tamil
Various signs
- U+0BD0 ௐ Tamil om Lo L
Telugu
Addition for Sanskrit
- U+0C3D ఽ Telugu sign avagraha Lo L
Historic phonetic variants
- U+0C58 ౘ Telugu letter tsa Lo L
- U+0C59 ౙ Telugu letter dza Lo L
Dependent vowels
- U+0C62 ౢ Telugu vowel sign vocalic l Mn NSM
- U+0C63 ౣ Telugu vowel sign vocalic ll Mn NSM
Telugu fractions and weights
- U+0C78 ౸ Telugu fraction digit zero for odd powers of four No ON
- U+0C79 ౹ Telugu fraction digit one for odd powers of four No ON
- U+0C7A ౺ Telugu fraction digit two for odd powers of four No ON
- U+0C7B ౻ Telugu fraction digit three for odd powers of four No ON
- U+0C7C ౼ Telugu fraction digit one for even powers of four No ON
- U+0C7D ౽ Telugu fraction digit two for even powers of four No ON
- U+0C7E ౾ Telugu fraction digit three for even powers of four No ON
- U+0C7F ౿ Telugu sign tuumu So L
Malayalam
Addition for Sanskrit
- U+0D3D ഽ Malayalam sign avagraha Lo L
- aka praslesham
Dependent vowel signs
- U+0D44 ൄ Malayalam vowel sign vocalic rr Mn NSM
Dependent vowels
- U+0D62 ൢ Malayalam vowel sign vocalic l Mn NSM
- U+0D63 ൣ Malayalam vowel sign vocalic ll Mn NSM
Malayalam numerics
- U+0D70 ൰ Malayalam number ten No L
- U+0D71 ൱ Malayalam number one hundred No L
- U+0D72 ൲ Malayalam number one thousand No L
Fractions
- U+0D73 ൳ Malayalam fraction one quarter No L
- U+0D74 ൴ Malayalam fraction one half No L
- U+0D75 ൵ Malayalam fraction three quarters No L
Date mark
- U+0D79 ൹ Malayalam date mark So L
Chillu letters
- U+0D7A ൺ Malayalam letter chillu nn Lo L
- U+0D7B ൻ Malayalam letter chillu n Lo L
- U+0D7C ർ Malayalam letter chillu rr Lo L
- * historically derived from the full letter ra
- * also used for chillu r
- U+0D7D ൽ Malayalam letter chillu l Lo L
- * historically derived from the full letter ta
- * used for chillu t and chillu d
- U+0D7E ൾ Malayalam letter chillu ll Lo L
- U+0D7F ൿ Malayalam letter chillu k Lo L
Tibetan
Extensions for Balti
- U+0F6B ཫ Tibetan letter kka Lo L
- U+0F6C ཬ Tibetan letter rra Lo L
Astrological signs
- U+0FCE ࿎ Tibetan sign rdel nag rdel dkar So L
- aka dena deka
- * signifies good luck earlier, bad luck later
Marks
- U+0FD2 ࿒ Tibetan mark nyis tsheg Po L
- aka nyi tsek
Head marks
- U+0FD3 ࿓ Tibetan mark initial brda rnying yig mgo mdun ma Po L
- aka da nying yik go dun ma
- U+0FD4 ࿔ Tibetan mark closing brda rnying yig mgo sgab ma Po L
- aka da nying yik go kab ma
Myanmar
Independent vowels
- U+1022 ဢ Myanmar letter shan a Lo L
- U+1028 ဨ Myanmar letter mon e Lo L
Dependent vowel signs
- U+102B ါ Myanmar vowel sign tall aa Mc L
- U+1033 ဳ Myanmar vowel sign mon ii Mn NSM
- U+1034 ဴ Myanmar vowel sign mon o Mn NSM
- U+1035 ဵ Myanmar vowel sign e above Mn NSM
Various signs
- U+103A ် Myanmar sign asat Mn NSM
- aka killer (always rendered visibly)
Dependent consonant signs
- U+103B ျ Myanmar consonant sign medial ya Mc L
- U+103C ြ Myanmar consonant sign medial ra Mc L
- U+103D ွ Myanmar consonant sign medial wa Mn NSM
- U+103E ှ Myanmar consonant sign medial ha Mn NSM
Consonant
- U+103F ဿ Myanmar letter great sa Lo L
Extensions for Mon
- U+105A ၚ Myanmar letter mon nga Lo L
- U+105B ၛ Myanmar letter mon jha Lo L
- U+105C ၜ Myanmar letter mon bba Lo L
- U+105D ၝ Myanmar letter mon bbe Lo L
- U+105E ၞ Myanmar consonant sign mon medial na Mn NSM
- U+105F ၟ Myanmar consonant sign mon medial ma Mn NSM
- U+1060 ၠ Myanmar consonant sign mon medial la Mn NSM
Extensions for S'gaw Karen
- U+1061 ၡ Myanmar letter sgaw karen sha Lo L
- U+1062 ၢ Myanmar vowel sign sgaw karen eu Mc L
- U+1063 ၣ Myanmar tone mark sgaw karen hathi Mc L
- U+1064 ၤ Myanmar tone mark sgaw karen ke pho Mc L
Extensions for Western Pwo Karen
- U+1065 ၥ Myanmar letter western pwo karen tha Lo L
- U+1066 ၦ Myanmar letter western pwo karen pwa Lo L
- U+1067 ၧ Myanmar vowel sign western pwo karen eu Mc L
- U+1068 ၨ Myanmar vowel sign western pwo karen ue Mc L
- U+1069 ၩ Myanmar sign western pwo karen tone 1 Mc L
- U+106A ၪ Myanmar sign western pwo karen tone 2 Mc L
- U+106B ၫ Myanmar sign western pwo karen tone 3 Mc L
- U+106C ၬ Myanmar sign western pwo karen tone 4 Mc L
- U+106D ၭ Myanmar sign western pwo karen tone 5 Mc L
Extensions for Eastern Pwo Karen
- U+106E ၮ Myanmar letter eastern pwo karen nna Lo L
- U+106F ၯ Myanmar letter eastern pwo karen ywa Lo L
- U+1070 ၰ Myanmar letter eastern pwo karen ghwa Lo L
Extension for Geba Karen
- U+1071 ၱ Myanmar vowel sign geba karen i Mn NSM
Extensions for Kayah
- U+1072 ၲ Myanmar vowel sign kayah oe Mn NSM
- U+1073 ၳ Myanmar vowel sign kayah u Mn NSM
- U+1074 ၴ Myanmar vowel sign kayah ee Mn NSM
Extensions for Shan
- U+1075 ၵ Myanmar letter shan ka Lo L
- U+1076 ၶ Myanmar letter shan kha Lo L
- U+1077 ၷ Myanmar letter shan ga Lo L
- U+1078 ၸ Myanmar letter shan ca Lo L
- U+1079 ၹ Myanmar letter shan za Lo L
- U+107A ၺ Myanmar letter shan nya Lo L
- U+107B ၻ Myanmar letter shan da Lo L
- U+107C ၼ Myanmar letter shan na Lo L
- U+107D ၽ Myanmar letter shan pha Lo L
- U+107E ၾ Myanmar letter shan fa Lo L
- U+107F ၿ Myanmar letter shan ba Lo L
- U+1080 ႀ Myanmar letter shan tha Lo L
- U+1081 ႁ Myanmar letter shan ha Lo L
- U+1082 ႂ Myanmar consonant sign shan medial wa Mn NSM
- U+1083 ႃ Myanmar vowel sign shan aa Mc L
- U+1084 ႄ Myanmar vowel sign shan e Mc L
- U+1085 ႅ Myanmar vowel sign shan e above Mn NSM
- U+1086 ႆ Myanmar vowel sign shan final y Mn NSM
- U+1087 ႇ Myanmar sign shan tone 2 Mc L
- U+1088 ႈ Myanmar sign shan tone 3 Mc L
- U+1089 ႉ Myanmar sign shan tone 5 Mc L
- U+108A ႊ Myanmar sign shan tone 6 Mc L
- U+108B ႋ Myanmar sign shan council tone 2 Mc L
- U+108C ႌ Myanmar sign shan council tone 3 Mc L
- U+108D ႍ Myanmar sign shan council emphatic tone Mn NSM
Extensions for Rumai Palaung
- U+108E ႎ Myanmar letter rumai palaung fa Lo L
- U+108F ႏ Myanmar sign rumai palaung tone 5 Mc L
Shan digits
- U+1090 ႐ Myanmar shan digit zero Nd L
- U+1091 ႑ Myanmar shan digit one Nd L
- U+1092 ႒ Myanmar shan digit two Nd L
- U+1093 ႓ Myanmar shan digit three Nd L
- U+1094 ႔ Myanmar shan digit four Nd L
- U+1095 ႕ Myanmar shan digit five Nd L
- U+1096 ႖ Myanmar shan digit six Nd L
- U+1097 ႗ Myanmar shan digit seven Nd L
- U+1098 ႘ Myanmar shan digit eight Nd L
- U+1099 ႙ Myanmar shan digit nine Nd L
Shan symbols
- U+109E ႞ Myanmar symbol shan one So L
- U+109F ႟ Myanmar symbol shan exclamation So L
Mongolian
Extensions for Sanskrit and Tibetan
- U+18AA ᢪ Mongolian letter Manchu Ali Gali lha Lo L
Combining Diacritical Marks Supplement
Contour tone marks
- U+1DCB ᷋ combining breve macron Mn NSM
- * Lithuanian dialectology
- U+1DCC ᷌ combining macron breve Mn NSM
- * Lithuanian dialectology
Double diacritic
- U+1DCD ᷍ combining double circumflex above Mn NSM
Medievalist additions
- U+1DCE ᷎ combining ogonek above Mn NSM
- U+1DCF ᷏ combining zigzag below Mn NSM
- U+1DD0 ᷐ combining is below Mn NSM
- U+1DD1 ᷑ combining ur above Mn NSM
- U+1DD2 ᷒ combining us above Mn NSM
Medieval superscript letter diacritics
- U+1DD3 ᷓ combining Latin small letter flattened open a above Mn NSM
- U+1DD4 ᷔ combining Latin small letter ae Mn NSM
- U+1DD5 ᷕ combining Latin small letter ao Mn NSM
- U+1DD6 ᷖ combining Latin small letter av Mn NSM
- U+1DD7 ᷗ combining Latin small letter C cedilla Mn NSM
- U+1DD8 ᷘ combining Latin small letter insular d Mn NSM
- U+1DD9 ᷙ combining Latin small letter eth Mn NSM
- U+1DDA ᷚ combining Latin small letter G Mn NSM
- U+1DDB ᷛ combining Latin letter small capital g Mn NSM
- U+1DDC ᷜ combining Latin small letter K Mn NSM
- U+1DDD ᷝ combining Latin small letter L Mn NSM
- U+1DDE ᷞ combining Latin letter small capital l Mn NSM
- U+1DDF ᷟ combining Latin letter small capital m Mn NSM
- U+1DE0 ᷠ combining Latin small letter N Mn NSM
- U+1DE1 ᷡ combining Latin letter small capital n Mn NSM
- U+1DE2 ᷢ combining Latin letter small capital r Mn NSM
- U+1DE3 ᷣ combining Latin small letter R rotunda Mn NSM
- U+1DE4 ᷤ combining Latin small letter S Mn NSM
- U+1DE5 ᷥ combining Latin small letter long s Mn NSM
- U+1DE6 ᷦ combining Latin small letter Z Mn NSM
Latin Extended Additional
Medievalist additions
- U+1E9C ẜ Latin small letter long s with diagonal stroke Ll L
- U+1E9D ẝ Latin small letter long s with high stroke Ll L
Addition for German typography
- U+1E9E ẞ Latin capital letter sharp s Lu L
- * does not casemap to 00DF
- ref U+00DF ß Latin small letter sharp s (Latin-1 Supplement)
Medievalist addition
- U+1E9F ẟ Latin small letter delta Ll L
Medievalist additions
- U+1EFA Ỻ Latin capital letter middle welsh ll Lu L
- U+1EFB ỻ Latin small letter middle welsh ll Ll L
- U+1EFC Ỽ Latin capital letter middle welsh v Lu L
- U+1EFD ỽ Latin small letter middle welsh v Ll L
- U+1EFE Ỿ Latin capital letter Y with loop Lu L
- U+1EFF ỿ Latin small letter Y with loop Ll L
General Punctuation
Invisible operators
- U+2064 invisible plus Cf BN
- * contiguity operator indicating addition
Combining Diacritical Marks for Symbols
Additional diacritical marks for symbols
- U+20F0 ⃰ combining asterisk above Mn NSM
Letterlike Symbols
Biblical editorial symbol
- U+214F ⅏ symbol for samaritan source So L
Number Forms
Archaic Roman numerals
- U+2185 ↅ Roman numeral six late form Nl L
- U+2186 ↆ Roman numeral fifty early form Nl L
- U+2187 ↇ Roman numeral fifty thousand Nl L
- U+2188 ↈ Roman numeral one hundred thousand Nl L
Miscellaneous Symbols
Miscellaneous symbols
- U+269D ⚝ outlined white star So ON
- * symbol of Morocco
Astrological signs
- U+26B3 ⚳ ceres So ON
- U+26B4 ⚴ pallas So ON
- U+26B5 ⚵ juno So ON
- U+26B6 ⚶ vesta So ON
- U+26B7 ⚷ chiron So ON
- U+26B8 ⚸ black moon lilith So ON
- U+26B9 ⚹ sextile So ON
- ref U+002A * asterisk (Basic Latin)
- U+26BA ⚺ semisextile So ON
- ref U+22BB ⊻ xor (Mathematical Operators)
- U+26BB ⚻ quincunx So ON
- ref U+22BC ⊼ nand (Mathematical Operators)
- U+26BC ⚼ sesquiquadrate So ON
Symbols for draughts and checkers
- U+26C0 ⛀ white draughts man So ON
- U+26C1 ⛁ white draughts king So ON
- U+26C2 ⛂ black draughts man So ON
- U+26C3 ⛃ black draughts king So ON
Miscellaneous Mathematical Symbols A
Division operator
- U+27CC ⟌ long division Sm ON
- * graphically extends over the dividend
- ref U+00F7 ÷ division sign (Latin-1 Supplement)
- ref U+2215 ∕ division slash (Mathematical Operators)
- ref U+221A √ square root (Mathematical Operators)
Mathematical brackets
- U+27EC ⟬ mathematical left white tortoise shell bracket Ps ON
- ref U+2997 ⦗ left black tortoise shell bracket (Miscellaneous Mathematical Symbols B)
- ref U+3018 〘 left white tortoise shell bracket (CJK Symbols and Punctuation)
- U+27ED ⟭ mathematical right white tortoise shell bracket Pe ON
- ref U+2998 ⦘ right black tortoise shell bracket (Miscellaneous Mathematical Symbols B)
- ref U+3019 〙 right white tortoise shell bracket (CJK Symbols and Punctuation)
- U+27EE ⟮ mathematical left flattened parenthesis Ps ON
- aka lgroup
- U+27EF ⟯ mathematical right flattened parenthesis Pe ON
- aka rgroup
Miscellaneous Symbols and Arrows
Squares
- U+2B1B ⬛ black large square So ON
- ref U+25A0 ■ black square (Geometric Shapes)
- U+2B1C ⬜ white large square So ON
- ref U+25A1 □ white square (Geometric Shapes)
- U+2B1D ⬝ black very small square So ON
- ref U+25AA ▪ black small square (Geometric Shapes)
- U+2B1E ⬞ white very small square So ON
- ref U+25AB ▫ white small square (Geometric Shapes)
Pentagons
- U+2B1F ⬟ black pentagon So ON
Circle
- U+2B24 ⬤ black large circle So ON
- ref U+25CF ● black circle (Geometric Shapes)
- ref U+25EF ◯ large circle (Geometric Shapes)
Diamonds and lozenges
- U+2B25 ⬥ black medium diamond So ON
- ref U+25C6 ◆ black diamond (Geometric Shapes)
- U+2B26 ⬦ white medium diamond So ON
- U+2B27 ⬧ black medium lozenge So ON
- U+2B28 ⬨ white medium lozenge So ON
- ref U+25CA ◊ lozenge (Geometric Shapes)
- U+2B29 ⬩ black small diamond So ON
- ref U+22C4 ⋄ diamond operator (Mathematical Operators)
- U+2B2A ⬪ black small lozenge So ON
- U+2B2B ⬫ white small lozenge So ON
Ellipses
- U+2B2C ⬬ black horizontal ellipse So ON
- U+2B2D ⬭ white horizontal ellipse So ON
- U+2B2E ⬮ black vertical ellipse So ON
- U+2B2F ⬯ white vertical ellipse So ON
Mathematical arrows
These provide the opposite direction complement for arrows for mathermatical use not originally encoded in both a leftwards and rightwards direction.
- U+2B30 ⬰ left arrow with small circle Sm ON
- ref U+21F4 ⇴ right arrow with small circle (Arrows)
- U+2B31 ⬱ three leftwards arrows Sm ON
- ref U+21F6 ⇶ three rightwards arrows (Arrows)
- U+2B32 ⬲ left arrow with circled plus Sm ON
- ref U+27F4 ⟴ right arrow with circled plus (Supplemental Arrows A)
- U+2B33 ⬳ long leftwards squiggle arrow Sm ON
- ref U+27FF ⟿ long rightwards squiggle arrow (Supplemental Arrows A)
- ref U+21DC ⇜ leftwards squiggle arrow (Arrows)
- U+2B34 ⬴ leftwards two headed arrow with vertical stroke Sm ON
- ref U+2900 ⤀ rightwards two headed arrow with vertical stroke (Supplemental Arrows B)
- U+2B35 ⬵ leftwards two headed arrow with double vertical stroke Sm ON
- ref U+2901 ⤁ rightwards two headed arrow with double vertical stroke (Supplemental Arrows B)
- U+2B36 ⬶ leftwards two headed arrow from bar Sm ON
- ref U+2905 ⤅ rightwards two headed arrow from bar (Supplemental Arrows B)
- U+2B37 ⬷ leftwards two headed triple dash arrow Sm ON
- ref U+2910 ⤐ rightwards two headed triple dash arrow (Supplemental Arrows B)
- U+2B38 ⬸ leftwards arrow with dotted stem Sm ON
- ref U+2911 ⤑ rightwards arrow with dotted stem (Supplemental Arrows B)
- U+2B39 ⬹ leftwards arrow with tail with vertical stroke Sm ON
- ref U+2914 ⤔ rightwards arrow with tail with vertical stroke (Supplemental Arrows B)
- U+2B3A ⬺ leftwards arrow with tail with double vertical stroke Sm ON
- ref U+2915 ⤕ rightwards arrow with tail with double vertical stroke (Supplemental Arrows B)
- U+2B3B ⬻ leftwards two headed arrow with tail Sm ON
- ref U+2916 ⤖ rightwards two headed arrow with tail (Supplemental Arrows B)
- U+2B3C ⬼ leftwards two headed arrow with tail with vertical stroke Sm ON
- ref U+2917 ⤗ rightwards two headed arrow with tail with vertical stroke (Supplemental Arrows B)
- U+2B3D ⬽ leftwards two headed arrow with tail with double vertical stroke Sm ON
- ref U+2918 ⤘ rightwards two headed arrow with tail with double vertical stroke (Supplemental Arrows B)
- U+2B3E ⬾ leftwards arrow through x Sm ON
- ref U+2947 ⥇ rightwards arrow through x (Supplemental Arrows B)
- U+2B3F ⬿ wave arrow pointing directly left Sm ON
- ref U+2933 ⤳ wave arrow pointing directly right (Supplemental Arrows B)
- ref U+219C ↜ leftwards wave arrow (Arrows)
- U+2B40 ⭀ equals sign above leftwards arrow Sm ON
- ref U+2971 ⥱ equals sign above rightwards arrow (Supplemental Arrows B)
- U+2B41 ⭁ reverse tilde operator above leftwards arrow Sm ON
- * mirror image of "2972"
- ref U+2972 ⥲ tilde operator above rightwards arrow (Supplemental Arrows B)
- U+2B42 ⭂ leftwards arrow above reverse almost equal to Sm ON
- * mirror image of "2975"
- ref U+2975 ⥵ rightwards arrow above almost equal to (Supplemental Arrows B)
- U+2B43 ⭃ rightwards arrow through greater than Sm ON
- * mirror image of "2977"
- ref U+2977 ⥷ leftwards arrow through less than (Supplemental Arrows B)
- U+2B44 ⭄ rightwards arrow through superset Sm ON
- * mirror image of "297A"
- ref U+297A ⥺ leftwards arrow through subset (Supplemental Arrows B)
- U+2B45 ⭅ leftwards quadruple arrow So ON
- ref U+27F0 ⟰ upwards quadruple arrow (Supplemental Arrows A)
- U+2B46 ⭆ rightwards quadruple arrow So ON
- U+2B47 ⭇ reverse tilde operator above rightwards arrow Sm ON
- U+2B48 ⭈ rightwards arrow above reverse almost equal to Sm ON
- U+2B49 ⭉ tilde operator above leftwards arrow Sm ON
- U+2B4A ⭊ leftwards arrow above almost equal to Sm ON
- U+2B4B ⭋ leftwards arrow above reverse tilde operator Sm ON
- * mirror image of "2974"
- ref U+2974 ⥴ rightwards arrow above tilde operator (Supplemental Arrows B)
- U+2B4C ⭌ rightwards arrow above reverse tilde operator Sm ON
- * mirror image of "2973"
- ref U+2973 ⥳ leftwards arrow above tilde operator (Supplemental Arrows B)
Stars
- U+2B50 ⭐ white medium star So ON
- ref U+22C6 ⋆ star operator (Mathematical Operators)
- U+2B51 ⭑ black small star So ON
- ref U+066D ٭ Arabic five pointed star (Arabic)
- U+2B52 ⭒ white small star So ON
Pentagons
- U+2B53 ⭓ black right pointing pentagon So ON
- U+2B54 ⭔ white right pointing pentagon So ON
Latin Extended C
Miscellaneous additions
- U+2C6D Ɑ Latin capital letter alpha Lu L
- * lowercase is 0251
- U+2C6E Ɱ Latin capital letter M with hook Lu L
- * lowercase is 0271
- U+2C6F Ɐ Latin capital letter turned a Lu L
- * lowercase is 0250
- U+2C71 ⱱ Latin small letter V with right hook Ll L
- U+2C72 Ⱳ Latin capital letter W with hook Lu L
- U+2C73 ⱳ Latin small letter W with hook Ll L
Additions for UPA
- U+2C78 ⱸ Latin small letter E with notch Ll L
- U+2C79 ⱹ Latin small letter turned r with tail Ll L
- U+2C7A ⱺ Latin small letter O with low ring inside Ll L
- U+2C7B ⱻ Latin letter small capital turned e Ll L
- U+2C7C ⱼ Latin subscript small letter J Ll L
- U+2C7D ⱽ modifier letter capital v Lm L
Supplemental Punctuation
General punctuation
- U+2E18 ⸘ inverted interrobang Po ON
- aka gnaborretni
- ref U+203D ‽ interrobang (General Punctuation)
- U+2E19 ⸙ palm branch Po ON
- * used as a separator
Dictionary punctuation
These punctuation marks are used mostly in German dictionaries, to indicate umlaut or case changes with abbreviated stems.
- U+2E1A ⸚ hyphen with diaeresis Pd ON
- * indicates umlaut of the stem vowel of a plural form
- U+2E1B ⸛ tilde with ring above Po ON
- * indicates change in case for derived form
Dictionary punctuation
- U+2E1E ⸞ tilde with dot above Po ON
- * indicates derived form changes to uppercase
- U+2E1F ⸟ tilde with dot below Po ON
- * indicates derived form changes to lowercase
Brackets
- U+2E20 ⸠ left vertical bar with quill Pi ON
- U+2E21 ⸡ right vertical bar with quill Pf ON
Half brackets
These form a set of four corner brackets and are used editorially. They are distinguished from mathematical floor and ceiling characters. Occasionally quine corners are substituted for half brackets.
- U+2E22 ⸢ top left half bracket Ps ON
- ref U+2308 ⌈ left ceiling (Miscellaneous Technical)
- ref U+231C ⌜ top left corner (Miscellaneous Technical)
- ref U+300C 「 left corner bracket (CJK Symbols and Punctuation)
- U+2E23 ⸣ top right half bracket Pe ON
- U+2E24 ⸤ bottom left half bracket Ps ON
- U+2E25 ⸥ bottom right half bracket Pe ON
Brackets
- U+2E26 ⸦ left sideways u bracket Ps ON
- ref U+2282 ⊂ subset of (Mathematical Operators)
- U+2E27 ⸧ right sideways u bracket Pe ON
- ref U+2283 ⊃ superset of (Mathematical Operators)
- U+2E28 ⸨ left double parenthesis Ps ON
- ref U+2985 ⦅ left white parenthesis (Miscellaneous Mathematical Symbols B)
- ref U+FF5F ⦅ fullwidth left white parenthesis (Halfwidth and Fullwidth Forms)
- U+2E29 ⸩ right double parenthesis Pe ON
Medievalist punctuation
- U+2E2A ⸪ two dots over one dot punctuation Po ON
- U+2E2B ⸫ one dot over two dots punctuation Po ON
- U+2E2C ⸬ squared four dot punctuation Po ON
- U+2E2D ⸭ five dot mark Po ON
- U+2E2E ⸮ reversed question mark Po ON
- aka punctus percontativus
- ref U+003F ? question mark (Basic Latin)
- ref U+00BF ¿ inverted question mark (Latin-1 Supplement)
- ref U+061F ؟ Arabic question mark (Arabic)
- U+2E2F ⸯ vertical tilde Lm ON
- * used for Cyrillic yerik
- ref U+033E ̾ combining vertical tilde (Combining Diacritical Marks)
- ref U+A67F ꙿ Cyrillic payerok (Cyrillic Extended B)
- U+2E30 ⸰ ring point Po ON
- * used in Avestan
- ref U+2218 ∘ ring operator (Mathematical Operators)
- ref U+25E6 ◦ white bullet (Geometric Shapes)
Bopomofo
Miscellaneous addition
- U+312D ㄭ Bopomofo letter ih Lo L
- * for analytic representation of apical vowel
CJK Strokes
CJK strokes
- U+31D0 ㇐ CJK stroke h So ON
- U+31D1 ㇑ CJK stroke s So ON
- U+31D2 ㇒ CJK stroke p So ON
- U+31D3 ㇓ CJK stroke sp So ON
- U+31D4 ㇔ CJK stroke d So ON
- U+31D5 ㇕ CJK stroke hz So ON
- U+31D6 ㇖ CJK stroke hg So ON
- U+31D7 ㇗ CJK stroke sz So ON
- U+31D8 ㇘ CJK stroke swz So ON
- U+31D9 ㇙ CJK stroke st So ON
- U+31DA ㇚ CJK stroke sg So ON
- U+31DB ㇛ CJK stroke pd So ON
- U+31DC ㇜ CJK stroke pz So ON
- U+31DD ㇝ CJK stroke tn So ON
- U+31DE ㇞ CJK stroke szz So ON
- U+31DF ㇟ CJK stroke swg So ON
- U+31E0 ㇠ CJK stroke hxwg So ON
- U+31E1 ㇡ CJK stroke hzzzg So ON
- U+31E2 ㇢ CJK stroke pg So ON
- U+31E3 ㇣ CJK stroke q So ON
CJK Unified Ideographs
- U+9FBC 龼 CJK Ideograph 9FBC Lo L
- U+9FBD 龽 CJK Ideograph 9FBD Lo L
- U+9FBE 龾 CJK Ideograph 9FBE Lo L
- U+9FBF 龿 CJK Ideograph 9FBF Lo L
- U+9FC0 鿀 CJK Ideograph 9FC0 Lo L
- U+9FC1 鿁 CJK Ideograph 9FC1 Lo L
- U+9FC2 鿂 CJK Ideograph 9FC2 Lo L
- U+9FC3 鿃 CJK Ideograph 9FC3 Lo L
Modifier Tone Letters
Africanist tone letters
- U+A71B ꜛ modifier letter raised up arrow Lm ON
- U+A71C ꜜ modifier letter raised down arrow Lm ON
- U+A71D ꜝ modifier letter raised exclamation mark Lm ON
- U+A71E ꜞ modifier letter raised inverted exclamation mark Lm ON
- U+A71F ꜟ modifier letter low inverted exclamation mark Lm ON
Latin Extended D
Egyptological additions
- U+A722 Ꜣ Latin capital letter egyptological alef Lu L
- U+A723 ꜣ Latin small letter egyptological alef Ll L
- U+A724 Ꜥ Latin capital letter egyptological ain Lu L
- U+A725 ꜥ Latin small letter egyptological ain Ll L
- * this is a case pair
- ref U+1D25 ᴥ Latin letter ain (Phonetic Extensions)
- ref U+1D5C ᵜ modifier letter small ain (Phonetic Extensions)
Mayanist additions
- U+A726 Ꜧ Latin capital letter heng Lu L
- U+A727 ꜧ Latin small letter heng Ll L
- U+A728 Ꜩ Latin capital letter tz Lu L
- U+A729 ꜩ Latin small letter tz Ll L
- U+A72A Ꜫ Latin capital letter tresillo Lu L
- U+A72B ꜫ Latin small letter tresillo Ll L
- U+A72C Ꜭ Latin capital letter cuatrillo Lu L
- U+A72D ꜭ Latin small letter cuatrillo Ll L
- U+A72E Ꜯ Latin capital letter cuatrillo with comma Lu L
- U+A72F ꜯ Latin small letter cuatrillo with comma Ll L
Medievalist additions
- U+A730 ꜰ Latin letter small capital f Ll L
- U+A731 ꜱ Latin letter small capital s Ll L
- U+A732 Ꜳ Latin capital letter aa Lu L
- U+A733 ꜳ Latin small letter aa Ll L
- U+A734 Ꜵ Latin capital letter ao Lu L
- U+A735 ꜵ Latin small letter ao Ll L
- U+A736 Ꜷ Latin capital letter au Lu L
- U+A737 ꜷ Latin small letter au Ll L
- U+A738 Ꜹ Latin capital letter av Lu L
- U+A739 ꜹ Latin small letter av Ll L
- U+A73A Ꜻ Latin capital letter av with horizontal bar Lu L
- U+A73B ꜻ Latin small letter av with horizontal bar Ll L
- U+A73C Ꜽ Latin capital letter ay Lu L
- U+A73D ꜽ Latin small letter ay Ll L
- U+A73E Ꜿ Latin capital letter reversed c with dot Lu L
- U+A73F ꜿ Latin small letter reversed c with dot Ll L
- U+A740 Ꝁ Latin capital letter K with stroke Lu L
- U+A741 ꝁ Latin small letter K with stroke Ll L
- U+A742 Ꝃ Latin capital letter K with diagonal stroke Lu L
- U+A743 ꝃ Latin small letter K with diagonal stroke Ll L
- U+A744 Ꝅ Latin capital letter K with stroke and diagonal stroke Lu L
- U+A745 ꝅ Latin small letter K with stroke and diagonal stroke Ll L
- U+A746 Ꝇ Latin capital letter broken l Lu L
- U+A747 ꝇ Latin small letter broken l Ll L
- U+A748 Ꝉ Latin capital letter L with high stroke Lu L
- U+A749 ꝉ Latin small letter L with high stroke Ll L
- U+A74A Ꝋ Latin capital letter O with long stroke overlay Lu L
- U+A74B ꝋ Latin small letter O with long stroke overlay Ll L
- U+A74C Ꝍ Latin capital letter O with loop Lu L
- U+A74D ꝍ Latin small letter O with loop Ll L
- U+A74E Ꝏ Latin capital letter oo Lu L
- U+A74F ꝏ Latin small letter oo Ll L
- U+A750 Ꝑ Latin capital letter P with stroke through descender Lu L
- U+A751 ꝑ Latin small letter P with stroke through descender Ll L
- U+A752 Ꝓ Latin capital letter P with flourish Lu L
- U+A753 ꝓ Latin small letter P with flourish Ll L
- U+A754 Ꝕ Latin capital letter P with squirrel tail Lu L
- U+A755 ꝕ Latin small letter P with squirrel tail Ll L
- U+A756 Ꝗ Latin capital letter Q with stroke through descender Lu L
- U+A757 ꝗ Latin small letter Q with stroke through descender Ll L
- U+A758 Ꝙ Latin capital letter Q with diagonal stroke Lu L
- U+A759 ꝙ Latin small letter Q with diagonal stroke Ll L
- U+A75A Ꝛ Latin capital letter R rotunda Lu L
- U+A75B ꝛ Latin small letter R rotunda Ll L
- U+A75C Ꝝ Latin capital letter rum rotunda Lu L
- U+A75D ꝝ Latin small letter rum rotunda Ll L
- U+A75E Ꝟ Latin capital letter V with diagonal stroke Lu L
- U+A75F ꝟ Latin small letter V with diagonal stroke Ll L
- U+A760 Ꝡ Latin capital letter vy Lu L
- U+A761 ꝡ Latin small letter vy Ll L
- U+A762 Ꝣ Latin capital letter visigothic z Lu L
- U+A763 ꝣ Latin small letter visigothic z Ll L
- U+A764 Ꝥ Latin capital letter thorn with stroke Lu L
- U+A765 ꝥ Latin small letter thorn with stroke Ll L
- U+A766 Ꝧ Latin capital letter thorn with stroke through descender Lu L
- U+A767 ꝧ Latin small letter thorn with stroke through descender Ll L
- U+A768 Ꝩ Latin capital letter vend Lu L
- U+A769 ꝩ Latin small letter vend Ll L
- U+A76A Ꝫ Latin capital letter et Lu L
- U+A76B ꝫ Latin small letter et Ll L
- U+A76C Ꝭ Latin capital letter is Lu L
- U+A76D ꝭ Latin small letter is Ll L
- U+A76E Ꝯ Latin capital letter con Lu L
- U+A76F ꝯ Latin small letter con Ll L
- U+A770 ꝰ modifier letter us Lm L
- U+A771 ꝱ Latin small letter dum Ll L
- U+A772 ꝲ Latin small letter lum Ll L
- U+A773 ꝳ Latin small letter mum Ll L
- U+A774 ꝴ Latin small letter num Ll L
- U+A775 ꝵ Latin small letter rum Ll L
- U+A776 ꝶ Latin letter small capital rum Ll L
- U+A777 ꝷ Latin small letter tum Ll L
- U+A778 ꝸ Latin small letter um Ll L
Insular and Celticist letters
- U+A779 Ꝺ Latin capital letter insular d Lu L
- U+A77A ꝺ Latin small letter insular d Ll L
- U+A77B Ꝼ Latin capital letter insular f Lu L
- U+A77C ꝼ Latin small letter insular f Ll L
- U+A77D Ᵹ Latin capital letter insular g Lu L
- * lowercase is 1D79
- U+A77E Ꝿ Latin capital letter turned insular g Lu L
- U+A77F ꝿ Latin small letter turned insular g Ll L
- U+A780 Ꞁ Latin capital letter turned l Lu L
- U+A781 ꞁ Latin small letter turned l Ll L
- U+A782 Ꞃ Latin capital letter insular r Lu L
- U+A783 ꞃ Latin small letter insular r Ll L
- U+A784 Ꞅ Latin capital letter insular s Lu L
- U+A785 ꞅ Latin small letter insular s Ll L
- U+A786 Ꞇ Latin capital letter insular t Lu L
- U+A787 ꞇ Latin small letter insular t Ll L
Modifier letters
- U+A788 ꞈ modifier letter low circumflex accent Sk ON
- U+A789 ꞉ modifier letter colon Sk L
- * used as a tone letter in some orthographies
- * Budu (Congo), Sabaot (Kenya), and several Papua New Guinea languages
- ref U+003A : colon (Basic Latin)
- U+A78A ꞊ modifier letter short equals sign Sk L
- * used as a tone letter in some orthographies
- * Budu (Congo)
- ref U+003D = equals sign (Basic Latin)
Orthographic letters for glottals
- U+A78B Ꞌ Latin capital letter saltillo Lu L
- U+A78C ꞌ Latin small letter saltillo Ll L
- * saltillos are used as a casing pair for glottal stop in some orthographies
- * Huasteco and other languages of Mexico, Izere (Nigeria)
- ref U+0027 ' apostrophe (Basic Latin)
- ref U+0242 ɂ Latin small letter glottal stop (Latin Extended B)
- ref U+0294 ʔ Latin letter glottal stop (IPA Extensions)
- ref U+02BC ʼ modifier letter apostrophe (Spacing Modifier Letters)
- ref U+02C0 ˀ modifier letter glottal stop (Spacing Modifier Letters)
Ancient Roman epigraphic letters
- U+A7FB ꟻ Latin epigraphic letter reversed f Lo L
- U+A7FC ꟼ Latin epigraphic letter reversed p Lo L
- U+A7FD ꟽ Latin epigraphic letter inverted m Lo L
- U+A7FE ꟾ Latin epigraphic letter I longa Lo L
- U+A7FF ꟿ Latin epigraphic letter archaic m Lo L
Combining Half Marks
Continuous macrons for Coptic
These are used in combinations to represent continuous macrons over a sequence of Coptic letters.
- U+FE24 ︤ combining macron left half Mn NSM
- U+FE25 ︥ combining macron right half Mn NSM
- U+FE26 ︦ combining conjoining macron Mn NSM
- ref U+0304 ̄ combining macron (Combining Diacritical Marks)
- ref U+035E ͞ combining double macron (Combining Diacritical Marks)
Musical Symbols
Rest
- U+1D129 𝄩 musical symbol multiple measure rest So L
- * used to represent rests of arbitrary lengths, extending across multiple measures
- ref U+1D13A 𝄺 musical symbol multi rest (Musical Symbols)
Altered Characters
In addition, 8 characters were altered in 5.1
A total of 3 characters changed their General Category
1 characters changed their General Category from Punctuation, Other to Punctuation, Dash
2 characters changed their General Category from Symbol, Modifier to Letter, Modifier
A total of 5 characters changed their Bidirectional Category
5 characters changed their Bidirectional Category from Right To Left Arabic to Arabic Number
Spacing Modifier Letters
U+02EC
ˬ modifier letter voicing had its
General Category changed from
Symbol, Modifier to
Letter, Modifier
Greek and Coptic
U+0374
ʹ Greek numeral sign had its
General Category changed from
Symbol, Modifier to
Letter, Modifier
Hebrew
U+05BE
־ Hebrew punctuation maqaf had its
General Category changed from
Punctuation, Other to
Punctuation, Dash
Arabic
U+0600
Arabic number sign had its
Bidirectional Category changed from
Right To Left Arabic to
Arabic Number
U+0601
Arabic sign sanah had its
Bidirectional Category changed from
Right To Left Arabic to
Arabic Number
U+0602
Arabic footnote marker had its
Bidirectional Category changed from
Right To Left Arabic to
Arabic Number
U+0603
Arabic sign safha had its
Bidirectional Category changed from
Right To Left Arabic to
Arabic Number
U+06DD
Arabic end of ayah had its
Bidirectional Category changed from
Right To Left Arabic to
Arabic Number
http://unicode.org
Some prose may have been lifted verbatim from unicode.org,
as is permitted by their terms of use at http://www.unicode.org/copyright.html