Rangestart: | 0180 |
Rangeend: | 024F |
Script1: | Latin |
Alphabets: | Africa alphabet Americanist Azerbaijani Khoisan Pan-Nigerian Pinyin Romanian |
1 0 0: | 113 |
1 1: | 35 |
3 0: | 30 |
3 2: | 1 |
4 0: | 4 |
4 1: | 11 |
5 0: | 14 |
Note: | Block range was extended by 80 code points in Unicode 1.1 during the unification with ISO 10646.[1] [2] |
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.[3]
Code | Glyph | Decimal | Description | |
---|---|---|---|---|
Non-European and historic Latin | ||||
U+0180 | ƀ | Latin Small Letter B with Stroke | ||
U+0181 | Ɓ | Latin Capital Letter B with Hook | ||
U+0182 | Ƃ | Latin Capital Letter B with Top Bar | ||
U+0183 | ƃ | Latin Small Letter B with Top Bar | ||
U+0184 | Ƅ | Latin Capital Letter Tone Six | ||
U+0185 | ƅ | Latin Small Letter Tone Six | ||
U+0186 | Ɔ | Latin Capital Letter Open O | ||
U+0187 | Ƈ | Latin Capital Letter C with Hook | ||
U+0188 | ƈ | Latin Small Letter C with Hook | ||
U+0189 | Ɖ | Latin Capital Letter African D | ||
U+018A | Ɗ | Latin Capital Letter D with Hook | ||
U+018B | Ƌ | Latin Capital Letter D with Top Bar | ||
U+018C | ƌ | Latin Small Letter D with Top Bar | ||
U+018D | ƍ | Latin Small Letter Turned Delta | ||
U+018E | Ǝ | Latin Capital Letter Reversed E | ||
U+018F | Ə | Latin Capital Letter Schwa | ||
U+0190 | Ɛ | Latin Capital Letter Open E (= Latin Capital Letter Epsilon) | ||
U+0191 | Ƒ | Latin Capital Letter F with Hook | ||
U+0192 | ƒ | Latin Small Letter F with Hook | ||
U+0193 | Ɠ | Latin Capital Letter G with Hook | ||
U+0194 | Ɣ | Latin Capital Letter Gamma | ||
U+0195 | ƕ | Latin Small Letter HV | ||
U+0196 | Ɩ | Latin Capital Letter Iota | ||
U+0197 | Ɨ | Latin Capital Letter I with Stroke | ||
U+0198 | Ƙ | Latin Capital Letter K with Hook | ||
U+0199 | ƙ | Latin Small Letter K with Hook | ||
U+019A | ƚ | Latin Small Letter L with Bar | ||
U+019B | ƛ | Latin Small Letter Lambda with Stroke | ||
U+019C | Ɯ | Latin Capital Letter Turned M | ||
U+019D | Ɲ | Latin Capital Letter N with Left Hook | ||
U+019E | ƞ | Latin Small Letter N with Long Right Leg | ||
U+019F | Ɵ | Latin Capital Letter O with Middle Tilde | ||
U+01A0 | Ơ | Latin Capital Letter O with Horn | ||
U+01A1 | ơ | Latin Small Letter O with Horn | ||
U+01A2 | Ƣ | Latin Capital Letter OI (= Latin Capital Letter Gha) | ||
U+01A3 | ƣ | Latin Small Letter OI (= Latin Small Letter Gha) | ||
U+01A4 | Ƥ | Latin Capital Letter P with Hook | ||
U+01A5 | ƥ | Latin Small Letter P with Hook | ||
U+01A6 | Ʀ | Latin Letter YR | ||
U+01A7 | Ƨ | Latin Capital Letter Tone Two | ||
U+01A8 | ƨ | Latin Small Letter Tone Two | ||
U+01A9 | Ʃ | Latin Capital Letter Esh | ||
U+01AA | ƪ | Latin Letter Reversed Esh Loop | ||
U+01AB | ƫ | Latin Small Letter T with Palatal Hook | ||
U+01AC | Ƭ | Latin Capital Letter T with Hook | ||
U+01AD | ƭ | Latin Small Letter T with Hook | ||
U+01AE | Ʈ | Latin Capital Letter T with Retroflex Hook | ||
U+01AF | Ư | Latin Capital Letter U with Horn | ||
U+01B0 | ư | Latin Small Letter U with Horn | ||
U+01B1 | Ʊ | Latin Capital Letter Upsilon | ||
U+01B2 | Ʋ | Latin Capital Letter V with Hook | ||
U+01B3 | Ƴ | Latin Capital Letter Y with Hook | ||
U+01B4 | ƴ | Latin Small Letter Y with Hook | ||
U+01B5 | Ƶ | Latin Capital Letter Z with Stroke | ||
U+01B6 | ƶ | Latin Small Letter Z with Stroke | ||
U+01B7 | Ʒ | Latin Capital Letter Ezh | ||
U+01B8 | Ƹ | Latin Capital Letter Ezh Reversed | ||
U+01B9 | ƹ | Latin Small Letter Ezh Reversed | ||
U+01BA | ƺ | Latin Small Letter Ezh with Tail | ||
U+01BB | ƻ | Latin Letter Two with Stroke | ||
U+01BC | Ƽ | Latin Capital Letter Tone Five | ||
U+01BD | ƽ | Latin Small Letter Tone Five | ||
U+01BE | ƾ | Latin Letter Inverted Glottal Stop with Stroke | ||
U+01BF | ƿ | Latin Letter Wynn | ||
African letters for clicks | ||||
U+01C0 | ǀ | Latin Letter Dental Click | ||
U+01C1 | ǁ | Latin Letter Lateral Click | ||
U+01C2 | ǂ | Latin Letter Alveolar Click | ||
U+01C3 | ǃ | Latin Letter Retroflex Click | ||
Croatian digraphs matching Serbian Cyrillic letters | ||||
U+01C4 | DŽ | Latin Capital Letter DZ with Caron | ||
U+01C5 | Dž | Latin Capital Letter D with Small Letter Z with Caron | ||
U+01C6 | dž | Latin Small Letter DZ with Caron | ||
U+01C7 | LJ | Latin Capital Letter LJ | ||
U+01C8 | Lj | Latin Capital Letter L with Small Letter J | ||
U+01C9 | lj | Latin Small Letter LJ | ||
U+01CA | NJ | Latin Capital Letter NJ | ||
U+01CB | Nj | Latin Capital Letter N with Small Letter J | ||
U+01CC | nj | Latin Small Letter NJ | ||
Pinyin diacritic-vowel combinations | ||||
U+01CD | Ǎ | Latin Capital Letter A with Caron | ||
U+01CE | ǎ | Latin Small Letter A with Caron | ||
U+01CF | Ǐ | Latin Capital Letter I with Caron | ||
U+01D0 | ǐ | Latin Small Letter I with Caron | ||
U+01D1 | Ǒ | Latin Capital Letter O with Caron | ||
U+01D2 | ǒ | Latin Small Letter O with Caron | ||
U+01D3 | Ǔ | Latin Capital Letter U with Caron | ||
U+01D4 | ǔ | Latin Small Letter U with Caron | ||
U+01D5 | Ǖ | Latin Capital Letter U with Diaeresis and Macron | ||
U+01D6 | ǖ | Latin Small Letter U with Diaeresis and Macron | ||
U+01D7 | Ǘ | Latin Capital Letter U with Diaeresis and Acute | ||
U+01D8 | ǘ | Latin Small Letter U with Diaeresis and Acute | ||
U+01D9 | Ǚ | Latin Capital Letter U with Diaeresis and Caron | ||
U+01DA | ǚ | Latin Small Letter U with Diaeresis and Caron | ||
U+01DB | Ǜ | Latin Capital Letter U with Diaeresis and Grave | ||
U+01DC | ǜ | Latin Small Letter U with Diaeresis and Grave | ||
Phonetic and historic letters | ||||
U+01DD | ǝ | Latin Small Letter Turned E | ||
U+01DE | Ǟ | Latin Capital Letter A with Diaeresis and Macron | ||
U+01DF | ǟ | Latin Small Letter A with Diaeresis and Macron | ||
U+01E0 | Ǡ | Latin Capital Letter A with Dot Above and Macron | ||
U+01E1 | ǡ | Latin Small Letter A with Dot Above and Macron | ||
U+01E2 | Ǣ | Latin Capital Letter AE with Macron | ||
U+01E3 | ǣ | Latin Small Letter AE with Macron | ||
U+01E4 | Ǥ | Latin Capital Letter G with Stroke | ||
U+01E5 | ǥ | Latin Small Letter G with Stroke | ||
U+01E6 | Ǧ | Latin Capital Letter G with Caron | ||
U+01E7 | ǧ | Latin Small Letter G with Caron | ||
U+01E8 | Ǩ | Latin Capital Letter K with Caron | ||
U+01E9 | ǩ | Latin Small Letter K with Caron | ||
U+01EA | Ǫ | Latin Capital Letter O with Ogonek | ||
U+01EB | ǫ | Latin Small Letter O with Ogonek | ||
U+01EC | Ǭ | Latin Capital Letter O with Ogonek and Macron (=Latin Capital Letter O with Macron and Ogonek) | ||
U+01ED | ǭ | Latin Small Letter O with Ogonek and Macron (=Latin Small Letter O with Macron and Ogonek) | ||
U+01EE | Ǯ | Latin Capital Letter Ezh with Caron | ||
U+01EF | ǯ | Latin Small Letter Ezh with Caron | ||
U+01F0 | ǰ | Latin Small Letter J with Caron | ||
U+01F1 | DZ | Latin Capital Letter DZ | ||
U+01F2 | Dz | Latin Capital Letter D with Small Letter Z | ||
U+01F3 | dz | Latin Small Letter DZ | ||
U+01F4 | Ǵ | Latin Capital Letter G with Acute | ||
U+01F5 | ǵ | Latin Small Letter G with Acute | ||
U+01F6 | Ƕ | Latin Capital Letter Hwair | ||
U+01F7 | Ƿ | Latin Capital Letter Wynn | ||
U+01F8 | Ǹ | Latin Capital Letter N with Grave | ||
U+01F9 | ǹ | Latin Small Letter N with Grave | ||
U+01FA | Ǻ | Latin Capital Letter A with Ring Above and Acute | ||
U+01FB | ǻ | Latin Small Letter A with Ring Above and Acute | ||
U+01FC | Ǽ | Latin Capital Letter AE with Acute | ||
U+01FD | ǽ | Latin Small Letter AE with Acute | ||
U+01FE | Ǿ | Latin Capital Letter O with Stroke and Acute | ||
U+01FF | ǿ | Latin Small Letter O with Stroke and Acute | ||
Additions for Slovenian and Croatian | ||||
U+0200 | Ȁ | Latin Capital Letter A with Double Grave | ||
U+0201 | ȁ | Latin Small Letter A with Double Grave | ||
U+0202 | Ȃ | Latin Capital Letter A with Inverted Breve | ||
U+0203 | ȃ | Latin Small Letter A with Inverted Breve | ||
U+0204 | Ȅ | Latin Capital Letter E with Double Grave | ||
U+0205 | ȅ | Latin Small Letter E with Double Grave | ||
U+0206 | Ȇ | Latin Capital Letter E with Inverted Breve | ||
U+0207 | ȇ | Latin Small Letter E with Inverted Breve | ||
U+0208 | Ȉ | Latin Capital Letter I with Double Grave | ||
U+0209 | ȉ | Latin Small Letter I with Double Grave | ||
U+020A | Ȋ | Latin Capital Letter I with Inverted Breve | ||
U+020B | ȋ | Latin Small Letter I with Inverted Breve | ||
U+020C | Ȍ | Latin Capital Letter O with Double Grave | ||
U+020D | ȍ | Latin Small Letter O with Double Grave | ||
U+020E | Ȏ | Latin Capital Letter O with Inverted Breve | ||
U+020F | ȏ | Latin Small Letter O with Inverted Breve | ||
U+0210 | Ȑ | Latin Capital Letter R with Double Grave | ||
U+0211 | ȑ | Latin Small Letter R with Double Grave | ||
U+0212 | Ȓ | Latin Capital Letter R with Inverted Breve | ||
U+0213 | ȓ | Latin Small Letter R with Inverted Breve | ||
U+0214 | Ȕ | Latin Capital Letter U with Double Grave | ||
U+0215 | ȕ | Latin Small Letter U with Double Grave | ||
U+0216 | Ȗ | Latin Capital Letter U with Inverted Breve | ||
U+0217 | ȗ | Latin Small Letter U with Inverted Breve | ||
Additions for Romanian | ||||
U+0218 | Ș | Latin Capital Letter S with Comma Below | ||
U+0219 | ș | Latin Small Letter S with Comma Below | ||
U+021A | Ț | Latin Capital Letter T with Comma Below | ||
U+021B | ț | Latin Small Letter T with Comma Below | ||
Miscellaneous additions | ||||
U+021C | Ȝ | Latin Capital Letter Yogh | ||
U+021D | ȝ | Latin Small Letter Yogh | ||
U+021E | Ȟ | Latin Capital Letter H with Caron | ||
U+021F | ȟ | Latin Small Letter H with Caron | ||
U+0220 | Ƞ | Latin Capital Letter N with Long Right Leg | ||
U+0221 | ȡ | Latin Small Letter D with Curl | ||
U+0222 | Ȣ | Latin Capital Letter OU | ||
U+0223 | ȣ | Latin Small Letter OU | ||
U+0224 | Ȥ | Latin Capital Letter Z with Hook | ||
U+0225 | ȥ | Latin Small Letter Z with Hook | ||
U+0226 | Ȧ | Latin Capital Letter A with Dot Above | ||
U+0227 | ȧ | Latin Small Letter A with Dot Above | ||
U+0228 | Ȩ | Latin Capital Letter E with Cedilla | ||
U+0229 | ȩ | Latin Small Letter E with Cedilla | ||
Additions for Livonian | ||||
U+022A | Ȫ | Latin Capital Letter O with Diaeresis and Macron | ||
U+022B | ȫ | Latin Small Letter O with Diaeresis and Macron | ||
U+022C | Ȭ | Latin Capital Letter O with Tilde and Macron | ||
U+022D | ȭ | Latin Small Letter O with Tilde and Macron | ||
U+022E | Ȯ | Latin Capital Letter O with Dot Above | ||
U+022F | ȯ | Latin Small Letter O with Dot Above | ||
U+0230 | Ȱ | Latin Capital Letter O with Dot Above and Macron | ||
U+0231 | ȱ | Latin Small Letter O with Dot Above and Macron | ||
U+0232 | Ȳ | Latin Capital Letter Y with Macron | ||
U+0233 | ȳ | Latin Small Letter Y with Macron | ||
Additions for Sinology | ||||
U+0234 | ȴ | Latin Small Letter L with Curl | ||
U+0235 | ȵ | Latin Small Letter N with Curl | ||
U+0236 | ȶ | Latin Small Letter T with Curl | ||
Miscellaneous addition | ||||
U+0237 | ȷ | Latin Small Letter Dotless J | ||
Additions for Africanist linguistics | ||||
U+0238 | ȸ | Latin Small Letter DB Digraph | ||
U+0239 | ȹ | Latin Small Letter QP Digraph | ||
Additions for Sencoten | ||||
U+023A | Ⱥ | Latin Capital Letter A with Stroke | ||
U+023B | Ȼ | Latin Capital Letter C with Stroke | ||
U+023C | ȼ | Latin Small Letter C with Stroke | ||
U+023D | Ƚ | Latin Capital Letter L with Bar | ||
U+023E | Ⱦ | Latin Capital Letter T with Diagonal Stroke | ||
Additions for Africanist linguistics | ||||
U+023F | ȿ | Latin Small Letter S with Swash Tail | ||
U+0240 | ɀ | Latin Small Letter Z with Swash Tail | ||
Miscellaneous additions | ||||
U+0241 | Ɂ | Latin Capital Letter Glottal Stop | ||
U+0242 | ɂ | Latin Small Letter Glottal Stop | ||
U+0243 | Ƀ | Latin Capital Letter B with Stroke | ||
U+0244 | Ʉ | Latin Capital Letter U Bar | ||
U+0245 | Ʌ | Latin Capital Letter Turned V | ||
U+0246 | Ɇ | Latin Capital Letter E with Stroke | ||
U+0247 | ɇ | Latin Small Letter E with Stroke | ||
U+0248 | Ɉ | Latin Capital Letter J with Stroke | ||
U+0249 | ɉ | Latin Small Letter J with Stroke | ||
U+024A | Ɋ | Latin Capital Letter Q with Hook Tail | ||
U+024B | ɋ | Latin Small Letter Q with Hook Tail | ||
U+024C | Ɍ | Latin Capital Letter R with Stroke | ||
U+024D | ɍ | Latin Small Letter R with Stroke | ||
U+024E | Ɏ | Latin Capital Letter Y with Stroke | ||
U+024F | ɏ | Latin Small Letter Y with Stroke |
The Latin Extended-B block contains ten subheadings for groups of characters: Non-European and historic Latin, African letters for clicks, Croatian digraphs matching Serbian Cyrillic letters, Pinyin diacritic-vowel combinations, Phonetic and historic letters, Additions for Slovenian and Croatian, Additions for Romanian, Miscellaneous additions, Additions for Livonian, and Additions for Sinology. The Non-European and historic, African clicks, Croatian digraphs, Pinyin, and the first part of the Phonetic and historic letters were present in Unicode 1.0; additional Phonetic and historic letters were added for version 3.0; and other Phonetic and historic, as well as the rest of the sub-blocks were the characters added for version 1.1.
The Non-European and historic Latin subheading contains the first 64 characters of the block, and includes various variant letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. It does not contain any standard letters with diacritics.
The four African letters for clicks are used in Khoisan orthography.
The Croatian digraphs matching Serbian Cyrillic letters are three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts, Serbo-Croatian being a digraphic language.
The 16 Pinyin diacritic-vowel combinations are used to represent the standard Mandarin Chinese vowel sounds with tone marks.
The 35 Phonetic and historic letters are largely various standard and variant Latin letters with diacritic marks.
The 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve.
The Additions for Romanian are 4 characters that were erroneously unified as having a cedilla, when they have a comma below. The conflation of S and T with cedilla vs. comma below continues to plague Romanian language implementation up to the present.[4]
The Miscellaneous additions subheading contains 39 characters of various description and origin.
The Additions for Livonian are 10 letters with diacritics for writing the Livonian language.
The Additions for Sinology are three lowercase letters with curls used in the study of classical Chinese language.
The Additions for Africanist linguistics are two lowercase letter with swash tails used in Africanist linguistics.
The Additions for Sencoten are 5 letters with strokes for writing Saanich.
The following table shows the number of letters in the Latin Extended-B block.
Type of subheading | Number of symbols | Range of characters |
---|---|---|
Non-European and historic Latin | 64 various letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. | U+0180 to U+01BF |
African letters for clicks | Four African letters for clicks are used in Khoisan orthography. | U+01C0 to U+01C3 |
Croatian digraphs matching Serbian Cyrillic letters | Three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts. | U+01C4 to U+01CC |
Pinyin diacritic-vowel combinations | Sixteen diacritic-vowel combinations which are used to represent the standard Mandarin Chinese vowel sounds with tone marks. | U+01CD to U+01DC |
Phonetic and historic letters | 35 Phonetic and historic letters which are largely various standard and variant Latin letters with diacritic marks. | U+01DD to U+01FF |
Additions for Slovenian and Croatian | 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve. | U+0200 to U+0217 |
Additions for Romanian | 4 characters that were erroneously unified as having a cedilla, when they have a comma below. | U+0218 to U+021B |
Miscellaneous additions | 14 characters of various description and origin. | U+021C to U+0229 |
Additions for Livonian | 10 letters with diacritics for writing the Livonian language. | U+022A to U+0233 |
Additions for Sinology | Three lowercase letters with curls used in the study of classical Chinese language. | U+0234 to U+0236 |
The following Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-B block:
Count | UTC ID | L2 ID | WG2 ID | Document | |||
---|---|---|---|---|---|---|---|
1.0.0 | U+0180..01EF | 112 | (to be determined) | ||||
N994 | |||||||
doc) | |||||||
N1105 | |||||||
N1162 | |||||||
N2859 | |||||||
U+01F0 | 1 | ||||||
1.1 | U+01F1..01F5, 01FA..0217 | 35 | (to be determined) | ||||
N994 | |||||||
doc) | |||||||
3.0 | U+01F6..01F7, 021C..021D | 4 | N1166 | ||||
N1547 | |||||||
N1549 | |||||||
N1603 | |||||||
N1681 | |||||||
N1894 | |||||||
U+01F8..01F9 | 2 | N1282 | |||||
doc) | |||||||
N1355 | |||||||
N1353 | |||||||
N1461 | |||||||
N1453 | |||||||
N1681 | |||||||
N1894 | |||||||
U+0218..021B | 4 | ||||||
N1361 | |||||||
N1353 | |||||||
N1440 | |||||||
N1507 | |||||||
N1453 | |||||||
doc) | |||||||
N1603 | |||||||
N1681 | |||||||
N1894 | |||||||
html, doc) | |||||||
U+021E..021F | 2 | ||||||
N1619 | |||||||
N1703 | |||||||
N1905 | |||||||
U+0222..0225 | 4 | N1741 | |||||
html) | |||||||
html, Figure 1) | |||||||
N1840 | |||||||
N1885 | |||||||
N1847 | |||||||
doc) | |||||||
N1920 | |||||||
html, doc) | |||||||
U+0226..0229 | 4 | doc) | |||||
N1920 | |||||||
html, doc) | |||||||
U+022A..0233 | 10 | N1322 | |||||
N1353 | |||||||
N1453 | |||||||
N1888 | |||||||
N1920 | |||||||
html, doc) | |||||||
3.2 | U+0220 | 1 | N2306R | ||||
doc) | |||||||
4.0 | U+0221, 0234..0236 | 4 | N2366 | ||||
N2366R | |||||||
N2403 | |||||||
4.1 | U+0237 | 1 | N2590 | ||||
U+0238..0239, 023C | 3 | ||||||
N2740 | |||||||
U+023A..023B, 023D..023E | 4 | N2784R | |||||
U+023F..0240 | 2 | ||||||
N2799 | |||||||
U+0241 | 1 | ||||||
N2789 | |||||||
5.0 | U+0242 | 1 | |||||
N2962R | |||||||
N2942 | |||||||
doc) | |||||||
U+0243..024F | 13 | ||||||
N2906 | |||||||
doc) | |||||||