Phonetic symbols in Unicode explained

Unicode supports several phonetic scripts and notation systems through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from the International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.

Phonetic scripts

The International Phonetic Alphabet (IPA) makes use of letters from other writing systems as most phonetic scripts do. IPA notably uses Latin, Greek and Cyrillic characters. Combining diacritics also add meaning to the phonetic text. Finally, these phonetic alphabets make use of modifier letters, that are specially constructed for phonetic meaning. A "modifier letter" is strictly intended not as an independent grapheme but as a modification of the preceding character[1] resulting in a distinct grapheme, notably in the context of the International Phonetic Alphabet. For example, ʰ should not occur on its own but modifies the preceding or following symbol. Thus, pronounced as /tʰ/ is a single IPA symbol, distinct from pronounced as /t/. In practice, however, several of these "modifier letters" are also used as full graphemes, e.g. ʿ as transliterating Semitic ayin or Hawaiian ʻokina, or ˚ transliterating Abkhaz ә.

From to Unicode

See main article: International Phonetic Alphabet.

Consonants

The following tables indicates the Unicode code point sequences for phonemes as used in the International Phonetic Alphabet. A bold code point indicates that the Unicode chart provides an application note such as "voiced retroflex lateral" for . An entry in bold italics indicates the character name itself refers to a phoneme such as

BilabialLabiodentalDentalAlveolarPostalveolarRetroflexLabialized palatalPostalveolar-velar
Plosivepronounced as /link/
<code>U+0070
pronounced as /link/
<code>U+0062
pronounced as /link/
U+0070 '''U+032A'''
pronounced as /link/
U+0062 '''U+032A'''
pronounced as /link/
U+0074 '''U+032A'''
pronounced as /link/
U+0064 '''U+032A'''
pronounced as /link/
<code>U+0074
pronounced as /link/
<code>U+0064
pronounced as /link/
<code>'''U+0288'''
pronounced as /link/
<code>'''U+0256'''
Implosivepronounced as /ɓ̥/
'''U+0253''' '''U+0325'''
pronounced as /link/
<code>'''U+0253'''
pronounced as /ɗ̪/
'''U+0257''' '''U+032A'''
pronounced as /link/
<code>'''U+0257'''
pronounced as //
'''U+1D91'''
Ejectivepronounced as /link/
U+0070 '''U+02BC'''
pronounced as /link/
U+0074 '''U+032A''' '''U+02BC'''
pronounced as /link/
U+0074 '''U+02BC'''
pronounced as /link/
'''U+0288''' '''U+02BC'''
Nasalpronounced as /link/
U+006D '''U+0325'''
pronounced as /link/
<code>U+006D
pronounced as /ɱ̊/
'''U+0271''' U+030A
pronounced as /link/
<code>'''U+0271'''
pronounced as /n̪̊/
U+006E '''U+032A''' U+030A
pronounced as /link/
U+006E '''U+032A'''
pronounced as /link/
U+006E '''U+0325'''
pronounced as /link/
<code>U+006E
pronounced as /link/
'''U+0273''' U+030A
pronounced as /link/
<code>'''U+0273'''
Trillpronounced as /link/
<code>'''U+0299'''
pronounced as /link/
U+0072 '''U+0325'''
pronounced as /link/
<code>U+0072
Tap or Flappronounced as /link/
'''U+2C71''' U+031F
pronounced as /link/
<code>'''U+2C71'''
pronounced as /link/
<code>'''U+027E'''
pronounced as /link/
<code>'''U+027D'''
Lateral flappronounced as /link/
<code>U+027A

'''U+1DF08'''
Fricativepronounced as /link/
<code>'''U+0278'''
pronounced as /link/
<code>U+03B2
pronounced as /link/
<code>U+0066
pronounced as /link/
<code>U+0076
pronounced as /link/
<code>U+03B8
pronounced as /link/
<code>U+00F0
pronounced as /link/
<code>U+0073
pronounced as /link/
<code>U+007A
pronounced as /link/
<code>'''U+0283'''
pronounced as /link/
<code>'''U+0292'''
pronounced as /link/
<code>'''U+0282'''
pronounced as /link/
<code>'''U+0290'''
pronounced as /link/
<code>'''U+0267'''
Lateral fricativepronounced as /link/
<code>'''U+026C'''
pronounced as /link/
<code>'''U+026E'''
pronounced as /link/
<code>'''U+A78E'''
Ejective fricativepronounced as /link/
U+0073 '''U+02BC'''
pronounced as /ʃʼ/
'''U+0283''' '''U+02BC'''
Ejective lateral fricativepronounced as /link/
'''U+026C''' '''U+02BC'''
Percussivepronounced as /ʬ/
<code>'''''U+02AC'''''
pronounced as /ʭ/
<code>'''''U+02AD'''''
Approximantpronounced as /β̞̊ /
U+03B2 U+031E U+030A
pronounced as /link/
U+03B2 U+031E
pronounced as /ʋ̥/
'''U+028B''' U+0325
pronounced as /link/
<code>'''U+028B'''
pronounced as /link/
U+00F0 '''U+031E'''
pronounced as /ɹ̥/
'''U+0279''' U+0325
pronounced as /link/
<code>'''U+0279'''
pronounced as /ɻ̊/
'''U+027B''' U+030A
pronounced as /link/
<code>'''U+027B'''
pronounced as /ɥ̊/
'''U+0265''' U+030A
pronounced as /link/
<code>'''U+0265'''
Lateral approximantpronounced as /link/
U+006C U+0325
pronounced as /link/
<code>U+006C
pronounced as /link/
<code>'''U+026D'''
Click consonantpronounced as /link/
<code>'''''U+0298'''''
pronounced as /link/
<code>'''''U+01C0'''''
pronounced as /link/
<code>'''''U+01C3'''''
pronounced as /link/ / pronounced as /link/
<code>'''''U+01C3''''' / <code>'''''U+01C2'''''
pronounced as /link/
<code>'''''U+1DF0A'''''
Lateral clickpronounced as /link/
<code>'''''U+01C1'''''
Alveolo-palatalPalatalLabial-velarVelarUvularPharyngealEpiglottalGlottal
Plosivepronounced as /link/
<code>U+0236
pronounced as /link/
<code>U+0221
pronounced as /link/
<code>U+0063
pronounced as /link/
<code>'''U+025F'''
pronounced as /link/
U+006B U+0361 U+0070
pronounced as /link/
U+0261 U+0361 U+0062
pronounced as /link/
<code>U+006B
pronounced as /link/
<code>U+0261
pronounced as /link/
<code>U+0071
pronounced as /link/
<code>'''U+0262'''
pronounced as /link/
<code>'''''U+02A1'''''
pronounced as /link/
<code>'''''U+0294'''''
Implosivepronounced as /link/
<code>'''U+0284'''
pronounced as /link/
<code>U+0260
pronounced as /link/
<code>'''U+029B'''
Ejectivepronounced as /link/
U+0063 '''U+02BC'''
pronounced as /link/
U+006B '''U+02BC'''
pronounced as /link/
U+0071 '''U+02BC'''
Nasalpronounced as /link/
<code>U+0235
pronounced as /link/
<code>'''U+0272'''
pronounced as /link/
U+014B U+0361 U+006D
pronounced as /link/
<code>U+014B
pronounced as /link/
<code>'''U+0274'''
Trillpronounced as /link/
<code>'''U+0280'''
Tap or Flap
Lateral flap
Fricativepronounced as /link/
<code>'''U+0255'''
pronounced as /link/
<code>'''U+0291'''
pronounced as /link/
U+00E7
pronounced as /link/
<code>'''U+029D'''
pronounced as /link/
<code>U+0078
pronounced as /link/
<code>'''U+0263'''
pronounced as /link/
<code>U+03C7
pronounced as /link/
<code>U+0281
pronounced as /link/
<code>U+0127
pronounced as /link/
<code>'''U+0295'''
pronounced as /link/
<code>'''U+029C'''
pronounced as /link/
<code>U+02A2
pronounced as /link/
<code>U+0068
pronounced as /link/
<code>'''U+0266'''
Approximantpronounced as /link/
<code>U+006A
pronounced as /link/
<code>U+028D
pronounced as /link/
<code>U+0077
pronounced as /link/
<code>'''U+0270'''
Lateral approximantpronounced as /link/
<code>U+0234
pronounced as /link/
<code>'''U+028E'''
pronounced as /link/
<code>'''U+029F'''

Vowels

pronounced as /vowels/The following figures depict the phonetic vowels and their Unicode / UCS code points, arranged to represent the phonetic vowel trapezium. Vowels appearing in pairs in the figure to the right indicate rounded and unrounded variations respectively. Again, characters with Unicode names referring to phonemes are indicated by bold text. Those with explicit application notes are indicated by bold italic text. Those from borrowed unchanged from another script (Latin, Greek or Cyrillic) are indicated by italics. Before and after a bullet are the unrounded • rounded vowels.

Unicode code points for phonetic vowels
FrontCentralBack
Closepronounced as /link/ • pronounced as /link/
''U+0069''<br>•<br>''U+0079''
pronounced as /link/ • pronounced as /link/
U+0268<br>•<br>U+0289
pronounced as /link/ • pronounced as /link/
U+026F<br>•<br>''U+0075''
Near-closepronounced as /link/ • pronounced as /link/
U+026A<br>•<br>U+028F
pronounced as /link/ • pronounced as /link/
U+026A U+0308<br>•<br>U+028A U+0308
pronounced as / • ʊ/
<code>&nbsp;<br>•<br>U+028A
Close-midpronounced as /link/ • pronounced as /link/
U+0065<br>•<br>U+00F8
pronounced as /link/ • pronounced as /link/
U+0258<br>•<br>U+0275
pronounced as /link/ • pronounced as /link/
U+0264<br>•<br>U+006F
Midpronounced as /link/ • pronounced as /link/
U+0065 U+031E<br>•<br>U+00F8 U+031E
pronounced as /link/
<code>&nbsp;<br>U+0259<br>&nbsp;
pronounced as /link/ • pronounced as /link/
U+0264 U+031E<br>•<br>U+006F U+031E
Open-midpronounced as /link/ • pronounced as /link/
U+025B<br>•<br>U+0153
pronounced as /link/ • pronounced as /link/
U+025C<br>•<br>U+025E
pronounced as /link/ • pronounced as /link/
U+028C<br>•<br>U+0254
Near-openpronounced as /link/ •
U+00E6<br>•<br>&nbsp;
pronounced as /link/
&nbsp;<br>U+0250<br>&nbsp;
Openpronounced as /link/ • pronounced as /link/
U+0061<br>•<br>U+0276
pronounced as /link/ •
U+0061 U+0308<br>•<br>&nbsp;
pronounced as /link/ • pronounced as /link/
U+0251<br>•<br>U+0252

Diacritics

Diacritics may be encoded as either modifier (e.g. ˳) or combining (e.g.

◌̥) characters.
VoicelessBreathy VoicedDentalSyllabic
˳◌̥
U+02F3 • U+0325
◌̤
U+0324
◌͏̪
U+032A
ˌ◌̩
U+02CC • U+0329
VoicedCreaky VoicedApicalNon-syllabic
ˬ◌̬
U+02EC • U+032C
˷◌̰
U+02F7 • U+0330
˽◌̺
U+02FD • U+033A
◌͏̯
U+032F
AspiratedLinguolabialLaminalMore Rounded
ʰ
U+02B0
◌͏̼
U+033C
◌͏̻
U+033B
˒◌̹
U+02D2 • U+0339
LabializedNasalizedPalatalizedLess Rounded
ʷ
U+02B7
◌̃
U+0303
ʲ
U+02B2
˓◌̜
U+02D3 • U+031C
AdvancedNasal releaseCentralizedVelarized
˖◌̟
U+02D6 • U+031F

U+207F
¨◌̈
U+00A8{{ref label|centralized_tag|1}} • U+0308
ˠ
U+02E0
RetractedLateral releaseMid-CentralizedPharyngealized
ˍ◌̠
U+02CD • U+0320
ˡ
U+02E1
˟◌̽
U+02DF • U+033D
ˤ
U+02E4
Advanced Tongue RootNo audible releaseRaisedVelarized or Pharyngealized
◌̘
U+AB6A • U+0318
˺◌̚
U+02FA • U+031A
˔◌̝
U+02D4 • U+031D
◌̴
U+0334
Retracted Tongue RootRhoticityLoweredLengthened
◌̙
U+AB6B • U+0319
˞
U+02DE
˕◌̞
U+02D5 • U+031E
ː
U+02D0
Notes

1.The codepoint refers to diaeresis, which takes up space but is not a Spacing Modifier Letter.

Unicode blocks

Unicode blocks with many phonetic symbols

Six Unicode blocks contain many phonetic symbols:

IPA Extensions (U+0250–02AF)

See main article: IPA Extensions (Unicode block).

Spacing Modifier Letters (U+02B0–02FF)

The characters in the "Spacing Modifier Letters" block are intended as forming a unity with the preceding letter (which they "modify"). E.g. the character isn't intended simply as a superscript h (h), but as the mark of aspiration placed after the letter being aspirated, as in pronounced as /pʰ/ "aspirated voiceless bilabial plosive". The block contains:

Phonetic Extensions (U+1D00–1D7F)

This block, together with Phonetic Extensions Supplement below, contains:

Phonetic Extensions Supplement (U+1D80–1DBF)

Modifier Tone Letters (U+A700–A71F)

Superscripts and Subscripts (U+2070–209F)

Input by selection from a screen

Many systems provide a way to select Unicode characters visually. ISO/IEC 14755 refers to this as a screen-selection entry method.

Microsoft Windows has provided a Unicode version of the Character Map program (find it by hitting then type charmap then hit) since version NT 4.0 – appearing in the consumer edition since XP. This is limited to characters in the Basic Multilingual Plane (BMP). Characters are searchable by Unicode character name, and the table can be limited to a particular code block. More advanced third-party tools of the same type are also available (a notable freeware example is BabelMap).

macOS provides a "character palette" with much the same functionality, along with searching by related characters, glyph tables in a font, etc. It can be enabled in the input menu in the menu bar under System Preferences → International → Input Menu (or System Preferences → Language and Text → Input Sources) or can be viewed under Edit → Emoji & Symbols in many programs.

Equivalent tools – such as gucharmap (GNOME) or kcharselect (KDE) – exist on most Linux desktop environments.

See also

External links

pronounced as /navigation/

Notes and References

  1. Web site: Spacing modifier letters . Everything2.com . 2002-08-29 . 2016-01-23.