Historical Chinese phonology explained

pronounced as /notice/Historical Chinese phonology deals with reconstructing the sounds of Chinese from the past. As Chinese is written with logographic characters, not alphabetic or syllabary, the methods employed in Historical Chinese phonology differ considerably from those employed in, for example, Indo-European linguistics; reconstruction is more difficult because, unlike Indo-European languages, no phonetic spellings were used.

Chinese is documented over a long period of time, with the earliest oracle bone writings dated to c. 1250 BC. However, since the writing is mostly with logographic characters, which do not directly specify the phonology of the language, reconstruction is in general quite difficult, and depends to a large extent on ancillary sources that more directly document the language's phonology. On the basis of these sources, historical Chinese is divided into the following basic periods:

Old Chinese, broadly from about 1250 BC to 25 AD, when the Han dynasty came back to power after the Xin dynasty. More narrowly, reconstructed "Old Chinese" is based on the rhymes of early poetry such as the Shijing and the phonological components of Chinese characters, and is assumed to represent the language of c. 1000-700 BC. Proto-Min developed from Old Chinese.
Middle Chinese, broadly from about the 5th century AD (Northern and Southern dynasties, Sui, Tang, Song) through to 12th century AD. More narrowly, reconstructed "Middle Chinese" is usually based on the detailed phonetic evidence of the Qieyun rime dictionary (601 AD), later expanded into "Guangyun". The Qieyun describes a compromise between the northern and southern varieties and such rhyming dictionaries were essential to write and read aloud poetry with a rhyming pattern.
Modern varieties, from about the 13th century AD (beginning of the Yuan dynasty, in which Early Mandarin was developed) to the present. Most modern varieties appear to have split off from a Late Middle Chinese koine of about 1000 AD (although some remnants of earlier periods are still present, ex. stops without release at the end of the syllable in Hakka and Yue).

Overview

Middle Chinese had a structure much like many modern varieties, with largely monosyllabic words, little or no derivational morphology, four tone-classes (though three phonemic tones), and a syllable structure consisting of initial consonant, glide, main vowel and final consonant, with a large number of initial consonants and a fairly small number of final consonants. Not counting the glide, no clusters could occur at the beginning or end of a syllable.

Old Chinese, on the other hand, had a significantly different structure. Most scholars have concluded that there were no tones, a lesser imbalance between possible initial and final consonants, and a significant number of initial and final clusters. There was a well-developed system of derivational and possibly inflectional morphology, formed using consonants added onto the beginning or end of a syllable. This system is similar to the system reconstructed for Proto-Sino-Tibetan and still visible, for example, in the written Tibetan language; it is also largely similar to the system that occurs in the more conservative Mon–Khmer languages, such as modern Khmer (Cambodian).

The main changes leading to the modern varieties have been a reduction in the number of consonants and vowels and a corresponding increase in the number of tones (typically through a pan-East-Asiatic tone split that doubled the number of tones while eliminating the distinction between voiced and unvoiced consonants). This has led to a gradual decrease in the number of possible syllables. In Standard Mandarin, this has progressed to a farther extent than elsewhere, with only about 1,200 possible syllables. The result, in Mandarin especially, has been the proliferation of the number of two-syllable compound words, which have steadily replaced former monosyllabic words, to the extent that the majority of words in Standard Mandarin are now composed of two syllables.

Periodization of Chinese

The terms "Old Chinese" and "Middle Chinese" refer to long periods of time in and of themselves, during which significant changes occurred. Although there is no standard system for subdividing these periods, the following is an approximate chronology leading from the oldest writings in the oracle bone script up through modern Standard Mandarin:

Axel Schuessler uses the term Early Zhou Chinese to refer to the language from the earliest records down to the end of the Western Zhou period (c. 1250 to 771 BC).
W. A. C. H. Dobson uses the term Early Archaic Chinese to refer to the same period ("10th to 9th century BC"), although Schuessler suggests that the term should refer to a slightly later period.
Dobson uses the term Late Archaic Chinese to refer to the "4th to 3rd century BC"; that is, the period near the beginning of the Han dynasty.
Late Han Chinese (LHC) is c. 200 AD. This is around the time that Min Chinese varieties diverged from the others.
Old Northwest Chinese (ONWC), c. 400 AD, is a reconstruction by Weldon South Coblin of the language of the northwestern Chinese provinces of Gansu and Shaanxi that is immediately ancestral to a set of northwestern dialects documented by various early Tang dynasty authors.
Early Middle Chinese (EMC), c. 600 AD, is the language of the Qieyun rime dictionary, the first stage for which we have direct and detailed phonetic evidence. This evidence is not enough by itself to directly determine the sound system of the language, however, as it only subdivides characters into an initial consonant and non-initial portion, without further decomposing the latter into phonemes.
Late Middle Chinese (LMC), c. 1000 AD, is the native language of the authors of the Yunjing rime table and the oldest stage that can be reconstructed from modern non-Min varieties by the comparative method.
Early Mandarin, c. 1300 AD (sometimes specifically given as 1269–1455. The Yuan dynasty conquered the whole of China in 1279 and was overpowered by the Ming dynasty in 1368), was the language inferred in the 'Phags-pa script, the first alphabetic writing system for Chinese. It is also documented in the Zhōngyuán Yīnyùn (Chinese: 中原音韻 "Sounds and Rhymes of the Central Plains", an opera manual of 1324 AD written by Zhou Deqing).
Middle Mandarin, up through c. 1800 (sometimes specifically given as 1455–1795), documented in numerous Chinese, Korean and European sources. Among these are Chinese-Korean pedagogical texts such as Hongmu Chôngyun Yôkhun (1455) and Sasông T’onghae (1517); the Portuguese-Chinese dictionary (1583–1588) of the Christian missionaries Matteo Ricci and Michele Ruggieri; a dictionary by a friar, Basilio Brollo (1692-1694), also known as Basile da Glemona, a grammar of Mandarin by Francisco Varo (1703) and Chinese rhyme manuals such as the Yunlüe Huitong (1642).
Modern Standard Chinese, a standardized form of the dialect of Beijing that is little changed from the 19th century. It was promulgated during the Republic of China in 1932. Today, it is the standard variety in the PRC, founded by Mao Zedong in 1949.

Chinese native phonological traditions

A native tradition of Chinese phonology developed in the Middle Chinese period. Chinese linguists had long compiled dictionaries and attempted to identify the pronunciation of difficult characters by specifying homophone characters. During the early centuries AD, however, the more advanced method of fanqie was developed, which allowed the pronunciation of any syllable to be specified unambiguously by using one character to indicate the initial consonant and another to indicate the remainder. By the sixth century AD, systematic attempts were made to compile lists of all characters and specify their pronunciation by way of fanqie, culminating in rime dictionaries such as the Qieyun (601 AD).

During the next few centuries, the increasing influence of Buddhism and Buddhist scholars brought Chinese linguists in touch with the tradition of Sanskrit grammar, which included a highly advanced understanding of phonology and phonetics, including a system of analyzing sounds by distinctive features, such as place of articulation and type of phonation. This led to rime tables such as the Yunjing (c. 1150 AD), a sophisticated analysis of the sound system of the Qieyun.

During the Qing dynasty (1644–1912), scholars such as Duan Yucai diligently studied the sound system of Middle and Old Chinese. Through careful examination of rime tables, rime dictionaries and patterns of rhyming among poets of various eras, these scholars were able to work out the system of categories of rhymes in Old Chinese, and discover additional Middle Chinese categories that had previously been overlooked. However, progress in Chinese linguistics was seriously hampered by the lack of any concept of a phoneme - i.e. a basic unit of sound, including vowels and vowel-like segments as well as consonants. This made it impossible to go beyond determination of systems of rhyming categories to reconstruction of the actual sounds involved.

In some ways, the lack of native Chinese development of the concept of a unit of sound is surprising, as it had already been developed by Sanskrit grammarians such as Pāṇini by the 4th century BC at the latest, and the phonological analysis of the Yunjing shows a close familiarity with the tradition of Sanskrit grammar. Furthermore, some non-Chinese writing systems within the Chinese cultural orbit, such as the Korean script and especially the Tibetan script, were developed under the close influence of Indian writing systems and have the concept of phoneme directly embedded in them. (Furthermore, the 'Phags-pa script, an alphabetic script of Tibetan origin, had been used to write Chinese itself during the Mongolian Yuan dynasty, c. 1270–1360, although it later fell out of use.) It seems likely, however, that

The strong influence and long tradition of Chinese writing, which included no concept of an alphabet and always treated the rhyming part of a syllable as a single unit, made it difficult to independently develop the concept of a unit of sound.
Lack of knowledge of Sanskrit by most Chinese scholars precluded direct reading of the original works on Sanskrit grammar.
Cultural attitudes that treated Koreans, Tibetans, Mongolians and most other foreigners as "barbarians" made it difficult for scientific knowledge from these cultures to diffuse into China.

Modern methods of reconstruction

As a result, the first reconstructions of the actual sound systems of Old and Middle Chinese were only done in the early 20th century, by Swedish sinologist Bernhard Karlgren. Armed with his knowledge of Western historical linguistics, he performed field work in China between 1910 and 1912, creating a list of 3,100 Chinese characters and collecting phonological data on the pronunciation of these characters in 19 Mandarin dialects as well as the dialects of Shanghai (Wu), Fuzhou (Eastern Min), and Guangdong (Cantonese). He combined this with the Sino-Japanese and Sino-Vietnamese pronunciations as well as previously published material on nine other dialects, along with the fanqie analysis of the Guangyun rime dictionary (a later version of the Qieyun of 601 AD). In 1915, he published his reconstruction of Middle Chinese, which underlies in one form or another all subsequent reconstructions. Walter Simon and Henri Maspero also made great contributions in the field during the early days of its development. Karlgren himself had no direct access to the Qieyun, which was thought lost; however, fragments of the Qieyun were discovered in the Dunhuang Caves in the 1930s, and a nearly complete copy was discovered in 1947 in the Palace Museum.

Karlgren was also instrumental in early reconstruction of Old Chinese. His early work on Middle Chinese made various suggestions about Old Chinese, and a detailed reconstruction appeared in Grammata Serica (1940), a dictionary of Middle and Old Chinese reconstructions. An expanded version, Grammata Serica Recensa, was published in 1957 and is still a commonly cited source.

The reconstruction of Middle Chinese draws its data (in approximate order of importance) from:

Rime dictionaries and rime tables of the Middle Chinese era, such as the Qieyun (601 AD) and Yunjing (c. 1150 AD).
Modern Chinese speaking variants such as Yue, Hakka, Mandarin, Min, Wu etc.
Sino-Xenic data - Chinese loanwords borrowed in large numbers into Vietnamese, Japanese and Korean, especially during the period of 500–1000 AD. This large-scale borrowing led to the so-called Sino-Vietnamese, Sino-Japanese and Sino-Korean vocabularies of these languages.
Other early cases of Chinese words borrowed into foreign languages or transcribed in foreign sources, e.g. Sanskrit works in India.
Early cases of transliteration of foreign words from other languages such as Sanskrit and Tibetan into Chinese.
Chinese written in the 'Phags-pa script, a brief period (c. 1270–1360) when Chinese was written in an alphabetic script.
Transcriptions of Chinese by foreigners, especially the Hangul transcriptions of Koreans such as Sin Sukchu (15th century) and works by various Christian missionaries and other Westerners, the oldest of which is Matteo Ricci's Portuguese–Chinese dictionary of 1583–1588. (Although these transcriptions, as well as the 'Phags-pa evidence, are significant in providing extensive documentation of earlier forms of Mandarin Chinese, their importance for reconstructing Middle Chinese pales in comparison with the much greater breadth provided by the pronunciation of Chinese variants and Sino-Xenic languages, despite their later attestation.)

Karlgren suggested that the Middle Chinese documented in the Qieyun was a live language of the Sui-Tang period. Today, with direct access to the Qieyun, this notion has been replaced by the view that the sound system in the Qieyun represents (or proposes) a literate reading adopted by the literate class of the period throughout the country, not any live language that once existed. For example, in some cases a former three-way distinction A, B, C among initials or finals gave way to a situation where one dialect group of the Qieyun period merged AB vs. C, while another group merged A vs. BC. In these cases, the Qieyun specifies A, B, C as all distinct, even though no living dialect of the time period made such a three-way distinction, and any earlier dialect that did make this distinction would have differed in other ways from the Qieyun system.

The reconstruction of Old Chinese is more controversial than that of Middle Chinese since it has to be extrapolated from the Middle Chinese data. Phonological information concerning Old Chinese is chiefly gained from:

The rhymed texts written before the Qin dynasty, chiefly the Shijing, the earliest anthology of Chinese poetry.
The fact that characters sharing the same phonetic component had very similar pronunciations when the characters were first created.
Comparison of Old Chinese with other Sino-Tibetan dialects.

Old Chinese phonology

See main article: Old Chinese phonology.

There are disagreements over exactly what the Old Chinese (OC) syllable looked like. The following is an approximate consensus, based on the system of William Baxter and (earlier) Li Fang-Kuei:

A syllable consisted of an initial consonant, an optional medial glide pronounced as //r// (but no pronounced as //j// or pronounced as //w//), a main vowel, an optional final (coda) consonant, and an optional post-final consonant pronounced as //s// or pronounced as //ʔ//. There were also various pre-initial consonants. (In recent systems, Baxter also constructs a distinction between "tightly bound" pre-initials pronounced as //C-// and "loosely bound" pre-initials pronounced as //Cə//. The tightly bound pre-initials interact in complex ways with the initial to produce EMC initials, but the loosely bound pre-initials mostly just disappear. Their presence, however, is revealed by the use of a "phonetic complement" with the corresponding tightly bound pre-initial in their character, and sometimes by early borrowings into other languages, especially Hmong–Mien languages and Tai languages.) Pre-initial and post-final consonants were frequently used in morphological derivation.
There was no MC-style tone, but there was a distinction of some sort between type-A and type-B syllables. Depending on the linguist, the distinction is variously thought to reflect either presence or absence of prefixes, an accentual or length distinction on the main vowel, or some sort of register distinction (e.g. pharyngealization of the initial consonant in type-A syllables). These different reconstructions may not be mutually exclusive (e.g. an earlier prefix distinction may have developed into a later register distinction).
Compared with EMC, there were no palatal or retroflex consonants, but there were labiovelar consonants (e.g. pronounced as //kʷ//). Baxter also reconstructs voiceless resonants, e.g. pronounced as //m̥//, pronounced as //n̥//.
There were on the order of six main vowels: pronounced as //a//, pronounced as //i//, pronounced as //e//, pronounced as //o//, pronounced as //u//, pronounced as //ə// (or pronounced as //ɨ//).
The system of final (coda) consonants was similar to EMC; however, there was no pronounced as //wŋ//. Baxter also reconstructs final pronounced as //r//, later becoming pronounced as //n//.

From Old Chinese to Early Middle Chinese

Initial consonants

Palatalisation of velar initials

William H. Baxter pointed out that some of the words that were reconstructed with Middle Chinese palatal initials were perhaps words that had velar initials in Old Chinese. For example, Baxter indicated that Chinese: 熱, reconstructed as nyet in Middle Chinese, was perhaps pronounced as ngjet in Old Chinese. He noted the distinction between Chinese: 支 (tsye), which had a MC palatal initial, and Chinese: 技, which had a velar initial in MC (gjeX, III), despite their common phonetic component. Some modern Min and Hakka varieties nonetheless retain velar initials in both.^[1] The two are reconstructed by Baxter to be kje and grjeʔ respectively in Old Chinese. Baxter posited that the medial -rj- cluster had blocked palatalisation.

Lenition of OC /ɡ/

The voiced velar plosive, *pronounced as /ɡ/, was lenited to a voiced fricative (*pronounced as /ɣ~ɦ/) during this development from Old Chinese to Middle Chinese. According to Laurent Sagart, this change is reflected when the Old Chinese plosive occurred in type A syllables. This is supported by Baxter's observation that modern Min dialects show a difference in pronunciation between Chinese: 厚 and Chinese: 後, both of which belong to the Middle Chinese Chinese: 匣 initial (reconstructed as *pronounced as /ɣ~ɦ/). For the first character, the Fuzhou dialect, Amoy Hokkien and the Teochew dialect, show pronunciations with a velar plosive (*pronounced as /k-/). This pronunciation is consistent with an earlier pronunciation with a velar plosive, most likely the Old Chinese *pronounced as /ɡ/. The second character is pronounced by these dialects with a null initial, possibly reflecting a voiced laryngeal in Old Chinese. The OC *pronounced as /ɡ/ remained intact in Type B syllables, which correspond to Division III Middle Chinese words.

Retroflex initials

The loss of the reconstructed OC medial "r", or the r-infix in Sagart's reconstruction, had not only influenced vowel quality in Middle Chinese but had also caused the retroflexion of coronal consonants.

Sonorant fortition

Both Baxter and Sagart have pointed out that Old Chinese had a series of voiceless sonorants, which typically do not occur in most modern varieties. These voiceless sonorants are pronounced as //m̥//, pronounced as //n̥//, pronounced as //ŋ̊//, pronounced as //ŋ̊ʷ//, pronounced as //l̥//, and possibly pronounced as //r̥//. Their reflexes in Middle Chinese are postulated to be:

**Old Chinese voiceless sonorants and corresponding Middle Chinese reflexes! sonorant !! reflex**
	,
	,,


	,,
	,,

Additionally, the OC lateral consonant is shown to have fortified to a coronal plosive in Type A syllables. Meanwhile, it developed into in Type B syllables, which palatalised in Middle Chinese. Sagart pointed out, however, that these changes are not true if the lateral is preceded by a reconstructed prefix. This position, whereby OC underwent fortition to become a plosive, is upheld by Baxter. Baxter pointed out xiesheng contacts between plosive series, sibilants and MC, and made the following reconstructions:

Chinese: 脫	<	<		'take off clothes'
Chinese: 兌	<	<	<	'glad'
Chinese: 說	<	<		'speak', 'explain'
Chinese: 悅	<	<		'pleased', 'glad'

Note that these reconstructions included voiceless sonorants, of which the developments have been consistent with their fortition and reflexes.

According to Sagart, this fortition was impeded if the lateral was preceded by an unspecified consonant or minor syllable. He reconstructs Chinese: 林 as, yielding MC . Furthermore, MC was said to have derived from an OC .

Medials and finals

The following are the main developments that produced Early Middle Chinese (EMC) from Old Chinese (OC):

Type-B syllables developed a pronounced as //j// glide. This glide combined with a previous coronal consonant to produce new palatal consonants. It also sometimes turned a preceding velar or laryngeal into a palatal sibilant and/or raised a following main vowel. (Contrarily, type-A syllables often lowered a following main vowel, with a high vowel diphthongizing, e.g. pronounced as //i// becoming pronounced as //ej//.)
The pronounced as //r// glide eventually disappeared, but before doing this it turned a previous coronal consonant into a retroflex consonant, and fronted (and often centralized) a following main vowel.
Changes to final consonants: pronounced as //r// became pronounced as //n//; pronounced as //j// dropped after pronounced as //a//; pronounced as //k// dropped before pronounced as //s//; pronounced as //t// before pronounced as //s// became pronounced as //j// (which remained, even after pronounced as //a//).
Tones developed from the former suffixes (post-final consonants), with MC tone 3 ("departing", reconstructed as having falling pitch) developing from pronounced as //s//, tone 2 ("rising", reconstructed as having rising pitch) developing from pronounced as //ʔ//, and tone 1 ("level", reconstructed as having level pitch) from other syllables. As the suffixes were part of the derivational morphology of OC, this often produced MC tonal variation, either in a single word or in semantically related words.
Back vowels pronounced as //o// and pronounced as //u//, when followed by a coronal consonant (pronounced as //j//, pronounced as //n//, pronounced as //r//, pronounced as //t//), broke into pronounced as //w// plus a front vowel.
Labiovelar initials were reanalyzed as a velar followed a pronounced as //w// glide, which merged with pronounced as //w// from breaking of pronounced as //o// and pronounced as //u//.
The main vowel developed in various complicated ways, depending on the surrounding sounds. For example, in type-A syllables, according to Baxter's reconstruction, OC pronounced as //ɨj// became pronounced as //-ɛj// after pronounced as //r//; otherwise, pronounced as //-ej// after coronal initials, pronounced as //-oj// after velar initials, and pronounced as //-woj// after labial initials. In type-B syllables, however, OC pronounced as //ɨj// became pronounced as //-ij// after pronounced as //r// or coronal initial, but pronounced as //-jɨj// otherwise.

Note that OC type-B syllables correspond closely to division-III, and (in Baxter's reconstruction at least) to EMC syllables with pronounced as //i// or pronounced as //j//.

From Early Middle Chinese to Late Middle Chinese

To a large degree, Late Middle Chinese (LMC) of c. 1000 AD can be viewed as the direct ancestor of all Chinese varieties except Min Chinese; in other words, attempting to reconstruct the parent language of all varieties excluding Min leads no farther back than LMC. See below for more information. Exactly which changes occurred between EMC and LMC depends on whose system of EMC and/or LMC reconstruction is used. In the following, Baxter's EMC reconstruction is compared to Pulleyblank's LMC reconstruction. To the extent that these two systems reflect reality, they may be significantly farther apart than the 400 or so years normally given between EMC and LMC, since Baxter's EMC system was designed to harmonize with Old Chinese while Pulleyblank's LMC system was designed to harmonize with later Mandarin developments. Furthermore, Baxter considers all the distinctions of the Qieyun to be real, while many of them are clearly anachronisms that no longer applied to any living form of the language in 600 AD. Finally, some of the resulting "changes" may not be actual changes at all so much as conceptual differences in the way the systems have been reconstructed; these are noted below.

Changes mostly involve initials, medials, and main vowels.

The class of EMC palatals is lost, with palatal sibilants becoming retroflex sibilants and the palatal nasal becoming a new phoneme pronounced as //ɻ//.
A new class of labiodentals emerges, from EMC labials followed by pronounced as //j// and an EMC back vowel.
EMC complex medial pronounced as //jw// becomes pronounced as //y// pronounced as /[ɥ]/, producing a six-way medial distinction between none, pronounced as //i//, pronounced as //ji//, pronounced as //w//, pronounced as //y//, pronounced as //jy//. The phonemic glides pronounced as //i// and pronounced as //y// are vocalic pronounced as /[i]/ and pronounced as /[y]/ before short vowels pronounced as //a// and pronounced as //ə//, but semivocalic pronounced as /[j]/ and pronounced as /[ɥ]/ before long vowel pronounced as //aː//.
The eight-way EMC distinction in main vowels is significantly modified, developing into a system with high vowels pronounced as //i//, pronounced as //y//, pronounced as //u// and (marginally) pronounced as //ɨ//, and non-high vowels pronounced as //a//, pronounced as //ə//, pronounced as //aː//. However, this is best analyzed as a system with a four-way main vowel distinction between no vowel and the three phonemic vowels pronounced as //a//, pronounced as //ə//, pronounced as //aː//; high vowels are then analyzed as phonemically consisting of a medial and no main vowel (pronounced as //ɨ// is phonemically a syllable containing only a bare consonant, with no medial and no main vowel).
High front medials/main vowels pronounced as //i// and pronounced as //y// are lost after EMC retroflex sibilants, prior to merging with palatals; contrarily, a pronounced as //j// sometimes appears after guttural consonants.

Few changes to final consonants occur; the main ones are the loss of pronounced as //j// after a high vowel, the disappearance of pronounced as //ɨ// (which might or might not be reckoned as a final consonant) in the rhyme pronounced as //-ɛɨ//, and (potentially) the appearance of pronounced as //jŋ// and pronounced as //jk// (which are suspect in various ways; see below).

The tones do not change phonemically. However, allophonically they evidently split into a higher-pitched allophone in syllables with voiceless initials, and a lower-pitched allophone in syllables with voiced initials. All modern Chinese varieties reflect such a split, which produces a new set of phonemic tones in most varieties due to later loss of voicing distinctions.

The following changes are in approximate order.

Labiodentalization

Early Middle Chinese (EMC) labials (pronounced as //p, pʰ, b, m//) become Late Middle Chinese (LMC) labiodentals (pronounced as //f, f, bv, ʋ//, possibly from earlier affricates) in certain circumstances involving a following glide. When this happens, the glide disappears. Using Baxter's reconstruction, the triggering circumstances can be expressed simply as whenever a labial is followed by a glide pronounced as //j// and the main vowel is a back vowel; other reconstructions word the rule differently. According to Baxter, however, labiodentalization might have occurred independently of each other in different areas. For example, some variants retain OC /m/ before the glide, while in other variants, it had developed into a labiodental initial (Chinese: 微): compare Cantonese Chinese: 文 man⁴ and Mandarin Chinese: 文 wén.

Vowel changes and mergers

In approximate order:

a. Some early changes

pronounced as //-e-// (without medial pronounced as //j//) becomes pronounced as //-jie-// after labial, velar and glottal consonants, pronounced as //-ie-// elsewhere. (Along with certain later changes, this means that, synchronically, an LMC speaker cannot distinguish original Division IV syllables from original III-4 chongniu syllables; likewise for Division III vs. III-3 chongniu. This explains why these chongniu pairs end up in grades 3 and 4, respectively.) A change of this sort occurs in all modern reconstructions of EMC and LMC, and is responsible for the creation of Division IV.
pronounced as //ŋ// and pronounced as //k// after a non-high front vowel (pronounced as //æ//, pronounced as //ɛ//, pronounced as //e//) become pronounced as //jŋ// and pronounced as //jk// (often viewed as palatalized final consonants). This may not be a real change; Pulleyblank's EMC system already includes pronounced as //jŋ// and pronounced as //jk//, whereas they are not present in any modern dialect. The most direct evidence for these sounds comes from Sino-Vietnamese vocabulary, which was borrowed from LMC and does reflect the sounds. In Pulleyblank's LMC system, pronounced as //jŋ// only occurs in the rhymes pronounced as //-aːjŋ// and pronounced as //-iajŋ//, which contrast with pronounced as //-aŋ// and pronounced as //-iaŋ//; likewise for pronounced as //-jk//. (pronounced as //-a-// and pronounced as //-aː-// do not contrast before velar finals, except possibly after EMC retroflex sibilants.) These contrasts would be reflected in some other way in a system without pronounced as //-jŋ// and pronounced as //-jk//.
The sequence pronounced as //-ɛɨ// merges into pronounced as //-ɛj//. In some dialects, however, it instead remains separate (perhaps in the form pronounced as //-ɛ//, not otherwise occurring in Baxter's system). According to Abraham Y.S. Chan, the former change was characteristic of the Luoyang dialect, while the latter reflected the Jinling dialect. The distinction between the two surfaces in Standard Mandarin as a respective distinction between ai and ya, or pai/pei and pa.

b. Mergers of high-front finals

EMC finals pronounced as //je//, pronounced as //ij//, pronounced as //i//, pronounced as //jɨj// merge into pronounced as //i//; likewise pronounced as //jɨ-// (which occurs before pronounced as //n// and pronounced as //t//) becomes pronounced as //i-//.
The hekou equivalents of the above (with additional medial pronounced as //w//) become pronounced as //yj//.
The III-4 chongniu equivalents of both of the above become pronounced as //ji// and pronounced as //jyj//, respectively.

c. Changes involving high back vowels, mostly generating pronounced as //ə//

EMC final pronounced as //u// becomes pronounced as //uə//; likewise, pronounced as //ju// becomes pronounced as //juə//. (Possibly not a real change.)
The EMC sequence pronounced as //-uw-// becomes pronounced as //-əw-//, and pronounced as //-jow-// becomes pronounced as //-yw-//.
All remaining pronounced as //-o-//, except in the sequences pronounced as //-oj// and pronounced as //-om//, become pronounced as //ə//.
All non-final pronounced as //-jə-// become pronounced as //-i-//, and pronounced as //-wə-// become pronounced as //-u-//.

d. Changes involving non-high vowels

When there is no medial pronounced as //j//, remaining pronounced as //-æ-// and pronounced as //-ɛ-// become pronounced as //-aː-//.
All remaining pronounced as //-æ-//, pronounced as //-e-//, pronounced as //-ɛ-//, pronounced as //-o-// become pronounced as //-a-//.

e. Changes involving medials

Non-final pronounced as //-jwi-// and pronounced as //-wji-// become pronounced as //-jy-//, while pronounced as //-jw-, wj-, -wi-, -ju-// become pronounced as //-y-//.
The medials pronounced as //-i-// and pronounced as //-j-// merge into a single phoneme, with pronounced as //-i-// occurring before pronounced as //-a-// and pronounced as //-ə-//, and pronounced as //-j-// elsewhere (before pronounced as //-aː-// and high vowels). Medial pronounced as //-u-// and pronounced as //-w-// merge in the same way.

f. Medial and similar changes triggered by specific initials

pronounced as //j// appears between a guttural consonant (velar or laryngeal) and a directly following pronounced as //-aː-//. This sets the stage for palatalized syllables in Standard Mandarin such as jia and xian.
A final pronounced as //i// directly following a dental sibilant becomes pronounced as //ɨ// (presumably pronounced as /[z̩]/).
After an EMC retroflex sibilant, a directly following high-front vowel or glide (pronounced as //i// or pronounced as //j//, along with front-rounded pronounced as //y// or pronounced as //ɥ//, if reconstructed) is removed, specifically:
1. A glide pronounced as //j// is lost.
2. A main vowel pronounced as //i// becomes pronounced as //ə// if non-final.
3. A main vowel pronounced as //i// becomes pronounced as //ɨ// (presumably pronounced as /[ʐ̩]/) if final.
4. If high front-rounded vocalics are reconstructed, they unround (pronounced as //y// -> pronounced as //u//, pronounced as //ɥ// -> pronounced as //j//).

Late changes to initial consonants

EMC palatals become retroflex, with palatal sibilants merging with retroflex sibilants and palatal pronounced as //ɲ// becoming a new phoneme pronounced as //ɻ// (still reflected as such in Standard Mandarin).
Voiced consonants are thought to have become breathy voiced. This is a non-phonemic change; it is postulated to account for the breathy-voiced consonants still present in Wu Chinese, and the common outcome elsewhere of originally voiced consonants as unvoiced aspirates. Karlgren reconstructed breathy voicing for EMC as well, but this is no longer thought to be the case due to lack of any evidence for it in borrowings to or from EMC (especially involving Sanskrit and Middle Indic languages, which had a distinction between normally voiced and breathy-voiced consonants). This may have occurred to differing extents in different places:
- Among the two Chinese varieties that have not merged voiced and unvoiced consonants, Wu reflects the EMC voiced consonants as breathy voiced, but Old Xiang reflects them as normally voiced.
- Gan and Hakka reflect all EMC voiced consonants as unvoiced aspirates, but many others (e.g. Mandarin) only have such aspirated consonants in syllables within the yang ping tone, the light-level tone. Similarly, many words in Hokkien Min that are read with the yang ping tone are unaspirated, developing from Old Chinese and Middle Chinese voiced initials. Some words have aspirated readings, however, which led some linguists to believe that there may be a voiced aspirated series in the Proto-Min language, which had branched off from Old Chinese before it developed into Middle Chinese.

From Late Middle Chinese to Standard Mandarin

Initials

Voiced stops and sibilants were devoiced; the stops became aspirated in syllables with tone 1 and unaspirated otherwise.
Retroflex stops merged into retroflex affricates.
Sonorants: retroflex nasal merged into alveolar nasal; pronounced as //ɻ//, formerly palatal nasal in EMC, became pronounced as //ɻ~ʐ// or sometimes the syllable er; velar nasal was dropped.
Before high front vowel or glide, velars ("back-tooth" stops and "throat" fricatives) and alveolar sibilants palatalized and merged as a new series of alveolo-palatal sibilants.
The glottal stop pronounced as //ʔ// was dropped; before a high front glide, the voiced velar fricative pronounced as //ɣʰ// was dropped.
Labiodentals: pronounced as //v//, devoiced, merged into pronounced as //f//; pronounced as //ʋ// became pronounced as //w//. (LMC labiodentals resulted from EMC labials preceding pronounced as //j// + back vowel.)

The following table illustrates the evolution of initials from Early Middle Chinese, through Late Middle Chinese, to Standard Mandarin.

Mapping of EMC, LMC, and Standard Mandarin initials
EMC	LMC	Old Mandarin	Modern Standard Mandarin
pronounced as //m//	pronounced as //m//	pronounced as //m//	pronounced as //m//
pronounced as //m//	pronounced as //ɱ// before pronounced as //j// followed by back vowel	pronounced as //v//	pronounced as //w//
pronounced as //n//	pronounced as //n//	pronounced as //n//	pronounced as //n//
pronounced as //ɳ//	pronounced as //ɳ//	pronounced as //n//	pronounced as //n//
pronounced as //ɲ//	pronounced as //ɲ//	pronounced as //ʒ//	pronounced as //ʐ~ɻ// or syllable pronounced as /[əɻ]/
pronounced as //ŋ//	pronounced as //ŋ//	dropped or pronounced as //n//	dropped or pronounced as //n//
pronounced as //l//	pronounced as //l//	pronounced as //l//	pronounced as //l//
pronounced as //p//, pronounced as //pʰ//, pronounced as //b//	pronounced as //p//, pronounced as //pʰ//, pronounced as //bʰ//	pronounced as //p//, pronounced as //pʰ//	pronounced as //p//, pronounced as //pʰ//
	pronounced as //f//, pronounced as //v// before pronounced as //j// followed by back vowel	pronounced as //f//	pronounced as //f//
pronounced as //t//, pronounced as //tʰ//, pronounced as //d//	pronounced as //t//, pronounced as //tʰ//, pronounced as //dʰ//	pronounced as //t//, pronounced as //tʰ//	pronounced as //t//, pronounced as //tʰ//
pronounced as //ʈ//, pronounced as //ʈʰ//, pronounced as //ɖ//	pronounced as //ʈ//, pronounced as //ʈʰ//, pronounced as //ɖʰ//	pronounced as //ʈʂ//, pronounced as //ʈʂʰ//	pronounced as //ʈʂ//, pronounced as //ʈʂʰ//
pronounced as //tʃ//, pronounced as //tʃʰ//, pronounced as //dʒ//	pronounced as //ʈʂ//, pronounced as //ʈʂʰ//, pronounced as //ɖʐʰ//
pronounced as //tɕ//, pronounced as //tɕʰ//, pronounced as //dʑ//	pronounced as //tɕ//, pronounced as //tɕʰ//, pronounced as //dʑ//
pronounced as //ts//, pronounced as //tsʰ//, pronounced as //dz//	pronounced as //ts//, pronounced as //tsʰ//, pronounced as //dzʰ//	pronounced as //ts//, pronounced as //tsʰ//	pronounced as //ts//, pronounced as //tsʰ//
		pronounced as //ts//, pronounced as //tsʰ//	--rowspan-->	pronounced as //tɕ//, pronounced as //tɕʰ// before pronounced as //i//, pronounced as //y//, or pronounced as //j//
pronounced as //k//, pronounced as //kʰ//, pronounced as //ɡ//	pronounced as //k//, pronounced as //kʰ//, pronounced as //ɡʰ//	pronounced as //tɕ//, pronounced as //tɕʰ// before pronounced as //i//, pronounced as //y//, or pronounced as //j//	pronounced as //tɕ//, pronounced as //tɕʰ//
		--rowspan-->	pronounced as //k//, pronounced as //kʰ//	pronounced as //k//, pronounced as //kʰ//
pronounced as //ʃ//, pronounced as //ʒ//	pronounced as //ʂ//, pronounced as //ʐ//	pronounced as //ʂ//	pronounced as //ʂ//
pronounced as //ɕ//, pronounced as //ʑ//	pronounced as //ɕ//, pronounced as //ʑ//	pronounced as //ʂ//	pronounced as //ʂ//
pronounced as //s//, pronounced as //z//	pronounced as //s//, pronounced as //zʰ//	pronounced as //s//	pronounced as //s//
pronounced as //s//, pronounced as //z//	pronounced as //s//, pronounced as //zʰ//	pronounced as //s//	--rowspan-->	pronounced as //ɕ// before pronounced as //i//, pronounced as //y//, or pronounced as //j//
pronounced as //x//, pronounced as //ɣ//	pronounced as //x//, pronounced as //ɣʰ// or dropped before pronounced as //j//	pronounced as //ɕ// before pronounced as //i//, pronounced as //y//, or pronounced as //j//	pronounced as //ɕ//
pronounced as //x//, pronounced as //ɣ//		--rowspan-->	pronounced as //x//	pronounced as //x//
pronounced as //ʔ//	pronounced as //ʔ//	dropped	dropped

Finals

In general, Mandarin preserves the LMC system of medials and main vowels fairly well (better than most other varieties) but drastically reduces the system of codas (final consonants). The systematic changes to medials and main vowels are loss of the chongniu distinctions i/ji and y/jy (which occur in all modern varieties) and loss of the distinction between pronounced as //a// and pronounced as //aː//. All final stop consonants are lost, and final nasals are reduced to a distinction between pronounced as //n// and pronounced as //ŋ//.

The exact changes involving finals are somewhat complex and not always predictable, in that in many circumstances there are multiple possible outcomes. The following is a basic summary; more information can be found in the table of EMC finals in Middle Chinese.

Changes to medials:

LMC medial classes pronounced as //ji// and pronounced as //i// merge, losing the pronounced as //j//; likewise for pronounced as //jy// and pronounced as //y//.
LMC front medials pronounced as //i// and pronounced as //y// (and corresponding main vowels) are lost after retroflex consonants. The operation of this change is exactly as for the similar change that occurred after EMC retroflexes. The difference between EMC retroflex and palatal sibilants is sometimes reflected in Mandarin, for example between she (EMC retroflex) and shi (EMC palatal).
LMC medial pronounced as //u// is lost after labials, and pronounced as //y// unrounds to pronounced as //i//.
LMC medial pronounced as //u// is sometimes lost after pronounced as //l// and pronounced as //n//.
Various other changes occur after particular initials.

Changes to main vowels:

Long vowel pronounced as //aː// shortens.
Various other complex changes; see Middle Chinese.

Changes to codas:

LMC coda pronounced as //m// becomes pronounced as //n//.
LMC stop codas pronounced as //p,t,k// are dropped, with pronounced as //k// sometimes becoming pronounced as //j// and pronounced as //w//.
LMC complex codas pronounced as //wŋ// and pronounced as //wk// become simple codas; likewise for pronounced as //jŋ// and pronounced as //jk//, but often with effects on the preceding vowel.

Tones

A tone split occurs as a result of the loss of the voicing distinction in initial consonants. The split tones then merge back together except for Middle Chinese tone 1; hence Middle Chinese tones 1,2,3 become Mandarin tones 1,2,3,4. (Some syllables with original Mandarin tone 3 move to tone 4; see below.) Syllables with a final stop consonant, originally toneless, get assigned to one of the four modern tones; for syllables with Middle Chinese unvoiced initials, this happens in a completely random fashion.

The specific relationship between Middle Chinese and modern tones:

V− = unvoiced initial consonant (Chinese: 全清 or Chinese: 次清)

L = sonorant initial consonant (Chinese: 次濁)

V+ = voiced initial consonant (not sonorant) (Chinese: 全濁)

Middle Chinese	Tone	Ping (Chinese: 平)			Shang (Chinese: 上)			Qu (Chinese: 去)			Ru (Chinese: 入)
Middle Chinese	Initial	V−	L	V+	V−	L	V+	V−	L	V+	V−	L	V+
Standard Chinese	Tone name	Yin Ping (Chinese: 陰平, 1)	Yang Ping (Chinese: 陽平, 2)		Shang (Chinese: 上, 3)		Qu (Chinese: 去, 4)				redistributed with no pattern	to Qu	to Yang Ping
Standard Chinese	Tone contour	55	35		214		51				redistributed with no pattern	to 51	to 35

Branching off of the modern varieties

Most modern Chinese varieties can be viewed as descendants of Late Middle Chinese (LMC) of c. 1000 AD. For example, all modern varieties other than Min Chinese have labiodental fricatives (e.g. pronounced as //f//), a change that occurred after Early Middle Chinese (EMC) of c. 600 AD. In fact, some post-LMC changes are reflected in all modern varieties, such as the loss of the chongniu distinction (between e.g. pronounced as //pian// and pronounced as //pjian//, using Edwin Pulleyblank's transcription). Other changes occurring in most modern varieties, such as the loss of initial voiced obstruents and corresponding tone split, are areal changes that spread across existing dialects; possibly the loss of chongniu distinctions can be viewed in the same way.

Min Chinese, on the other hand, is known to have branched off even before Early Middle Chinese (EMC) of c. 600 AD. Not only does it not reflect the development of labiodental fricatives or other LMC-specific changes, but a number of features already present in EMC appear never developed. An example is the series of retroflex stops in EMC, which developed from earlier alveolar stops followed by pronounced as //r//, and which later merged with retroflex sibilants. In Min, the corresponding words still have alveolar stops. This difference can be seen in the words for "tea" borrowed into various other languages: For example, Spanish te, English tea vs. Portuguese chá, English chai, reflecting the Amoy (Southern Min) pronounced as /[te]/ vs. Standard Mandarin pronounced as /[ʈʂʰa]/.

In the case of distinctions that appear to have never developed in Min, it could be argued that the ancestral language did in fact have these distinctions, but they later disappeared. For example, it could be argued that Min varieties descend from a Middle Chinese dialect where retroflex stops merged back into alveolar stops instead of merging with retroflex sibilants. However, this argument cannot be made if there are distinctions in Min that do not appear in EMC (and which reflect ancient features going back to Old Chinese or – ultimately – even Proto-Sino-Tibetan, so that they cannot be explained as secondary developments), and this does indeed appear to be the case. In particular, Proto-Min (the reconstructed ancestor of the Min varieties) appears to have had six series of stops corresponding to the three series (unvoiced, unvoiced aspirated, voiced) of Middle Chinese. The additional three series are voiced aspirated (or breathy voiced), unvoiced "softened", and voiced "softened".

Evidence for the voiced aspirated stops comes from tonal distinctions among the stops. When voiced stops became unvoiced in most varieties and triggered a tone split, words with these stops moved into new lowered (so-called yang) tone classes, while words with unvoiced stops appeared in raised (so-called yin) tone classes. The result is that the yin classes have words with both aspirated and unaspirated stops, while the yang classes have only one of the two, depending on how the formerly voiced stops developed. Min varieties, however, have both kinds of words in yang classes as well as yin classes. This has caused scholars to reconstruct voiced aspirates (probably realized as breathy voiced consonants) in Proto-Min, which develop into unvoiced aspirates in yang-class words.

In addition, in some Min varieties, some words with EMC stops are reflected with stops while others are reflected with "softened" consonants, typically voiced fricatives or approximants. Such "softened stops" occur in both yin and yang classes, suggesting that Proto-Min had both unvoiced and voiced "softened stops". Presumably "softened stops" were actually fricatives of some sort, but it is unclear exactly what they were.

Scholars generally assume that these additional Proto-Min sounds reflect distinctions in Old Chinese that vanished in Early Middle Chinese but remained in Proto-Min. Until recently, no reconstructions of Old Chinese specifically accounted for the Proto-Min distinctions, but the recent reconstruction of William Baxter and Laurent Sagart accounts for both voiced aspirates and softened stops. According to them, voiced aspirates reflect Old Chinese stops in words with particular consonant prefixes, while softened stops reflect Old Chinese stops in words with a minor syllable prefix, so that the stop occurred between vowels. The postulated development of the softened stops is very similar to the development of voiced fricatives in Vietnamese, which likewise occur in both yin and yang varieties and are reconstructed as developing from words with minor syllables.

References

Works cited

External links

Chinese Phonological History, Dylan W.H. Sung
Introduction to Chinese Historical Phonology, Guillaume Jacques
Traditional Chinese phonology
Periodization of Chinese Phonology, Marjorie K.M Chan
Reconstruction of Middle Chinese and Old Chinese as well as intermediate forms done by Sergei Starostin

Notes and References

Schuessler . Axel . Palatalization of Old Chinese Velars / 上古汉语舌根音的腭化 . Journal of Chinese Linguistics . 1996 . 24 . 2 . 197–211 . 24 October 2023 . 0091-3723.