Chinese character structures explained
Chinese character structures are the patterns or rules by which characters are formed from their writing units. There are two aspects of Chinese character structure. External structure concerns the strokes, components and whole characters, and their structural relations, purely at the level of character form. Internal structure concerns the relationships between the forms, sounds and meanings of Chinese characters.
External structures
See main article: Chinese character forms. The external structure of a Chinese character concerns how the writing units are combined, level by level, into a complete character. There are three levels of structural units: strokes, components, and whole characters. For example, the character 字 (character) is composed of two components, each of which is composed of three strokes: 字 = 宀(㇔㇔㇇) + 子(㇇㇚㇐).
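The three levels can be modelled as nested records. The following is a minimal sketch, using the decomposition into 宀 and 子 given above; the class and field names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Component:
    form: str                  # the component as written, e.g. "宀"
    strokes: Tuple[str, ...]   # its strokes, in the order listed in the text


@dataclass(frozen=True)
class Character:
    form: str
    components: Tuple[Component, ...]

    def stroke_count(self) -> int:
        # Total strokes = sum of the strokes of every component.
        return sum(len(c.strokes) for c in self.components)


# 字 = 宀(㇔㇔㇇) + 子(㇇㇚㇐), as in the example above.
zi = Character(
    "字",
    (
        Component("宀", ("㇔", "㇔", "㇇")),
        Component("子", ("㇇", "㇚", "㇐")),
    ),
)

print(zi.stroke_count())
```

Keeping strokes inside components, and components inside the character, mirrors the stroke-component-character hierarchy directly.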
Strokes
See main article: Chinese character strokes.
Strokes are the smallest building units of Chinese characters. When writing a Chinese character, the trace of a dot or a line left on the writing material (such as paper) from pen-down to pen-up is called a stroke.
Strokes combine with each other in a Chinese character in different ways. There are three types of combinations between two strokes:
- Separation: the strokes are separated from each other. Such as:,, .
- Connection: the strokes are connected, such as,,,,, .
- Intersection: the strokes are intersected. Such as:,, .
Components
See main article: Chinese character components.
Chinese character components are building blocks of Chinese characters that are composed of strokes. In most cases, a component is larger than a stroke (i.e., consists of more than one stroke) and smaller than the whole character (i.e., combines with other components to form a character). For example, in the character 件 there are two components, 亻 and 牛, each with more than one stroke: 亻 (㇓㇑) and 牛 (㇓㇐㇐㇑). In the special cases of one-stroke characters, such as "" and "", a stroke is at once a component and a character.
Chinese character component analysis divides a character into its components. There are two ways of dividing a character: hierarchical dividing and plane dividing. Hierarchical dividing separates the character layer by layer, from larger to smaller components, until the primitive components are reached. Plane dividing separates out the primitive components in a single step. Hierarchical dividing can display the external structure of a Chinese character, while plane dividing can be regarded as omitting the higher levels of division and writing out the final primitive components directly.
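The two ways of dividing can be sketched as two traversals of the same tree. The example below is only an illustration: it uses the external structure of 穎 (⿰⿱匕禾頁) given later in this article, assumes that 匕, 禾 and 頁 are primitive components, and labels the unnamed intermediate component by its description sequence. The function names and the tree encoding are hypothetical.

```python
from typing import List, Tuple, Union

# A node is either a primitive component (a plain string)
# or a pair (label, [child nodes]) for a decomposable unit.
Node = Union[str, Tuple[str, List["Node"]]]

# 穎 = ⿰(⿱(匕, 禾), 頁); 匕, 禾 and 頁 are assumed to be primitive here.
YING: Node = ("穎", [("⿱匕禾", ["匕", "禾"]), "頁"])


def hierarchical_divide(node, depth=0):
    """Return (depth, label) pairs for every unit, from larger to smaller."""
    if isinstance(node, str):
        return [(depth, node)]
    label, children = node
    rows = [(depth, label)]
    for child in children:
        rows.extend(hierarchical_divide(child, depth + 1))
    return rows


def plane_divide(node):
    """Return only the primitive components, skipping intermediate levels."""
    if isinstance(node, str):
        return [node]
    _, children = node
    primitives = []
    for child in children:
        primitives.extend(plane_divide(child))
    return primitives


for depth, label in hierarchical_divide(YING):
    print("  " * depth + label)
print(plane_divide(YING))
```

The plane result is simply the leaves of the hierarchical tree, which is why plane dividing can be read as hierarchical dividing with the upper levels omitted.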
Whole characters
See main article: Chinese whole characters.
A Chinese whole character is a complete character. It lies at the final level of the stroke-component-character composition. According to their structures, Chinese characters can be divided into undecomposable characters and decomposable characters.
An undecomposable character consists of one primitive component, which is formed directly from strokes and cannot be decomposed into smaller components, for example, "".
A decomposable character consists of more than one component. There are two frequently used modes of component combination in the study of Chinese character structures: first-level component combination and primitive component combination.
According to first-level component combination, the structures of decomposable characters can be divided into 13 categories:[1]
- Left to right (⿰, 2FF0), for example:,, and .
- Left to middle and right (⿲, 2FF2):, and .
- Above to below (⿱, 2FF1):, and .
- Above to middle and below (⿳, 2FF3):, and .
- Full surround (⿴, 2FF4):, and
- Surround from above (⿵, 2FF5):, and 風
- Surround from below (⿶, 2FF6):, and 函
- Surround from left (⿷, 2FF7):, and 匣
- Surround from upper left (⿸, 2FF8):, and .
- Surround from upper right (⿹, 2FF9):, and .
- Surround from lower left (⿺, 2FFA):, and .
- Surround from lower right (N/A):斗 and .
- Overlaid (⿻, 2FFB):, and .
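The structure symbols in this list are prefix operators, so a description sequence such as ⿰⿱匕禾頁 can be read mechanically. The sketch below assumes that ⿲ and ⿳ take three operands and every other symbol takes two; the function names are hypothetical. "Surround from lower right" is omitted because the list above gives no symbol for it.

```python
# Structure symbol -> (category name from the list above, number of operands).
IDC = {
    "⿰": ("left to right", 2),
    "⿱": ("above to below", 2),
    "⿲": ("left to middle and right", 3),
    "⿳": ("above to middle and below", 3),
    "⿴": ("full surround", 2),
    "⿵": ("surround from above", 2),
    "⿶": ("surround from below", 2),
    "⿷": ("surround from left", 2),
    "⿸": ("surround from upper left", 2),
    "⿹": ("surround from upper right", 2),
    "⿺": ("surround from lower left", 2),
    "⿻": ("overlaid", 2),
}


def parse_ids(seq):
    """Parse a prefix-notation description sequence into a nested tuple."""

    def parse(i):
        ch = seq[i]
        if ch not in IDC:
            return ch, i + 1          # an ordinary component: a leaf
        _, arity = IDC[ch]
        parts = []
        j = i + 1
        for _ in range(arity):        # read as many operands as the symbol needs
            part, j = parse(j)
            parts.append(part)
        return (ch, *parts), j

    tree, end = parse(0)
    if end != len(seq):
        raise ValueError("trailing characters after a complete sequence")
    return tree


def first_level_structure(seq):
    """Name the first-level structure of a sequence, per the list above."""
    tree = parse_ids(seq)
    return IDC[tree[0]][0] if isinstance(tree, tuple) else "undecomposable"


print(parse_ids("⿰⿱匕禾頁"))
print(first_level_structure("⿲辛言辛"))
```

Only the outermost symbol decides the first-level category; nested symbols describe how the first-level components are themselves built.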
According to primitive component combination, the structures of decomposable characters can be divided into:
- A. For characters composed of two primitive components, there are 9 different structures, as shown by the following example characters: .
- B. For characters composed of three components, there are 21 different structures, such as: .
- C. For characters composed of four components, there are 20 different structures, such as: .
- D. For characters composed of five components, there are 20 different structures, such as: .
- E. For characters composed of six components, there are 10 different structures, such as: .
- F. For characters composed of seven components, there are 3 different structures, such as: .
- G. For characters composed of eight components, there is 1 structure, such as: .
- H. For characters composed of nine components, there is 1 structure, such as: .
Internal structures
See main article: Chinese character internal structures.
In the analysis of internal structures, Chinese characters are decomposed into internal structural components according to their relation to the sound and meaning of the characters.
Traditional classification
In Shuowen Jiezi, Xu Shen proposed six categories of Chinese characters (the liushu, or "six writings"):
- Pictograms: single-semantic-component characters that are drawings of the objects they represent.
- Simple ideograms: characters that express an abstract idea with an iconic form.
- Compound ideographs: characters that combine two or more semantic components to indicate the meaning of the character.
- Phono-semantic characters: characters that consist of phonetic components and semantic components.
- Derivative cognates: pairs of characters that had similar Old Chinese pronunciations and may have had the same etymological root.
- Rebus (phonetic loan) characters: characters borrowed to write other morphemes with similar pronunciations.
Modern classification
The traditional liushu presupposed that every internal component, usually called pianpang (偏旁), represents either the sound or the meaning of the character. However, after the long evolution of the Chinese writing system, quite a few components can no longer effectively play these roles and have become pure form components, or pure signs. From the point of view of internal structure, modern Chinese characters are composed of semantic components, phonetic components and pure form components, which combine to form seven categories of modern Chinese characters:
Semantic component characters are composed of semantic components and include:
- Pictograms, such as (field), (well), (door).
- Simple ideograms, such as (one), (two), (blade).
- Compound ideographs. For example, (take): (close) (hands) together to take; (break apart): (separate) with two (hands); (follow): one (person) follows another person; (tears): (water) from (eyes).
- Special methods, such as (cannot): turn (can) to the opposite (right) side; (none, not have): (have) taken away the (contents).
Phonetic component characters are composed of phonetic components. For example,
- Phonetic-loan, for example, character (huā, flower) was borrowed to mean 'spending' (huā).
- Used in a transliterated foreign word, e.g. the characters in words (dá, dozen) and (mǎdá, motor).
- Multi-phonetic component characters. For example, 新 was originally a semantic-phonetic character, but its modern meaning of "new" has nothing to do with the original semantic component 斤 (which means "0.5 kg" in modern Chinese), though the sounds are similar. In this way, 新 (xīn) now has two phonetic components: 亲 (qīn) and 斤 (jīn). Another example is the Vietnamese Chinese character (blăng, 'moon'), which was created as a compound of 巴 (ba) and 陵 (lăng).
Pure form characters are composed of form components, which represent neither the sound nor the meaning of the characters. For example:
- (Sun): The character in modern regular script is no longer round like the Sun.
- (wide, broad): The phonetic component in the traditional character has been omitted for this simplified character.
- (deer): The oracle form resembled a deer.
Semantic-phonetic characters, also called "phono-semantic characters", consist of semantic components and phonetic components. There are six combinations:
- Left meaning (semantic) and right sound (phonetic), such as (sound: gān, meaning: liver), (jīng, fear), (hú, lake);
- Right meaning and left sound, such as (wǔ, parrot), (gāng, firm), (shēng, nephew);
- Upper meaning and lower sound: (lín, rain), (máo, cogongrass) and (gān, pole);
- Lower meaning and upper sound: (yú, bowl), (dài, Mount Tai), (shā, shark);
- Outer meaning and inner sounds: (yǎng, itch), (yuán, garden), (zhōng, heart), (zuò, seat), (qí, flag);
- Inner meaning and outer sound: (biàn, braid), (mèn, dull), (mó, imitation).
Semantic-form characters are composed of semantic components and pure form components. Many of these characters were originally semantic-phonetic characters. Owing to later changes in the shape or pronunciation of the phonetic components or of the characters themselves, the phonetic components could no longer effectively represent the pronunciation of the character and became pure form components. For example:
- (bù, cloth): used to have semantic (scarf) and phonetic (fù), the phonetic component is no longer .
- (jí, urgent): used to have semantic (heart) and phonetic (jí). Now the upper component no longer looks like .
- (jī, chicken), is a (bird), but not read as (yòu).
Phonetic-form characters are composed of phonetic components and pure form components. They mostly came from ancient semantic-phonetic characters, where the semantic components lost their functions and became pure form. For example,
- (qiú, ball): Originally refers to a kind of beautiful jade, with semantic component (玉, jade). Later, it was borrowed to represent a ball, and then extended to any spherical object, and (jade) became a pure form component, while (qiú) remains a phonetic component.
- (bèn, stupid): Originally refers to the inner white layer of bamboo, with semantic component (bamboo) and phonetic (běn). Later, the character was borrowed by sound to mean 'stupid'.
- (huá, magnificent): This is a simplified character with phonetic and pure form component .
Semantic-phonetic-form characters consist of the three kinds of components. For example,
- (àn, bank, shore): originally had the semantic ⿱ and phonetic (gàn). In modern Chinese, ⿱ is not a character or radical with a sound or meaning, but (hill) can still express meaning, while remains a pure form component.
- (tīng, listen): semantic (ear) and phonetic (tǐng). In modern Chinese characters, the right part has become a pure form component.
Semantic-phonetic-form characters are very rare, and the examples above are not fully convincing. Whether they can be justified as a category of internal structure remains to be studied further. If they are not treated as a category, the classification above can also be called the "New six writings".
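Because each category above is defined by which kinds of component a character contains, the seven categories correspond to the seven non-empty combinations of semantic, phonetic and pure form components. The following is a minimal sketch of that correspondence; the function name and the role labels "semantic", "phonetic" and "form" are hypothetical.

```python
CATEGORIES = {
    frozenset({"semantic"}): "semantic component character",
    frozenset({"phonetic"}): "phonetic component character",
    frozenset({"form"}): "pure form character",
    frozenset({"semantic", "phonetic"}): "semantic-phonetic character",
    frozenset({"semantic", "form"}): "semantic-form character",
    frozenset({"phonetic", "form"}): "phonetic-form character",
    frozenset({"semantic", "phonetic", "form"}): "semantic-phonetic-form character",
}


def classify(roles):
    """Map the roles of a character's internal components to a category.

    `roles` holds one label per component; repeated labels are allowed,
    since only the set of kinds present decides the category.
    """
    kinds = frozenset(roles)
    if kinds not in CATEGORIES:
        raise ValueError("roles must be a non-empty mix of semantic/phonetic/form")
    return CATEGORIES[kinds]


# 江 has a semantic component 氵 and a phonetic component 工.
print(classify(["semantic", "phonetic"]))
```

Dropping the last entry of the table gives the six-category reading mentioned above.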
According to Yang, among the 3,500 frequently used Chinese characters examined in their experiment, semantic component characters form the smallest group, accounting for about 5%; pure form characters account for about 18%; and semantic-form and phonetic-form characters together account for about 19%. The largest group is semantic-phonetic characters, accounting for about 58%.
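As a rough check, the reported shares can be converted into approximate character counts. The percentages are given only as "about" figures, so the counts below are estimates rather than reported data; the variable names are hypothetical.

```python
TOTAL = 3500  # frequently used characters in the experiment

# Approximate shares in percent, as reported above.
SHARES = {
    "semantic component": 5,
    "pure form": 18,
    "semantic-form and phonetic-form": 19,
    "semantic-phonetic": 58,
}

# Integer arithmetic keeps the estimates exact for these round percentages.
counts = {name: TOTAL * pct // 100 for name, pct in SHARES.items()}

for name, n in counts.items():
    print(f"{name}: about {n}")
print("sum of shares:", sum(SHARES.values()), "%")
```

The four rounded shares already sum to 100%, so the remaining categories are presumably too small to show up at this precision.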
Differences
For most characters, dividing by internal structure gives results similar to the first-level external structure. For example, 江 (river) is divided into the components 氵 and 工 in both cases. However, the explanations are not the same. External structure of 江: external component 氵 + external component 工. Internal structure of 江: semantic component 氵 + phonetic component 工.
In a few cases, even the physical structures are different, for example:[2]
- 辯 (biàn, debate): external structure ⿲ 辛 + 言 + 辛; internal structure ⿴ phonetic 辡 (biàn) + semantic 言 (speak).
- 裹 (guǒ, wrap): external structure ⿳ 亠 + 果 + the lower part of 衣; internal structure ⿴ semantic 衣 (clothing) + phonetic 果 (guǒ).
- 穎 (yǐng, ear of grain): external structure ⿰⿱ 匕禾頁; internal structure ⿹ phonetic 頃 (qǐng) + semantic 禾 (grain, rice plant).
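The difference between the two analyses can be made concrete by recording both for each character and comparing the components they yield. The sketch below uses only the analyses of 江 and 辯 given above; the dictionary layout and the function name are hypothetical.

```python
ANALYSES = {
    "江": {
        "external": ["氵", "工"],
        "internal": [("semantic", "氵"), ("phonetic", "工")],
    },
    "辯": {
        "external": ["辛", "言", "辛"],
        "internal": [("phonetic", "辡"), ("semantic", "言")],
    },
}


def same_division(char):
    """True if both analyses split the character into the same components.

    Only the component forms are compared; the internal analysis still
    carries extra information (the role of each component).
    """
    entry = ANALYSES[char]
    external = sorted(entry["external"])
    internal = sorted(form for _, form in entry["internal"])
    return external == internal


for ch in ANALYSES:
    print(ch, same_division(ch))
```

For 江 the two analyses agree on the pieces and differ only in how they label them, whereas for 辯 the pieces themselves differ.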
References
Works cited
- Book: Fu, Yonghe (傅永和) . 中文信息处理 (Chinese Information Processing) . 广东教育出版社 (Guangdong Education Press) . 1999 . 9-787540-640804 . 3rd . Guangzhou . zh. 84.
- Book: Li, Dasui 李大遂 . zh:简明实用汉字学 . Concise and Practical Chinese Characters . Peking University Press . 2013 . 978-7-301-21958-4 . 3rd . Beijing . zh.
- Book: National Language Commission . Ministry of Education, China . Specification of Common Modern Chinese Character Components and Component Names (现代常用字部件及部件名称规范) . 2009 . National Language Commission . Beijing . 3 September 2023.
- Book: National Language Commission, Ministry of Education, China . Specification of the Undecomposable Characters Commonly Used in the Modern Chinese (现代常用独体字规范) . 2009a . National Language Commission . Beijing . 8 September 2023.
- Book: Peking University, Modern Chinese Language Teaching and Research Office . Modern Chinese (现代汉语) . Commercial Press . 2004 . 7-100-00940-5 . Beijing . zh.
- Book: Qiu, Xigui 裘锡圭 . zh:文字学概要 . Chinese Writing . 商务印书馆 (Commercial Press) . 2013 . 978-7-100-09369-9 . 2nd . Beijing . zh.
- Book: Su, Peicheng (苏培成) . 现代汉字学纲要 (Essentials of Modern Chinese Characters) . 商务印书馆 (Commercial Press) . 2014 . 978-7-100-10440-1 . 3rd . Beijing . zh. 84.
- Book: Yin . Jiming 殷寄明 . Xiàndài hànyǔ wénzìxué . Wang . Rudong 汪如东 . Fudan University Press . 2007 . 978-7-309-05525-2 . Shanghai . zh . zh:现代汉语文字学 . Modern Chinese Writing.
- Zhang . Xiaoheng . Li . Xiaotong . 2024 . On the Classification of Modern Chinese Characters Based on Forms, Sounds and Meanings: A formation perspective (谈现代汉字的形音义分类——构字法篇) . The Journal of Modernization of Chinese Language Education (中文教学现代化学报) . 13 (2024) . 1 . 12-18.
Notes and References
- Ideographic Description Characters https://www.unicode.org/charts/PDF/U2FF0.pdf
- 辯 CJK Unified Ideograph 8FAF https://en.wiktionary.org/wiki/%E8%BE%AF