Chinese character internal structures
Chinese character forms studies the external structure of Chinese characters, i.e. strokes, components, and whole characters, and their structural relations to the pure dimension of forms or appearances.
The internal structure of Chinese characters studies the relationship between the forms, sounds, and meanings of the characters, thereby explaining the rationale for character formation.
In the analysis of internal structures, Chinese characters are decomposed into internal structural components in relation to the sound and meaning of the character.
Internal structural components
The character-building units obtained by analyzing the external structure of Chinese characters are external structural components. In internal structures, Chinese characters are analyzed according to the rationale of character formation, and the basic unit of character formation is internal structural components, or internal components in short, also called pianpang or characters.In most cases, the components of the internal structure of a Chinese character are similar to the first-level components in the external structure, for example, the character 江 is decomposed into 氵 and 工 in both analyses. However, they are not always the same. For example, the character "腾" is decomposed according to the internal structure as "semantically related to '马' and phonetically related to '朕'", in a semi-surrounding structure; while the external analysis then simply split it according to the left-right structure.
The external structure splitting method is used only when the internal structure analysis cannot be decomposed according to the character formation rationale.
According to their sound-meaning relationship with the whole character, internal components can be classified into three categories: semantic component and pure component.
- Any component related to the meaning of the character is a semantic component. For example: the component "扌" in characters "推" and "拉", and "心" in "思" and "想".
- A component related to the pronunciation of the character is a phonetic component. For example, "包" in "抱" and "苞".
- A pure component is neither related to the meaning nor to the pronunciation of the character. For example: "多" in "移", and "立" in 拉 ".
Traditional internal structural classification
In Shuowen Jiezi, Xu Shen proposed the six categories for the formation of Chinese characters, including- Pictograms – A pictographic character consists of one semantic component which is a drawing of the object it represents, such as: 日 and 月. When created, character 日 was a simplified picture of the sun and 月 was like the moon.
- Simple ideograms express an abstract idea with an iconic form, such as: 一, 二, 三, 上 and 下. The whole character is a semantic component.
- Compound ideographs, are compounds of two or more semantic components to suggest the meaning of the character, for example: 武 and 止 ) and 信 and 言 ).
- Phono-semantic compound characters ; A phono-semantic character consists of a phonetic component and a semantic component, for example, 江, 河.
- Derivative cognates are the smallest category and also the least understood. In the postface to the Shuowen Jiezi, Xu Shen gave as an example the characters 考 and 老, which had similar Old Chinese pronunciations and may have had the same etymological root, meaning "elderly person", but became lexicalized into two separate words.
- Rebus characters are characters that are "borrowed" to write another morpheme which is pronounced the same or nearly the same. For example, the characters 令 and 長, These two characters originally were official titles. The whole loan character is a phonetic component.
Modern internal structural classification
The traditional Six Writings classification presupposed that each component in a Chinese character can either represent the sound or meaning of the character. But, after the long evolution of the Chinese writing system, quite a few components can no longer effectively play the roles. For example, component 又 in the characters 邓 and 鸡 can neither represent sound nor meaning and has become a pure-form component.From the internal structure point of view, modern Chinese characters are composed of semantic components, phonetic components, and pure-form components.
These three types of components are used in combination to form the seven structures of modern Chinese characters: semantic component characters, phonetic component characters, pure form component characters, semantic-phonetic characters, semantic-form characters, phonetic-form characters, and semantic-phonetic-form characters.
Semantic component characters
Semantic component characters, or simply semantic characters, are composed of semantic components.Single semantic component characters
Single semantic component characters are composed of one semantic component, and most of them correspond to pictograms and simple ideograms in the traditional six writings. For example:- 田, 井, 門, 网 are ancient pictograms, and 门, 伞 are modern pictograms.
- 一, 二, 三, 刃 are ancient simple ideograms, and 丫, 凹, 凸, and 串 are later simple ideograms.
Multi-semantic component characters
Multi-semantic-component characters are composed of two or more semantic components. They include compound ideographs in the traditional Six writings. For example,Most multi-semantic component characters contain two semantic components, for example,
- 信 : semantic components 人 and 言, trust what people say.
- 尖 : 小 at the top and 大 at the bottom.
- 拿 : 合 your 手 together to take.
- 掰 : Separate something with both hands.
- 晶 : three 日 are very bright.
- 从 : Indicates that one 人 follows another person.
- 炎 : Two 火 represents rising flames.
- 森 : Three 木 means there are many trees.
- 尘 : 小 土 particles, representing dust.
- 灭 : Use 一 to suppress 火.
- 泪 : 氵 from 目.
叵 : turn 可 to the opposite side.
乌 /烏, 鳥/鸟 cannot see its eyes.
冇 : 有 taken away "二".
Phonetic component characters
A phonetic component character, or shortly phonetic character, is composed of one or more phonetic components.A single phonetic component character may be used to express a phonetic-loan meaning while its original or basic meaning is still understood by people. For example:
The pronunciation of the character "花" meaning "spending" is the same as that of the "花" which means "flower" in its original meaning. The latter can be regarded as the phonetic component of the former.
A single phonetic component character can also represent a syllable in a transliterated foreign word, for example, the characters in words "打" and "馬達".
Multi-phonetic component characters were produced during the development of writing systems. For example:
"新" was originally a semantic-phonetic character, but its modern meaning of "new" has nothing to do with the original semantic component of "斤", but the sounds are similar. In this way, "新" then has two phonetic components: "亲" and "斤".
"耻” used to be written as 恥 which is a semantic-phonetic character. The semantic component 心 has become 止, "耻” then has two phonetic components, "耳" and 止.
- 乒乓, both forms and sounds of the two characters are derived from 兵 with similar sounds.
Pure form component characters
A pure-form character is composed of one or more form components, which neither represent the sound nor the meaning of the characters.Single-component characters
These characters are composed of single pure form components. Many of them were originally ancient pictographic characters, but due to the evolution of the glyphs, they no longer look like the object represented. For example:- 日: The 日 character in the modern regular script is no longer of round shape.
- 月: It has become a ladder shape.
- 魚: Not quite like a fish now.
Some characters with single-form components are borrowed characters from ancient times. For example:
- 我: In oracle, it is like a weapon with a blade shaped like a saw, and was later used as a first-person pronoun. In modern Chinese, the original meaning is lost.
- 方: "Shuowen Jiezi" believes that the original meaning is a kind of boat, which has been borrowed to express the shape of a "square".
- 而: The ancient character was like a beard, now has been borrowed to be a conjunction in modern Chinese.
- 广: The traditional Chinese character is "廣".
- 农: The traditional Chinese character is "農".
- 书: The traditional Chinese character is "書".
- 专: The traditional Chinese character is "專".
- 门: The traditional Chinese character is "門".
Multi-component characters
These characters consist of two or more pure-form components. Some of these characters came from ancient pictographic characters, but later became non-pictographic. For example,- 角 : This character in the oracle bone script looks like an ox horn.
- 鼎 : The oracle bone inscriptions are in the shape of a tripod.
- 鹿 : In oracle bones, it resembles a deer.
- 騙 : It originally meant to jump on the horse. It now means deception, and the semantic component 馬 and phonetic 扁 have become pure forms.
- 特: semantic 牛 and sound 寺, it originally referred to a bull, but now it means "special" and "unusual", and both components are pure forms.
- 穌:semantic 禾 and sound 魚, Duan Yucai's note in Shuowen says: "If the grain is scattered, pick it up with a loaf." The character now expresses the meaning of awakening or is used in a person's name.
- 头 : The traditional Chinese character is “頭”, semantic component 頁 and phonetic component 豆. The simplified character component 大 and the two dots are pure-form components.
- 射 : The word "射" in oracle bone and bronze inscriptions is like pulling a bow and shooting, now neither 身 nor 寸 can express the sound or meaning of 射.
- 至 : in oracle bones, it is like an arrow shooting to the ground. According to the current glyph, the original meaning can no longer be seen, let alone the modern meaning of the word.
Semantic-phonetic characters
Semantic-phonetic characters consist of semantic components and phonetic components. The semantic component indicates the category of word meaning, and the phonetic component indicates the pronunciation of the character.The phonetic components of some semantic-phonetic characters are of exactly the same pronunciation as the whole character. For example,
- 搬 : 般.
- 銅 : 同.
- 辯 : 辡.
- 巍 : 魏.
- 拥 : 用.
- 帳 : 長.
there are 2292 items with characters and components of the same pronunciations and tones, accounting for 37.51%. There are 1110 items with characters and components of the same pronunciations but different tones, accounting for 18.17%
The phonetic components of some characters are also semantic components. For example:
- 娶 : semantic 女, phonetic 取, 取 also express the meaning.
- 駟 : semantic 馬, phonetic 四, 四 also represents meaning.
- 懈 : 解 represents sound and meaning.
- 珊 : 冊 is 删 with the right part omitted.
- 氮 : 炎 is 淡 with the left part omitted.
- 夜 : semantic 夕, the rest is 亦 with some strokes omitted.
- 耆, semantic 老 with lower part omitted, phonetic 旨
There are six combinations of semantic components and phonetic components:
- Left meaning and right sound, such as 肝, 惊, 湖 ;
- Right meaning and left sound, such as 鵡, 剛, 甥 ;
- Upper meaning and the lower sound: 霖, 茅, and 竿 ;
- Lower meaning and upper sound: 盂, 岱, 鯊 ;
- Outer meaning and inner sounds: 癢, 園, 衷, 座, 旗 ;
- Inner meaning and outer sound: 辮, 悶, 摹.
- 甭, meaning 不用, and sound derived from “bùyòng”.
- 巰, 氫 + 硫, with parts omitted.
According to statistics, among the 7,000 modern common characters of the simplified Chinese character writing system, semantic-phonetic characters account for only 56.7%. The traditional Chinese character system is slightly higher.
Semantic-form characters
Semantic-form characters are composed of semantic components and pure-form components. They are also called semi-semantic components and semi-pure form components.Many of these characters were originally semantic-phonetic characters. Due to subsequent changes in the pronunciation of the phonetic components or of the characters, the phonetic components could not effectively represent the pronunciation of the character and became pure-form components. For example:
- 布 : used to have semantic 巾 and phonetic 父, the phonetic component is no longer 父.
- 江 : used to have semantic 水 and phonetic 工, now in Mandarin 工 does not pronounce 江.
- 急 : used to have semantic 心 and sound 及. Now the upper component no longer looks like 及.
- 灿, not read as 山.
- 鸡, not read as 又.
- 环, not read as 不.
- 栗 : The upper part of the ancient Chinese character resembles the fruit on a chestnut tree. Now 覀 is a pure form component.
- 泉 : The oracle bone character looks like water flowing out of a cave. Now it has become components 白 and 水. 白 is a pure-form component.
- 桑 Mulberry: in the oracle bone script, the upper part of 桑 resembles lush branches and leaves. The current 叒 is a pure-form component.
Phonetic-form characters
Phonetic-form characters are composed of phonetic components and pure-form components. This type of character mainly comes from ancient semantic-phonetic characters, and the semantic components lost their semantic roles and became pure-form components. For example,- 球 : Originally refers to a kind of beautiful jade, with the semantic component 王. Later, it was borrowed to represent a ball, and then extended to a round three-dimensional object, and 王 became a pure-form component. The other component, 求, remains a phonetic component.
- 笨 : Originally refers to the inner white layer of bamboo, with semantic component 竹 and phonetic 本. Later, the character was borrowed by sound to mean stupid.
- 华:This is a simplified character with the phonetic 化, and 十 is a pure-form component.
Semantic-phonetic-form characters
A semantic-phonetic-form character consists of all three kinds of components: semantic, phonetic, and pure-form components.For example,
- 岸, originally had the semantic component ⿱山厂 and phonetic 干. In modern Chinese, ⿱山厂 is not a character or radical with a sound or meaning, but 山 can still express meaning, while 厂 remains a pure form component.
- 聽, semantic 耳 and phonetic 壬. In modern Chinese characters, the right part has become a pure form component.
Statistics on the internal structures of modern Chinese characters
According to Yang, among the 3500 frequently used Chinese characters of his experiment, semantic component characters are the least, accounting for about 5%;Pure-form component characters account for about 18%;
semantic-form and phonetic-form characters account for about 19%.
The largest group is semantic-phonetic characters, accounting for about 58%.
The rationality of characters
Using texts to record a language is to establish a fixed connection between text symbols and language words. If this connection is arbitrary, it is irrational; if there is a reason for it, it is rational.English words are mainly phonetic text, and their rationality mainly lies in using a combination of letters to represent the pronunciation of the corresponding word. Chinese characters are phonetic and semantic characters, and their rationality is mainly reflected in the use of phonetic components to express sounds and semantic components to express meanings.
Generally speaking, words with higher rationality are easier to learn and use. Because the unreasonable parts often require rote memorization. There are thousands of modern Chinese characters, and it is unrealistic to require every character to have high rationality.
The phonetic components and semantic components are related to the pronunciation and meaning of the character, so they are reasonable; the pure-form components are not related to the pronunciation or meaning of the character, and they are irrational.
Therefore, semantic component characters, phonetic component characters, semantic-phonetic characters, pure form component characters are irrational characters. The other characters are semi-rational characters.
Su defined the rationality of a character set as the proportion of rational components in all the internal components.
The formula is:
Rationality = /.
Professor Su's preliminary experiment results showed the rationality of modern Chinese characters to be about 50%, which is far lower than that of ancient Chinese characters.
Rational characters are easier to learn and often arouse students' interest.
In modern Chinese teaching, in order to enhance the rationality of Chinese characters, the traceability method is often used. For example: "日, 月, 山, 水, 牛, 羊, 网, 木, 目, 門 and 刀 ” are all pictographic characters from the etymology point of view. If the teacher makes some etymology analysis with the evolution of glyphs, he/she may achieve twice the result with half the effort.
The traceability analysis mentioned here is only for the convenience of teaching, rather than a comprehensive analysis of the origin and evolution of Chinese characters.
There is no need to trace the origin of characters that can be explained from the current situation. Only for those characters where the rationale cannot be seen from the current situation and it is easy to trace the origin to explain the rationale, can this method be employed.