Hokkien phonology


Hokkien is a Southern Min language spoken in southern Fujian and Taiwan. It has one of the most diverse phoneme inventories among Sinitic languages.
Along with other Min languages, which are not directly descended from Middle Chinese, Hokkien is of considerable interest to historical linguists for reconstructing Old Chinese.
This article uses Pe̍h-ōe-jī and IPA for transcription.

Syllables

A syllable in Hokkien consists of an initial, a final, and a tone.
In total, Hokkien uses around 800 toneless combinations of initials and finals, and around 2250~2450 total syllables with tones.
The number of syllables in Hokkien is far greater than in any other Sinitic language. For comparison, Cantonese and Hokchew use around 1800 toned syllables, Beijing Mandarin has 1350 syllables, and Suzhou Wu has only 1100 syllables.

Initials

Hokkien has aspirated, unaspirated as well as voiced consonant initials.
A total of 15 initials are used in Hokkien. This number does not include the three nasal consonants, which are usually considered allophones of the non-nasal voiced initials. This allophony also leads to a notable feature of the Hokkien accent in other languages, such as Japanese or Mandarin, when the nasal sounds like are denasalized into non-nasal voiced consonants like .
Syllables starting with vowels or approximants are considered to have the zero initial /∅/.
Affricates and fricatives are palatalized before, becoming,, ~,.
The phoneme /l/ in Hokkien has many possible articulations. It ranges between,, and . Its nasal allophone is uniformly described as . In directly-borrowed loanwords, foreign /d/ may be represented with Hokkien /l/:,,,.
As a phoneme, /dz/ is found predominantly in southern dialects of Hokkien. In many northern dialects, such as urban Quanzhou and Amoy, it has merged with /l/. This merger is still incomplete in some peripheral northern dialects, such as those of Tong'an or Yongchun, where /dz/ is reported to be present in some localities, in the speech of older speakers, or in a limited set of words. Some Southern Hokkien and Lengna dialects merge /dz/ with /g/.
While generally preserving /dz/, Zhangzhou Hokkien may still merge it with /l/ in some words, usually before /-u-/. E.g., characters like jūn 'wet', joán 'soft', jǒa 'how much' are now pronounced lūn, loán, lǒa even in Zhangzhou, although older Hokkien dictionaries record them with /dz/.

Finals

A final in Hokkien consists of a nucleus, with an optional medial and coda.
There are around 80 to 90 finals in Hokkien. The exact number can vary depending on the specific dialect, as well as the inclusion of marginal finals from onomatopoeia and contractions. Lengna Min, which is sometimes included in Hokkien, has around 60 finals, due to the loss of the /-ʔ/ coda.
In the tables below, rare rimes used in a small number of words are shaded. Finals used only in the northern or southern dialects of Hokkien are marked with N and S respectively.

Open-vowel finals

Finals with the coda /-ʔ/ are considered to be the checked tone counterparts for the open-vowel finals.
rowspan="3" zero
medial
-a
-o̤
N
-o͘
or
-o
-e
or

S
-ṳ
N
-ai
-au
/-i-/-ia
-i
-io
-iu
-iau
-
/-u-/-oa
-u
-oe
-ui
-oai
-
rowspan="3" zero
medial
-ah
-o̤h
N
-o͘ h
-oh
-eh
NS
-ɛh
S
-ṳh
N
-auh
/-i-/-iah
-ih
-io͘ h
S
-ioh
-iuh
-iauh
-
/-u-/-oah
-uh
-oeh
NS
-uih
N
-

The vowel -a is usually.
The vowel ɛ is only found in Southern dialects of Hokkien.
The final -o͘ is realized as a diphthong -ou in many rural Zhangzhou dialects, similarly to Teochew. Likewise, many of those dialects diphthongize -e into -ei. Some dialects, such as Zhangpu, realize them as triphthongs and. Changtai dialect uniquely pronounces general Hokkien -o͘ as. However, it still uses the vowel, mainly in place of general Hokkien.
-o̤ and -ṳ are found in many Northern Hokkien dialects, including Quanzhou and Tong'an. In Amoy, Jinjiang and among some younger urban Quanzhou speakers, -o̤ is merged with /e/. Similarly, the final -ṳ may be merged with /i/ or /u/ in those dialects. These two finals are not found in Southern dialects of Hokkien.
Chawan dialect in Zhangzhou also has -o̤ and -ṳ. Thus, Chawan distinguishes the following finals: , -o̤, and -ei . The dialects of Yunxiao and rural Dongshan are notable for having , -e and -ei as distinct finals.
In Northern Hokkien dialects where the final -o̤ is present, it is generally realized as, and -o is realized as . In dialects where -o̤ is absent, is a possible realization of -o.
rowspan="3" zero
medial
-aⁿ
-o͘ ⁿ
or
-eⁿ
N
-ɛⁿ
S
-aiⁿ
-auⁿ
-m
-ng
/-i-/-iaⁿ
-iⁿ
-io͘ ⁿ
S
-iuⁿ
NS
-iauⁿ
-
/-u-/-oaⁿ
-oeⁿ
S
-uiⁿ
NS
-oaiⁿ
-
rowspan="3" zero
medial
-ahⁿ
-ohⁿ
-ehⁿ
N
-ɛhⁿ
S
-aihⁿ
N
-auhⁿ
-mh
-ngh
/-i-/-iahⁿ
-ihⁿ
-iuhⁿ
N
-iauhⁿ
-
/-u-/-oehⁿ
S
-uihⁿ
N
-oaihⁿ
-

Finals with codas

The exact realization of and varies throughout the dialects. For most of them, they are described as / or /. Many Hokkien dialects in rural Zhangzhou and SEA have them as and . In urban Quanzhou and Jinjiang, is merged with, but is preserved.
In dialects with -ṳng and -ng, the two finals are often confounded. Likewise, -m and -o̤m may be used interchangeably. -m is assigned mainly to the syllables with zero initial, e.g. in 毋 "not", 媒 m̂, hm̂ "matchmaker", 梅 "plum", 莓 "berry", etc.
Hui'an dialect merges -im and -iam into, or -ip and -iap into, etc, and thus it has the following rhymes not found in other dialects:,,,,.
While -o̤ and -ṳ are phonemically distinct as standalone finals, they are not distinct in compound finals, and /ɯŋ/, /ɯn/, /ɯk/, /ɯt/ may be described as /əŋ/, /ən/, /ək/, /ət/ as well. In Quanzhou operatic pronunciation, this sound is pronounced as in compound finals. These rimes also share some phonological constraints with rather than, e.g. they do not cooccur with labial initials.

Rimes used in minor dialects

Rimes with medial /-ɯ-/

Finals with the medial -ṳ- are mentioned in Lūi-im Biāu-gō͘, an early 19th century Northern Hokkien rimebook, but now they are obsolete in most dialects of Hokkien. They are found only in a few exceptionally conservative dialects, such as Quanzhou operatic, or, per Ang Ui-jin's survey, in the Taiwanese "Old Anxi accent", spoken among older generations in some areas of New Taipei, in Baozhong Township, and in a few villages in Xihu and Puyan. For these dialects, Ang Ui-jin describes this medial as -o̤- rather than -ṳ-, except in the final -ṳiⁿ.
-ṳa
N
-ṳo
N
-ṳe
N
-ṳah
N
-ṳeh
N
-ṳaⁿ
N
-ṳiⁿ
N

/uɛ/ and related rimes

Some Southern Hokkien dialects have and as distinct finals. The latter is used in a small number of vernacular readings:
  • in,,,,,, in both of these dialects
  • in,,,, additionally in the Chawan dialect
Similarly, those dialects differentiate between and, but the latter is used exclusively in.
Furthermore, in Chawan dialect, the rime and are differentiated:
  • is used in,, and
  • is used in,,,,, and
Other rural Zhangzhou dialects have in both groups of characters.
Zhangpu dialect uses, and consistently in place of, and.

/õ/ and related rimes

Some Southern Zhangzhou dialects differentiate between the rimes o͘ ⁿ and oⁿ.o͘ ⁿ is used in a small number of characters, including ngó͘, ngō͘, nô͘, nó͘, nō͘, ngó͘, ngō͘, and ngó͘. Chawan dialect also has this rime in hó͘ ⁿ, perhaps due to Teochew influence.oⁿ is used in a much larger set of words, both derived from historical syllables with a nasal initial and those that never had a nasal consonant
Changtai dialect also dfferentiates between these rimes, where they are pronounced as and respectively. Similar distinction is found in other Southern Min languages, such as Teochew or Luichow, but in most dialects of Hokkien the two rimes are merged into o͘ ⁿ.
The rimes ioⁿ and ohⁿ may be also described as and for the aforementioned Southern Zhangzhou dialects. There are, however, no rimes or.

Marginal finals

Some marginal finals may occur in specific contexts, such as contractions. For example, in Dongshan dialect there is a final -iohⁿ, used in 即樣 chiohⁿ "like this" and 迄樣 hiohⁿ "like that". In Tong'an dialect, there is a final -iai, used in contractions or in words with the final -ia suffixed with á.

Tones

Traditionally, four Middle Chinese tones are called "level" 平 piâⁿ, "rising" 上 chiǔⁿ, "departing" 去 khṳ̀ and "entering" 入 ji̍p. These names are mnemonics illustrating the corresponding tone, e.g. the word "level" 平 piâⁿ has the level tone, the word "to enter" 入 ji̍p has the entering tone, etc. In modern languages, these four tones are further divided into two categories: the "dark" or "upper" tones and the "light" or "lower" tones, giving a total of 8 tones in traditional system.
The tones can be counted in two patterns: the "dark—light" order is more common in works published in China, and the "level—rising—departing—entering" order is more popular in Taiwan. This article follows the latter numbering system.
In most dialects of Hokkien, there are only 7 distinct citation tones, as some of the 8 traditional tones merge into a single tone. Certain dialects distinguish all eight tones, and some have only 6 citation tones due to additional mergers. Many Northern dialects merge certain tones in the citation form, but not in the sandhi form.
Tone contours vary across the Hokkien dialects.
;"Dark level" tone ①陰平
;"Dark rising" tone ②陰上
;"Dark departing" tone ③陰去
;"Dark entering" tone ④陰入
;"Light level" tone ⑤陽平
;"Light rising" tone ⑥陽上
;"Light departing" tone ⑦陽去
;"Light entering" tone ⑧陽入

Tone sandhi

A phrase in Hokkien is divided into "tone groups", where each syllable except the last one undergoes the tone sandhi.
In examples below, the syllables that do not undergo tone sandhi are in bold. The Pe̍h-ōe-jī orthography of examples is adjusted to concisely represent as much dialectal variations as possible, e.g. distinguishing eight tones, the initial j-, the vowel ɛ along with the vowel , etc — note that no single Hokkien dialect maintains all of these distinctions.
The last syllable of a noun does not undergo sandhi. A noun may be preceded by a classifier with a numeral or a demonstrative pronoun, all of which do undergo sandhi. If the noun is omitted, however, the classifier preserves its original tone.
Components of the numerals generally undergo the sandhi, except the words 萬 bān and 千 chhṳiⁿ/chheng:
Noun adjuncts generally undergo tone sandhi:
However, in a series of noun adjuncts, only the last one undergoes tone sandhi:
The "part-of-a-whole" constructions, particularly the extended place names, are divided into separate tone groups word-by-word:
A name with a surname is treated as a single tone group:

Sandhi in four-character idioms

Four-character compounds are usually divided into two tone groups two characters each:
Some four-character idioms are divided into tone groups of one and three characters, where the first character is the subject of an idiom:
Words 之 --chi and 者 --chiá are pronounced with neutral tone in idioms:
Some non-literary idioms read with vernacular readings form a single tone segment:

Neutral tone

Hokkien has neutral tone.
Neutral tone is pronounced as mid-low level 33~22. The syllable before neutral tone does not undergo tone sandhi, but preserves its original tone. Aside from having the neutral tone, unstressed syllables may undergo other changes, the most prominent of them being the loss the glottal stop and voicing of the initial:
The following combinations with the generic classifier 個 ê may have the preceding coda voiced and reduplicated:
Neutral tone is used in the following contexts:
  • in the possessive particle 兮 --e
  • in some verbal particles, as well as the result and direction complements
  • in sentence-final particles
  • in suffixes used in direction words, such as 爿 --pêng, 勢 --si, 邊 --piⁿ, and 頭 --thau
  • in certain time phrases
  • in personal pronouns when they are used as direct objects
  • particularly, in the indefinite pronoun 儂 lâng
  • in titles after surnames
  • particularly, in some words with 哥 ko
  • particularly, in placenames formed from a surname and the word 厝 chhù 'house'

The suffix 仔 ''-á''

The suffix 仔 is related to some special phonetic changes.
Syllables before 仔 may induce its change due to assimilation.
Some assimilations are dialect-specific. E.g. in Tong'an dialect, a syllable ending in -a changes it to -ai before 仔 : chhia-á > chhiai-á, ah-á > aih-á, nâ-á > nâi-á, saⁿ-á > saiⁿ-á, etc.
The tone sandhi before 仔 is different from general Hokkien tone sandhi.

Historical phonology and internal differences

The earliest sources on the Hokkien phonology are the rhyme dictionaries and . The former describes the Quanzhou Hokkien, while the latter describes the Zhangzhou Hokkien.
Current Hokkien-speaking area mostly coincides with the 10th century Chheng-goân Circuit, a de facto independent polity that emerged after the fall of the Min Empire. The polity was divided into two prefectures, Quanzhou and Zhangzhou, and the border between these medieval prefectures roughly coincides with certain modern Hokkien isoglosses. The Chiang-bú Circuit, which was under the rule of Wuyue, covers the Hokchew-speaking area, and Tiong-gī Circuit ruled by Southern Tang lies in Inland Min- and Hakka-speaking area in Fujian.
Changtai dialect contains features of both Northern and Southern dialect areas, atop of having some of its own unique characteristics. Changtai was a part of Quanzhou prefecture in 6—10 centuries, until being transferred under Zhangzhou's jurisdiction in 980.
Chawan dialect is a distinct variety of Hokkien. It may have received some influence from Teochew, but its amount is contestable.
The Eastern Namoa dialect shows some traits of Zhangzhou Hokkien, as this half of the Namoa island was previously included in the Zhangzhou prefecture, yet in most aspects it still clusters more with Teochew.
Hai Lok Hong dialect has even more features typical for southern dialects of Hokkien, and may be classified as a distinct dialect of either Teochew or Hokkien, or a variety of Southern Min separate from both of them. The charts below follow the classification of The Language Atlas of China, where Hai Lok Hong is included in Teochew.
The Lengna and Zhangping dialects are very different from mainstream Hokkien. At the same time, they form a continuum with Zhangzhou dialects. They are sometimes classified as the Western branch of Hokkien.
Datian Min is usually included in Southern Min as a distinct variety, apart from Hokkien and Teochew. It is divided into two dialects, Qianlu and Houlu, the former lying closer to Hokkien, and the latter having more Central Min influence. An undescribed variety of Southern Min in the north of Dehua is reported to be quite different from other Hokkien dialects and may belong to the same Hokkien—Central Min transitionary area as Datian Min.
Hinghwa is a language closely affiliated with Hokkien, yet it has received heavy Hokchew influence and is not usually considered a part of Hokkien itself.

Initial correspondences

Denasalization

One large difference between Hokkien and Teochew is the degree of denasalization. Teochew /n/, /m/ and /ŋ/ are usually considered phonemes rather than allophones of the voiced plosives /d/, /b/, /g/.
In Teochew, most syllables with codas preserve the nasal initial, with a few exceptions: denasalization frequently occurs in some specific syllables, like buang, bak, leng, long, lang, lung, bung. It may also sporadically occur in some individual characters:,,,, etc. In Hokkien on the other hand, syllables with codas can never have nasal initials.
In Hinghwa, Hokkien voiced consonants /b/, /dz/, /g/, including cases when they are derived from nasal initials, are further devoiced into /p/, /ts/, /k/.

Final correspondences

The charts below illustrate the common correspondences in rimes between various dialects of Hokkien, as well as related Southern Min languages. Middle Chinese finals are transcribed using Baxter's transcription, and Proto-Southern-Min reconstructions are per Kwok Bit-chee.
In the example characters, and readings are marked by different types of underlines. Note that the examples are given primarily for Hokkien correspondences, and other languages may lack corresponding readings for some of the example characters.

Open-vowel finals

In the tables below, characters after the double line ‖ have nasalization at least in some dialects of Hokkien.

Finals with -n/-t

Teochew has mostly merged -n/-t with -ŋ/-k, except for some peripheral dialects. The dialect of Fenghuang County in Chaozhou preserves the most -n/-t finals. The Eastern Namoa dialect preserves only -in and -un. In Hai Lok Hong, while some dialects also preserve -in and -un, most Western Hai Lok Hong dialects only preserve -un, and most Eastern Hai Lok Hong dialects merge all -n/-t finals with -ŋ/-k, like in Teochew.
The choose of -ian/-iat or -uan/-uat for a given character derived from MC rhymes 仙三合 -jwen and 先四合 -wen is not consistent among different languages. For 仙三合 -jwen, the generally used reflex is -uan/-uat for most Southern Min languages, except Hinghwa and Lengna, where it is -ian/-iat. However, there is a tendency in Hokkien to have -ian/-iat here when the MC initial was 以 y-, either as the only reading or a non-standard popular variant. For 先四合 -wen, the general reflex is -ian/-iat.
The shape of a character may influence the choose of -ian/-iat or -uan/-uat. Characters with 肙 as the phonetic element tend to have -uan in Hokkien, but -ian in other languages. Characters derived from 矞, 血 and 穴 tend to have -iat in Hokkien, but -uat in Teochew. Characters derived from 夬 and 癸 tend to have -uat in mainstream Hokkien and Teochew, but -iat in Hinghwa, Lengna, Hai Lok Hong, etc.

Finals with -ʔ

Finals with the coda -ʔ are all used in vernacular readings. Their literary counterparts almost always have -p, -t, -k as a coda in Hokkien.

Nasalized finals

Nasalized finals in Hokkien have two principal etymological sources.
First category includes the nasalized finals that are cognate to finals with a full nasal coda. They are used only in vernacular readings.
Another type of nasalized finals is used in syllables with nasal initials that did not undergo denasalization. Such syllables may be alternatively analyzed as having a plain, non-nasalized final and a nasal initial. Although this analysis is not typical for Hokkien, it is more common in the descriptions of Teochew. This type of nasal finals occurs in both literary and vernacular readings.
While finals like /ĩ/, /ɛ̃/, /ã/, /iũ/, /uã/ may be induced by both contexts, finals /ãi/, /ãu/, /iãu/, /õ~ɔ̃/ are used exclusively in syllables that did not have an etymological nasal coda. Additionally, depending on the dialect, words like 'congee' and 'sister' may have finals /uẽ/ or /uãi/. Teochew preserves some other combinations of nasal initials and finals not found in Hokkien, such as ngiá 'beautiful' and ngú 'language'.
Occasionally, nasal finals occur in characters that never had a nasal coda or a nasal initial, e.g. táⁿ, phīⁿ, phàⁿ.

Other correspondences

The following correspondences are less regular and common, and as such, they are illustrated by specific characters in which they occur.