Vietnamese phonology
The phonology of Vietnamese features 19 consonant phonemes, with 5 additional consonant phonemes used in Vietnamese's Southern dialect, and 4 exclusive to the Northern dialect. Vietnamese also has 14 vowel nuclei, and 6 tones that are integral to the interpretation of the language. Older interpretations of Vietnamese tones differentiated between "sharp" and "heavy" entering and departing tones. This article is a technical description of the sound system of the Vietnamese language, including phonetics and phonology. Two main varieties of Vietnamese, Hanoi and Saigon, which are slightly different from each other, are described below.
Initial consonants
Initial consonants which exist only in the Northern dialect are in red, while those that exist only in the Southern dialect are in blue.- /w/ is the only consonant sound permitted to form consonant clusters with other consonants.
- In many regions of Northern Vietnam, the pair and have merged into one, they are no longer two opposing phonemes. Some native Vietnamese speakers who lack linguistic knowledge believe that pronouncing the initial consonant of a word whose orthographic form begins with the letter l as, n as is nói ngọng. The phenomenon of no longer distinguishing from in words whose orthographic form begins with the letter n or l has three manifestations:
- The initial consonant of all words whose orthographic form begins with n or l is.
- The initial consonant of all words is.
- In some words, the initial consonant corresponding to the letter n at the beginning of the spelling form of the word is, with l being, in some other words the sound corresponding to n is, with l being.
- In Northern dialects, some words have the initial consonant as the voiced palatal nasal, such as nhuộm, nhức, nhỏ, nhổ, nhốt, have phonetic variants with the initial consonant. This sound is written with the letter d or gi or r depending on the word.
- Some words with the initial consonant being the voiced velar nasal also have phonetic variants with the initial consonant being the voiced velar fricative, which are used in some places in the North. For example, the words ngáy, ngẫm also have phonetic variants gáy, gẫm.
- In Northern dialects, the bilabial plosive">Bilabial consonant">bilabial plosive is only the initial consonant in a few loanwords from other languages, mainly from French. In writing, the sound is written with the letter p, as in sâm panh, derived from French champagne. Not every word in another language that has the initial consonant have the corresponding Vietnamese loanword with the initial consonant. In some words, the sound is replaced by the sound. For example, both syllables of the word búp bê have the initial consonant, not. In Southern dialects, the initial consonant of words whose spelling form begins with the letter p is in many speakers.
- The glottalized stops are preglottalized and voiced: . This glottal closure is often not released before the release of the oral closure, resulting in the characteristic implosive pronunciation. However, sometimes the glottal closure is released prior to the oral release in which case the stops are pronounced. Therefore, the primary characteristic is preglottalization with implosion being secondary.
- are bilabial, while are labiodental.
- are denti-alveolar, while are apico-alveolar.
- are phonetically lamino-alveolar.
- are often slightly affricated, but they are unaspirated.
- A glottal stop is inserted before words that begin with a vowel or in Northern dialects:
Hanoi initials
- are denti-lamino-alveolar:.
- is apico-alveolar:.
- d, gi and r are all pronounced.
- ch and tr are both pronounced, while x and s are both pronounced.
- The highly salient merger of and as mentioned above, characteristic of the speech of many lower- and working-class Vietnamese in the Red River Delta, is sometimes consciously manipulated to humorous and/or pejorative effect in colloquial Hanoi speech.
- occur in a small number of foreign loans, e.g. < panne 'breakdown', < garage, < billiard. For many speakers, however, is realized as and as.
- There are no retroflex consonants,,, instead there are palato-alveolar consonants:,, in spelling pronunciations taught in schools.
Saigon initials
- is apico-alveolar.
- is palatalized lamino-alveolar:.
- Some people pronounce d as, and gi as in situations where the distinction is necessary, most people pronounce both as.
- Historically, is pronounced in common speech, merging with d and gi. However, it is becoming distinct and pronounced as, especially in careful speech or when reading a text. In traditional performance including Cải lương, Đờn ca tài tử, Hát bội (Tuồng) and some old speakers of Overseas Vietnamese, it is pronounced as consonant cluster or. In loanwords, it is pronounced, or, for example, va li is pronounced, or.
- Historically, a distinction is made between ch and tr, as well as between x and s. However, in many speakers, these two pairs are becoming merged as and respectively.
- In southern speech, the phoneme, generally represented in Vietnamese linguistics by the letter, has a number of variant pronunciations depending on the speaker. A person can also have many pronunciations. It may occur as a retroflex fricative, an alveolar approximant|alveolar approximant], an alveolar flap, a trill, or a tapped fricative/postalveolar trills#Voiced [alveolar fricative">postalveolar consonant">postalveolar trills#Voiced [alveolar fricative trill|fricative trill]. In the border area between Ho Chi Minh City and Long An province, the letter is pronounced as a palatal approximant. In many areas in the Mekong Delta, the letter is pronounced as a velar fricative.
Simplification of consonant clusters in southern dialects
- In southern speech,,, and are usually pronounced, including among educated urban speakers. The cluster go is very rare, seen only in goá ‘widowed’. The cluster ngw shows greater loss in rural varieties than in urban ones.
- In informal speech, the voiceless velar fricative is often transformed into the corresponding voiceless bilabial and labiodental consonants, and the prevocalic is deleted, for example: cá khoai is pronounced as cá phai, khóa máy is pronounced as phá máy, khỏe không? is pronounced as phẻ không?. This pronunciation is observed only in rural southern dialects, and it does not occur in the speech of educated speakers.
- There are only a few words where the bilabial and labiodental consonants are followed by the prevocalic. Most of them are French loanwords, for example: tiền boa, đậu pơ-ti-poa, xe buýt, vải voan. The initial consonant is kept and the prevocalic is lost and pronounced as: tiền bo, đậu bo, xe bít, vải von.
- After the consonant clusters of the remaining articulators followed by the prevocalic, the initial consonant is kept and the prevocalic is lost as above, for example: vô duyên is pronounced as vô diên, cái loa is pronounced as cái la.
Comparison of initials
Vowels
Vowel nuclei
| Front | Central | Back | |
| Centering | iə | ɯə | uə |
| Close | |||
| Close-mid/ Mid | |||
| Close-mid/ Mid | |||
| Open-mid/ Open | |||
| Open-mid/ Open |
The IPA chart of vowel nuclei above is based on the sounds in Hanoi Vietnamese; other regions may have slightly different inventories. Vowel nuclei consist of monophthongs and three centering diphthongs. The Standard Vietnamese vowel inventory comprises 9 monophthong qualities and three falling diphthongs /iə ɯə uə/, although other acoustic analysis suggests eleven monophthongs and three diphthongs, and.
- All vowels are unrounded except for the four back rounded vowels:.
- In the South, the high vowels are all diphthongized in open syllables:, Ba Vì .
- and are pronounced shorter than the other vowels. These short vowels only occur in closed syllables.
- The vowels and are marginal. As with the other short/long vowel pairs, short and long and are only distinguished in closed syllables. For some speakers the distinction may be one of vowel quality or of the articulation of the syllable coda in addition to or instead of vowel quantity.
- : Many descriptions, such as Thompson,,, consider this vowel to be unrounded vowel|close back unrounded]:. However, Han's instrumental analysis indicates that it is more central than back., and also transcribe this vowel as central. But according to Kirby, is frequently realized as mid centralized, leading some authors to transcribe it as .
Closing sequences
says that in Hanoi, words spelled with ưu and ươu are pronounced, respectively, whereas other dialects in the Tonkin delta pronounce them as and. This observation is also made by.
Finals
When stops occur at the end of words, they have no audible release :When the velar consonants are after, they are articulated with a simultaneous bilabial closure or are strongly labialized.
Hanoi finals
Analysis of final ''ch'', ''nh''
The pronunciation of syllable-final ch and nh in Hanoi Vietnamese has had different analyses. One analysis, that of has them as being phonemes, where contrasts with both syllable-final t and c, and contrasts with syllable-final n and ng. Final is, then, identified with syllable-initial.Another analysis has final and as representing different spellings of the velar phonemes and that occur after upper front vowels and . This analysis interprets orthographic ⟨ach⟩ and ⟨anh⟩ as an underlying, which becomes phonetically open and diphthongized: →, →. This diphthongization also affects ⟨êch⟩ and ⟨ênh⟩: →, →.
Arguments for the second analysis include the limited distribution of final and, the gap in the distribution of and which do not occur after and, the pronunciation of ⟨ach⟩ and ⟨anh⟩ as and in certain conservative central dialects, and the patterning of ~ and ~ in certain reduplicated words. Additionally, final is not articulated as far forward as the initial : and are pre-velar with no alveolar contact.
The first analysis closely follows the surface pronunciation of a slightly different Hanoi dialect than the second. In this dialect, the in and is not diphthongized but is actually articulated more forward, approaching a front vowel. This results in a three-way contrast between the rimes ăn vs. anh vs. ăng. For this reason, a separate phonemic is posited.
Table of Hanoi finals
The following rimes ending with velar consonants have been diphthongized in the Hanoi dialect, but, and are more open:With the above phonemic analyses, the following is a table of rimes ending in in the Hanoi dialect:
| , | ||||||||||||||
| ăn | an | en | on | ân | ơn | ên | ôn | in | ưn | un | iên | ươn | uôn | |
| ăt | at | et | ot | ât | ơt | êt | ôt | it | ưt | ut | iêt | ươt | uôt | |
| ăng | ang | anh | ong | âng | – | ênh | ông | inh | ưng | ung | iêng | ương | uông | |
| ăc | ac | ach | oc | âc | – | êch | ôc | ich | ưc | uc | iêc | ươc | uôc |
Saigon finals
Merger of finals
While the variety of Vietnamese spoken in Hanoi has retained finals faithfully from Middle Vietnamese, the variety spoken in Ho Chi Minh City has drastically changed its finals. Rimes ending in merged with those ending in, respectively, so they are always pronounced, respectively, after the short front vowels . However, they are always pronounced after the other vowels. After rounded vowels, many speakers close their lips, i.e. they pronounce as. Subsequently, vowels of rimes ending in labiovelars have been diphthongized, while vowels of rimes ending in alveolar have been centralized. Otherwise, some Southern speakers distinguish and after in formal speech, but there are no Southern speakers who pronounce "ch" and "nh" at the end of syllables as.Table of Saigon finals
The short back vowels in the rimes have been diphthongized and centralized, meanwhile, the consonants have been labialized. Similarly, the short front vowels have been centralized which are realized as central vowels and the "unspecified" consonants have been affected by coronal spreading from the preceding front vowels which are surfaced as coronals .The other closed dialects which have also been merged in codas, but some vowels are pronounced differently in some dialects:
| Huế | Quảng Nam | Bình Định | Ho Chi Minh City | |
| ung, uc | , | , | , | , |
| un, ut | , | , | , | , |
| ênh, êch | , | , | , | , |
| ên, êt | , | , | , | , |
| inh, ich | , | , | , | , |
| in, it | , | , | , | , |
The ông, ôc rimes are merged into ong, oc as, in many Southern speakers, but not with ôn, ôt as pronounced,. The oong, ooc and eng, ec rimes are few and are mostly loanwords or onomatopoeia. The ôông, ôôc '' rimes are the "archaic" form before becoming ông, ôc by diphthongization and still exist in the North Central dialect in many placenames. The articulation of these rimes in the North Central dialect are, without a simultaneous bilabial closure or labialization.
With the above phonemic analyses, the following is a table of rimes ending in in the Ho Chi Minh City dialect:
Tone
Vietnamese vowels are all pronounced with an inherent tone. Tones differ in- pitch
- length
- contour melody
- intensity
- phonation
There is much variation among speakers concerning how tone is realized phonetically. There are differences between varieties of Vietnamese spoken in the major geographic areas and smaller differences within the major areas. In addition, there seems to be variation among individuals. More research is needed to determine the remaining details of tone realization and the variation among speakers.
Six-tone analysis
In the modern Vietnamese orthography, tone is indicated by diacritics written above or below the vowel, of which there are six categories. The six-tone analysis corresponds most closely with the way the Vietnamese language is written, perceived and taught, whether to those in the school system or to learners of Vietnamese as a foreign language.Northern varieties
The six tones in the Hanoi and other northern varieties are:tone
- The ngang tone is level at around the mid level and is produced with modal voice phonation. Alexandre de Rhodes describes this as "level"; describes it as "high level".
tone
- The huyền tone starts low-mid and falls. Some Hanoi speakers start at a somewhat higher point. It is sometimes accompanied by breathy voice phonation in some speakers, but this is lacking in other speakers: bà =. Alexandre de Rhodes describes this as "grave-lowering"; describes it as "low falling".
tone
- The hỏi tone starts a mid level and falls. It starts with modal voice phonation, which moves increasingly toward tense voice with accompanying harsh voice. reports that in Hanoi, the tone is mid falling. In other northern speakers, the tone is mid falling and then rises back to the mid level. This characteristic gives this tone its traditional description as "dipping". However, the falling-rising contour is most obvious in citation forms or when syllable-final; in other positions and when in fast speech, the rising contour is negligible. The hỏi also is relatively short compared with the other tones, but not as short as the nặng tone. Alexandre de Rhodes describes this as "smooth-rising"; describes it as "dipping-rising".
tone
- The ngã tone is mid rising. Many speakers begin the vowel with modal voice, followed by strong creaky voice starting toward the middle of the vowel, which is then lessening as the end of the syllable is approached. Some speakers with more dramatic glottalization have a glottal stop closure in the middle of the vowel. In Hanoi Vietnamese, the tone starts at a higher pitch than other northern speakers. Alexandre de Rhodes describes this as "chesty-raised"; describes it as "creaking-rising".
tone
- The sắc tone starts as mid and then rises in much the same way as the ngã tone. It is accompanied by tense voice phonation throughout the duration of the vowel. In some Hanoi speakers, the ngã tone is noticeably higher than the sắc tone, for example: má = ; mã = . Alexandre de Rhodes describes this as "acute-angry"; describes it as "high rising".
tone
- The nặng tone starts mid or low-mid and rapidly falls in pitch. It starts with tense voice that becomes increasingly tense until the vowel ends in a glottal stop closure. This tone is noticeably shorter than the other tones. Alexandre de Rhodes describes this as "chesty-heavy"; describes it as "constricted".
Southern varieties
The nặng tone is pronounced as low rising tone in fast speech or low falling-rising tone in more careful utterance.
The ngã and hỏi tone are merged into a mid falling-rising , which is somewhat similar to the hỏi tone of the non-Hanoi Northern accent mentioned above. This merged hỏi–ngã tone is characteristic of Southern Vietnamese accents.
North-central and Central varieties
North-central and Central Vietnamese varieties are fairly similar with respect to tone although within the North-central dialect region there is considerable internal variation.It is sometimes said that people from Nghệ An pronounce every tone as a nặng tone.
Eight-tone analysis
An older analysis assumes eight tones rather than six. This follows the lead of traditional Chinese phonology. In Middle Chinese, syllables ending in a vowel or nasal allowed for three tonal distinctions, but syllables ending with, or had no tonal distinctions. Rather, they were consistently pronounced with a short high tone, which was called the entering tone and considered a fourth tone. Similar considerations lead to the identification of two additional tones in Vietnamese for syllables ending in,, and. These are not phonemically distinct from the sắc and nặng tones, however, and hence not considered as separate tones by modern linguists and are not distinguished in the orthography.Syllables and phonotactics
According to, there are 4,500 to 4,800 possible spoken syllables, and the standard national orthography can represent 6,200 syllables. A description of syllable structure and exploration of its patterning according to the Prosodic Analysis approach of J.R. Firth is given in Henderson.The Vietnamese syllable structure follows the scheme:
where
In other words, a syllable has an obligatory nucleus and tone, and can have an optional consonant onset, an optional on-glide, and an optional coda or off-glide.
More explicitly, the syllable types are as follows:
C1: Any consonant may occur in as an onset with the following exception:
- does not occur in native Vietnamese words
- does not occur after labial consonants
- does not occur after in native Vietnamese words
G: The offglide may be or. Together, V and G must form one of the diphthongs or triphthongs listed in the section on Vowels.
- offglide does not follow the front vowels
- offglide does not follow the rounded vowels
- with some exceptions, the offglide cannot occur if the syllable contains a onglide
T: Syllables are spoken with an inherent tone contour:
- Six tone contours are possible for syllables with offglides, closed syllables with nasal codas, and open syllables—i.e., those without consonant codas.
- If the syllable is closed with one of the oral stops, only two contours are possible: the sắc and the nặng tones.
- Less common rimes may not be represented in this table.
- The nặng tone mark has been added to all rimes in this table for illustration purposes only. It indicates which letter tone marks in general are added to, largely according to the "new style" rules of Vietnamese orthography as stated in Quy tắc đặt dấu thanh trong chữ quốc ngữ. In practice, not all these rimes have real words or syllables that have the nặng tone.
- The IPA representations are based on Wikipedia's conventions. Different dialects may have different pronunciations.