Vietnamese alphabet
The Vietnamese alphabet is the modern writing script for the Vietnamese language. It is a Latin-based script whose spelling conventions are derived from the orthography of Romance languages such as Portuguese, Italian, and French. It was originally developed by Francisco de Pina and other Jesuit missionaries in the early 17th century.
The Vietnamese alphabet contains 29 letters, including seven letters formed with four diacritics: ă, â, ê, ô, ơ, ư, and đ. A further five diacritics are used to mark tone. The complex vowel system and the large number of letters with diacritics, which can stack twice on the same letter, make it easy to distinguish Vietnamese orthography from other writing systems that use the Latin alphabet.
The Vietnamese system's use of diacritics produces an accurate transcription for tones despite the limitations of the Roman alphabet. On the other hand, sound changes in the spoken language have led to different letters, digraphs and trigraphs now representing the same sounds.
Letter names and pronunciation
Vietnamese uses 22 letters of the ISO basic Latin alphabet. The four remaining letters (f, j, w, and z) are not considered part of the Vietnamese alphabet, although they are used to write loanwords, languages of other ethnic groups in the country based on Vietnamese phonetics to differentiate the meanings, or even Vietnamese dialects, for example dz or z for the southern pronunciation of v in standard Vietnamese.
In total, there are 12 vowels and 17 consonants.
Notes:
- The vowels in the table are bolded and italicized.
- The names bê bò or bờ bò for b and pê phở or pờ phở for p are used to avoid confusion in some contexts; likewise, s is called sờ mạnh or sờ nặng, x is called xờ nhẹ, i is called i ngắn, and y is called y dài.
- q is always followed by u in every word and phrase in Vietnamese, e.g. quần 'trousers', quyến rũ 'to attract', etc.
- The name i-cờ-rét for y is from the French name for the letter: i grec, referring to the letter's origin from the Greek letter upsilon. The other obsolete French pronunciations include and .
- The Vietnamese alphabet lacks the four letters f, j, w, and z. However, these letters are often used for foreign loanwords or may be kept for foreign names.
- y is most commonly treated as a vowel along with i; i is the "short" letter and y the "long" one. Like other vowels, y can carry tone marks, e.g. Mỹ 'America'. It may also act as a consonant. It is sometimes used in place of i, e.g. bánh mì 'bread' is written bánh mỳ by some people, but this is not generally considered standard or accurate.
- s and x are similar to each other in sound in Northern Vietnamese dialects and for some Southern Vietnamese speakers, and these speakers sometimes use them interchangeably, e.g. sương xáo or sương sáo 'grass jelly'.
Middle Vietnamese alphabet
The Vietnamese alphabet in the Dictionarium Annamiticum Lusitanum et Latinum of Alexandre de Rhodes has 23 letters:
| Upper case | A | B | ꞗ | C | D | đ | E | G | H | I | K | L | M | N | O | P | Q | R | S | T | V | X | Y |
| Lower case | a | b | ꞗ | c | d | đ | e | g | h | i | k | l | m | n | o | p | q | r | ſ/s | t | v/u | x | y |
This dictionary has fewer letters than the modern alphabet. The letters ă, â, ê, ô, ơ, and ư, which are separate letters in the modern alphabet, are used in the dictionary, but Rhodes does not regard them as separate letters. Likewise, letters with diacritics, such as à, ạ, ă, ằ, and ặ, are not separate from the letter a; they are regarded simply as the letter a with diacritics.
The alphabet includes one letter that has since fallen out of use, the b with flourish, ꞗ. It was used to represent the voiced bilabial fricative /β/.
Two letters, ꞗ and đ, have no distinction between upper and lower case. Under that orthography, the names of the two provinces Đồng Nai and Lâm Đồng would be written đồng Nai and Lâm đồng. In the modern alphabet, the lower-case form is đ and the upper-case form is Đ.
There are two variants of minuscule s: the long s, ſ, and the short s, s. In the modern alphabet, the long s, ſ, is no longer used, and the short s, s, is the only variant of s.
The letter v in the dictionary has two variants: the normal v, v, and the curving-bottom v, u. In the 17th century, v and u were not different letters, v being a variant of u.
Consonants
The alphabet is largely derived from Portuguese with some influence from French, although the usage of gh and gi was borrowed from Italian and that of c, k, and qu from Greek and Latin, mirroring the English usage of these letters.
There is one trigraph, ngh, and ten digraphs: ch, gh, gi, kh, ng, nh, ph, qu, th, tr.
- Consonants are also named by their sound followed by ờ, except k. So b is called bờ, c is called cờ, and so on.
Vowels
Pronunciation
The correspondence between the orthography and pronunciation is somewhat complicated. In some cases, the same letter may represent several different sounds, and different letters may represent the same sound. This is because the orthography was designed centuries ago and the spoken language has changed, as shown in the chart directly above that contrasts Middle and Modern Vietnamese.
The letters i and y are mostly equivalent, and there is no concrete rule that says when to use one or the other, except in sequences like ay and uy. There have been attempts since the late 20th century to standardize the orthography by replacing y with i when it represents a vowel, the latest being a decision from the Vietnamese Ministry of Education in 1984. These efforts seem to have had limited effect. In textbooks published by Nhà Xuất bản Giáo dục, y is used to represent /i/ only in Sino-Vietnamese words that are written with the letter y alone, at the beginning of a syllable when followed by ê, after u, and in the sequence ay; therefore such forms as *lý and *kỹ are not "standard", though they are much preferred elsewhere. Most people and the popular media continue to use the spelling that they are most accustomed to.
| Spelling | Sound |
| a | except as below in au and ay before syllable-final nh and ch, see Vietnamese phonology#Analysis of final ch, nh in ưa, ia and ya in ua except after q |
| ă | |
| â | |
| e | |
| ê | except as below before syllable-final nh and ch, see Vietnamese phonology#Analysis of final ch, nh in iê and yê |
| i | except as below after any vowel letter |
| o | except as below before ng and c after any vowel letter before any vowel letter except i |
| ô | except as below before ng and c except after a u that is not preceded by a q in uô except after q |
| ơ | except as below in ươ |
| u | except as below after q or any vowel letter before any vowel letter except a, ô and i Before a, ô and i: if preceded by q, otherwise |
| ư | |
| y | except as below after any vowel letter except u |
The uses of i and y to represent the phoneme /i/ can be categorized as "standard" and "non-standard" as follows.
This "standard" set by Nhà Xuất bản Giáo dục is not definite. It is unknown why the literature books use Lí while the history books use Lý.
Spelling
Vowel nuclei
The table below shows the vowels of Hanoi Vietnamese and the corresponding orthographic symbols.
Notes:
- The vowel /i/ is:
  - usually written i: sĩ.
  - sometimes written y after h, k, l, m, n, s, t, v, x: Mỹ 'America'
    - It is always written y when:
- The vowel /ɔ/ is written oo before c or ng: oóc 'organ'; kính coong. This generally only occurs in recent loanwords or when representing dialectal pronunciation.
- Similarly, the vowel /o/ is written ôô before c or ng. But unlike oo, which is frequently used in onomatopoeia, transcriptions from other languages and words "borrowed" from Nghệ An/Hà Tĩnh dialects, ôô seems to be used solely to convey the feel of the Nghệ An/Hà Tĩnh accents. In transcriptions, ô is preferred.
Diphthongs and triphthongs
Notes:
The glide /w/ is written:
- u after /k/ (spelled q in this instance)
- o in front of a, ă, or e, except after q
- o following a and e
- u in all other cases; /ăw/ is written as au instead of *ău, and /i/ is written as y after u
The diphthong /iə/ is written:
- ia at the end of a syllable: mía 'sugar cane'
- iê before a consonant or off-glide: miếng 'piece'; xiêu 'to slope, slant'
The diphthong /uə/ is written:
- ua at the end of a syllable: mua 'to buy'
- uô before a consonant or off-glide: muôn 'ten thousand'; xuôi 'down'
The diphthong /ɨə/ is written:
- ưa at the end of a syllable: mưa 'to rain'
- ươ before a consonant or off-glide: mương 'irrigation canal'; tưới 'to water, irrigate, sprinkle'
Tone marks
Vietnamese is a tonal language, so the meaning of each word depends on the pitch in which it is pronounced. Tones are marked in the IPA as suprasegmentals following the phonemic value. Some tones are also associated with a glottalization pattern.
There are six distinct tones in the standard northern dialect. The first one is not marked and the other five are indicated by diacritics applied to the vowel part of the syllable. The tone names are chosen such that the name of each tone is spoken in the tone it identifies.
In the south, there is a merging of the hỏi and ngã tones, in effect leaving five tones.
In the Telex and VNI input methods, the Z and 0 keys, respectively, are used to remove the tone mark. For example, in VNI, U2 → Ù, then pressing 0 → U.
- Unmarked vowels are pronounced with a level voice, in the middle of the speaking range.
- The grave accent indicates that the speaker should start somewhat low and drop slightly in tone, with the voice becoming increasingly breathy.
- The hook indicates in Northern Vietnamese that the speaker should start in the middle range and fall, but in Southern Vietnamese that the speaker should start somewhat low and fall, then rise.
- In the North, the tilde indicates that the speaker should start mid, break off, then start again and rise like a question in tone. In the South, it is realized identically to the hỏi tone.
- The acute accent indicates that the speaker should start mid and rise sharply in tone.
- The dot or cross signifies in Northern Vietnamese that the speaker starts low and falls lower in tone, with the voice becoming increasingly creaky and ending in a glottal stop.
In lexical ordering, differences in letters are treated as primary, differences in tone markings as secondary and differences in case as tertiary differences. Ordering according to primary and secondary differences proceeds syllable by syllable. According to this principle, a dictionary lists tuân thủ before tuần chay because the secondary difference in the first syllable takes precedence over the primary difference in the second syllable.
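The ordering rule above can be sketched as a sort key. This is a minimal illustration rather than a reference collation: the tone order (unmarked, grave, hook above, tilde, acute, dot below), the placement of lowercase before uppercase, and the handling of letters outside the alphabet are assumptions not stated in the text, and the names `syllable_key` and `sort_key` are hypothetical.

```python
import unicodedata

# Letter order of the modern 29-letter alphabet.
ALPHABET = "aăâbcdđeêghiklmnoôơpqrstuưvxy"
LETTER_RANK = {letter: rank for rank, letter in enumerate(ALPHABET)}

# Assumed tone order (not given in the text): unmarked = 0, then
# grave, hook above, tilde, acute, dot below.
TONE_RANK = {"\u0300": 1, "\u0309": 2, "\u0303": 3, "\u0301": 4, "\u0323": 5}


def syllable_key(syllable):
    """Return (letter ranks, tone rank) for one written syllable."""
    letters, tone = [], 0
    # NFD splits each letter into a base letter plus combining marks.
    for ch in unicodedata.normalize("NFD", syllable.lower()):
        if ch in TONE_RANK:
            tone = TONE_RANK[ch]
        else:
            letters.append(ch)
    # Recompose so that ă, â, ê, ô, ơ, ư are single letters again.
    base = unicodedata.normalize("NFC", "".join(letters))
    # Characters outside the alphabet sort after y (assumption).
    ranks = tuple(LETTER_RANK.get(ch, len(ALPHABET) + ord(ch)) for ch in base)
    return ranks, tone


def sort_key(text):
    """Compare syllable by syllable (letters, then tone), then by case."""
    syllables = tuple(syllable_key(s) for s in text.split())
    case = tuple(ch.isupper() for ch in text)  # lowercase first (assumption)
    return syllables, case


words = ["tuần chay", "tuân thủ", "tuan"]
print(sorted(words, key=sort_key))
```

With this key, tuân thủ sorts before tuần chay because the first syllables differ only in tone, and that difference is reached before the second syllable is examined.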
Structure
In the past, syllables in multisyllabic words were concatenated with hyphens, but this practice has died out and hyphenation is now reserved for word-borrowings from other languages. A written syllable consists of at most three parts, in the following order from left to right:
- An optional beginning consonant part
- A required vowel syllable nucleus and the tone mark, if needed, applied above or below it
- An ending consonant part, which can only be one of the following: c, ch, m, n, ng, nh, p, t, or nothing.
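The three-part structure can be illustrated with a small splitter. This is a rough sketch under stated assumptions: the onset and final inventories below are the usual textbook lists rather than something taken from this section, the function name `split_syllable` is hypothetical, and only the ambiguous onsets gi and qu receive special handling.

```python
import unicodedata

# The five tone diacritics as combining characters.
TONE_MARKS = {"\u0300", "\u0301", "\u0303", "\u0309", "\u0323"}

# Assumed inventories, listed longest first so that "ngh" wins over "ng".
ONSETS = ["ngh", "ch", "gh", "gi", "kh", "ng", "nh", "ph", "qu", "th", "tr",
          "b", "c", "d", "đ", "g", "h", "k", "l", "m", "n", "p", "q", "r",
          "s", "t", "v", "x"]
CODAS = ["ch", "ng", "nh", "c", "m", "n", "p", "t"]


def strip_tone(syllable):
    """Lowercase the syllable and remove its tone mark, keeping ă, â, ê, ô, ơ, ư."""
    decomposed = unicodedata.normalize("NFD", syllable.lower())
    kept = "".join(ch for ch in decomposed if ch not in TONE_MARKS)
    return unicodedata.normalize("NFC", kept)


def split_syllable(syllable):
    """Split one written syllable into (onset, nucleus, coda)."""
    text = strip_tone(syllable)
    onset = next((o for o in ONSETS if text.startswith(o)), "")
    rest = text[len(onset):]
    coda = next((c for c in CODAS if rest.endswith(c)), "")
    nucleus = rest[:len(rest) - len(coda)]
    # In words such as "gì", the i of "gi" is itself the vowel.
    if not nucleus and onset in ("gi", "qu"):
        onset, nucleus = onset[0], onset[1]
    if not nucleus:
        raise ValueError(f"no vowel nucleus found in {syllable!r}")
    return onset, nucleus, coda


print(split_syllable("miếng"))
print(split_syllable("nghiêng"))
```

The sketch does not validate that the nucleus is a legal vowel sequence; it only shows how the optional onset, required nucleus, and optional final fit together.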
History
Since the beginning of the Chinese rule in 111 BC, literature, government papers, scholarly works, and religious scripture were all written in classical Chinese while indigenous writing with chữ Hán started around the ninth century. In the 12th century, several Vietnamese words began to be written in chữ Nôm, adapted from Chinese characters. The system was based on Chinese characters but supplemented with Vietnamese-invented characters to represent native Vietnamese words. These characters were adapted or created using methods such as phono-semantic compounds, double-phonetic compounds, and borrowing the character for its pronunciation.
Name
People have called the Latinized script of Vietnamese chữ Quốc ngữ at least since 1867. In that year, the scholar Trương Vĩnh Ký published two grammar books. The first is Mẹo luật dạy học tiếng pha-lang-sa, a Vietnamese book written in chữ Quốc ngữ about French grammar. In this book, the Latinized script of Vietnamese was called chữ quốc ngự. The second is Abrégé de grammaire annamite, a French book about Vietnamese grammar. In this book, the Latinized script of Vietnamese was called l'alphabet européen, les caractères latins. In the 15 April 1867 issue of Gia Dinh Bao, a mention of the French book about Vietnamese grammar used the name chữ quốc ngữ for the Latinized script of Vietnamese.
Creation of chữ Quốc ngữ
As early as 1620, with the work of Francisco de Pina, Portuguese and Italian Jesuit missionaries in Vietnam began using Latin script to transcribe the Vietnamese language as an aid to learning the language. The work was continued by the Avignonese Alexandre de Rhodes. Building on previous dictionaries by Gaspar do Amaral and António Barbosa, Rhodes compiled the Dictionarium Annamiticum Lusitanum et Latinum, a Vietnamese-Portuguese-Latin dictionary, which was later printed in Rome in 1651, using their spelling system. These efforts led eventually to the development of the present Vietnamese alphabet. For 200 years, chữ Quốc ngữ was used within the Catholic community. However, works written in the Vietnamese alphabet saw limited use, while Catholic texts in chữ Nôm were significantly more widespread. Chữ Nôm thus remained the principal writing system used by Vietnamese Catholics during this period.
Colonial period
In 1910, the French colonial administration enforced chữ Quốc ngữ. The Latin alphabet then became a means to publish Vietnamese popular literature, which was disparaged as vulgar by the Chinese-educated imperial elites. Historian Pamela A. Pears asserted that by instituting the Latin alphabet in Vietnam, the French cut the Vietnamese off from their traditional Hán Nôm literature. An important reason why Latin script became the standard writing system in Vietnam but not in Cambodia and Laos, which were both dominated by the French for a similar amount of time under the same colonial framework, was that the Nguyễn emperors of Vietnam heavily promoted its usage. According to the historian Liam Kelley in his 2016 work "Emperor Thành Thái's Educational Revolution", neither the French nor the revolutionaries had enough power to spread the usage of chữ Quốc ngữ down to the village level. It was by the imperial decree of Emperor Thành Thái in 1906 that parents could decide whether their children would follow a curriculum in Hán văn or Nam âm. This decree was issued at a time when other social changes, such as the cutting of long male hair, were occurring. The main reason for the popularisation of the Latin alphabet in Vietnam/Đại Nam during the Nguyễn dynasty was the pioneering efforts of intellectuals from French Cochinchina, combined with the progressive and scientific policies of the French government in French Indochina, which created the momentum for the usage of chữ Quốc ngữ to spread.
Since the 1920s, the Vietnamese have mostly used chữ Quốc ngữ, and new Vietnamese terms for new items or words are often calqued from Hán Nôm. Some French had originally planned to replace Vietnamese with French, but this was never a serious project, given the small number of French settlers compared with the native population.
The French had to reluctantly accept the use of chữ Quốc ngữ to write Vietnamese since this writing system, created by Portuguese missionaries, is based on Portuguese orthography, not French.
Mass education
Between 1907 and 1908, the short-lived Tonkin Free School promulgated chữ Quốc ngữ and taught the French language to the general population.
In 1917, the French system suppressed Vietnam's Confucian examination system, viewed as an aristocratic system linked with the "ancien régime", thereby forcing Vietnamese elites to educate their offspring in the French-language education system.
While traditional nationalists favoured the Confucian examination system and the use of chữ Hán, Vietnamese revolutionaries, progressive nationalists, and pro-French elites viewed the French education system as a means to "liberate" the Vietnamese from old Chinese domination and the unsatisfactory "outdated" Confucian examination system, to democratize education and to help bridge Vietnamese to European philosophies.
The French colonial system then set up another educational system, teaching Vietnamese as a first language using chữ Quốc ngữ in primary school, followed by the French language. Hundreds of thousands of textbooks for primary education began to be published in chữ Quốc ngữ, with the unintentional result of turning the script into the popular medium for the expression of Vietnamese culture.
Late 20th century to present
Typesetting and printing Vietnamese has been challenging because of its number of accents and diacritics. This has led to the use of names without diacritics among Overseas Vietnamese, such as Viet instead of the proper Việt. Contemporary Vietnamese texts sometimes include words which have not been adapted to modern Vietnamese orthography, especially for documents written in chữ Hán. The Vietnamese language itself has been likened to a system akin to ruby characters elsewhere in Asia. French, which left a mark on the Vietnamese language in the form of loanwords and other influences, is no longer as widespread in Vietnam, with English or International English the preferred European language for commerce.
Computing
The universal character set Unicode has full support for the Latin Vietnamese writing system, although it does not have a separate segment for it. The required characters that other languages use are scattered throughout the Basic Latin, Latin-1 Supplement, Latin Extended-A and Latin Extended-B blocks; those that remain are placed in the Latin Extended Additional block. An ASCII-based writing convention, Vietnamese Quoted Readable, and several byte-based encodings including VSCII (TCVN), VNI, VISCII and Windows-1258 were widely used before Unicode became popular. Most new documents now exclusively use the Unicode format UTF-8.
Unicode allows the user to choose between precomposed characters and combining characters in inputting Vietnamese. Because in the past some fonts implemented combining characters in a nonstandard way, most people use precomposed characters when composing Vietnamese-language documents.
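The difference between precomposed and combining characters can be seen with Python's standard `unicodedata` module. The sketch below spells the word Việt both ways and shows that Unicode normalization converts one form into the other; the variable names and the helper `same_text` are illustrative only.

```python
import unicodedata

# "Việt" with the single precomposed code point U+1EC7 (ệ).
precomposed = "Vi\u1ec7t"
# The same word typed as e + combining circumflex + combining dot below.
combining = "Vie\u0302\u0323t"

# The two strings look identical on screen but differ code point by code point.
print(precomposed == combining)
print(len(precomposed), len(combining))

# NFC composes to precomposed characters; NFD decomposes to combining ones.
nfc = unicodedata.normalize("NFC", combining)
nfd = unicodedata.normalize("NFD", precomposed)


def same_text(a, b):
    """Compare two strings after normalizing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(same_text(precomposed, combining))
```

Normalizing to one form before comparing, searching, or sorting avoids mismatches between documents typed with different input methods.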
Most keyboards on modern phone and computer operating systems, including iOS, Android and macOS, now support the Vietnamese language and direct input of diacritics by default. Previously, Vietnamese users had to manually install free software such as Unikey on computers or Laban Key on phones to type Vietnamese diacritics. These keyboards support input methods such as Telex.
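As a rough illustration of how such input methods work, the sketch below applies a single tone key to a single vowel letter. The key assignments are the conventional Telex and VNI ones and are not listed in this section, so treat them as an assumption; the function name `apply_tone_key` is hypothetical, and real input methods also decide which vowel of a syllable receives the mark, which is not modelled here.

```python
import unicodedata

ACUTE, GRAVE, HOOK, TILDE, DOT = "\u0301", "\u0300", "\u0309", "\u0303", "\u0323"
TONE_MARKS = {ACUTE, GRAVE, HOOK, TILDE, DOT}

# Conventional tone keys (assumed); None means "remove the tone mark".
TELEX = {"s": ACUTE, "f": GRAVE, "r": HOOK, "x": TILDE, "j": DOT, "z": None}
VNI = {"1": ACUTE, "2": GRAVE, "3": HOOK, "4": TILDE, "5": DOT, "0": None}


def apply_tone_key(vowel, key, layout):
    """Replace the tone mark on one vowel letter according to a tone key."""
    # Decompose, drop any existing tone mark, keep breve/circumflex/horn.
    decomposed = unicodedata.normalize("NFD", vowel)
    bare = "".join(ch for ch in decomposed if ch not in TONE_MARKS)
    mark = layout[key]
    if mark is not None:
        bare += mark
    # NFC reorders the marks and returns the precomposed letter.
    return unicodedata.normalize("NFC", bare)


print(apply_tone_key("U", "2", VNI))
print(apply_tone_key("ơ", "j", TELEX))
```

Because the result is normalized to NFC, the output uses precomposed characters, which matches the common practice for Vietnamese documents.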