A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity. They are usually voiced, and are closely involved in prosodic variation such as tone, intonation and stress.
The word vowel comes from the Latin word vocalis, meaning "vocal". In English, the word vowel is commonly used to refer both to vowel sounds and to the written symbols that represent them.


There are two complementary definitions of vowel, one phonetic and the other phonological.
The phonetic definition of "vowel" does not always match the phonological definition. The approximants and illustrate this: both are without much of a constriction in the vocal tract, but they occur at the onset of syllables which suggests that phonologically they are consonants. A similar debate arises over whether a word like bird in a rhotic dialect has an r-colored vowel or a syllabic consonant. The American linguist Kenneth Pike suggested the terms "vocoid" for a phonetic vowel and "vowel" for a phonological vowel, so using this terminology, and are classified as vocoids but not vowels. However, Maddieson and Emmory demonstrated from a range of languages that semivowels are produced with a narrower constriction of the vocal tract than vowels, and so may be considered consonants on that basis. Nonetheless, the phonetic and phonemic definitions would still conflict for the syllabic /l/ in table, or the syllabic nasals in button and rhythm.


The traditional view of vowel production, reflected for example in the terminology and presentation of the International Phonetic Alphabet, is one of articulatory features that determine a vowel's quality as distinguishing it from other vowels. Daniel Jones developed the cardinal vowel system to describe vowels in terms of the features of tongue height, tongue backness and roundedness. These three parameters are indicated in the schematic quadrilateral IPA vowel diagram on the right. There are additional features of vowel quality, such as the velum position, type of vocal fold vibration, and tongue root position.
This conception of vowel articulation has been known to be inaccurate since 1928. Peter Ladefoged has said that "early phoneticians... thought they were describing the highest point of the tongue, but they were not. They were actually describing formant frequencies." The IPA Handbook concedes that "the vowel quadrilateral must be regarded as an abstraction and not a direct mapping of tongue position."
Nonetheless, the concept that vowel qualities are determined primarily by tongue position and lip rounding continues to be used in pedagogy, as it provides an intuitive explanation of how vowels are distinguished.


Theoretically, vowel height refers to the vertical position of either the tongue or the jaw relative to either the roof of the mouth or the aperture of the jaw. In practice, however, it refers to the first formant, abbreviated F1, which is associated with the height of the tongue. In close vowels, also known as high vowels, such as and, the first formant is consistent with the tongue being positioned close to the palate, high in the mouth, whereas in open vowels, also known as low vowels, such as, F1 is consistent with the jaw being open and the tongue being positioned low in the mouth. Height is defined by the inverse of the F1 value: The higher the frequency of the first formant, the lower the vowel. In John Elsing's usage, where fronted vowels are distinguished in height by the position of the jaw rather than the tongue, only the terms 'open' and 'close' are used, as 'high' and 'low' refer to the position of the tongue.
The International Phonetic Alphabet defines seven degrees of vowel height, but no language is known to distinguish all of them without distinguishing another attribute:
The letters are typically used for either close-mid or true-mid vowels. However, if more precision is required, true-mid vowels may be written with a lowering diacritic. The Kensiu language, spoken in Malaysia and Thailand, is highly unusual in that it contrasts true-mid with close-mid and open-mid vowels, without any difference in other parameters like backness or roundness.
It appears that some varieties of German have five vowel heights that contrast independently of length or other parameters. The Bavarian dialect of Amstetten has thirteen long vowels, which can be analyzed as distinguishing five heights each among the front unrounded, front rounded, and back rounded vowels as well as an open central vowel, for a total of five vowel heights:. Otherwise, no language is known to contrast more than four degrees of vowel height.
The parameter of vowel height appears to be the primary cross-linguistic feature of vowels in that all spoken languages that have been researched till now use height as a contrastive feature. No other parameter, even backness or rounding, is used in all languages. Some languages have vertical vowel systems in which at least at a phonemic level, only height is used to distinguish vowels.


Vowel backness is named for the position of the tongue during the articulation of a vowel relative to the back of the mouth. As with vowel height, however, it is defined by a formant of the voice, in this case the second, F2, not by the position of the tongue. In front vowels, such as, the frequency of F2 is relatively high, which generally corresponds to a position of the tongue forward in the mouth, whereas in back vowels, such as, F2 is low, consistent with the tongue being positioned towards the back of the mouth.
The International Phonetic Alphabet defines five degrees of vowel backness:
To them may be added front-central and back-central, corresponding to the vertical lines separating central from front and back vowel spaces in several IPA diagrams. However, front-central and back-central may also be used as terms synonymous with near-front and near-back. No language is known to contrast more than three degrees of backness nor is there a language that contrasts front with near-front vowels nor back with near-back ones.
Although some English dialects have vowels at five degrees of backness, there is no known language that distinguishes five degrees of backness without additional differences in height or rounding.


is named after the rounding of the lips in some vowels. Because lip rounding is easily visible, vowels may be commonly identified as rounded based on the articulation of the lips. Acoustically, rounded vowels are identified chiefly by a decrease in F2, although F1 is also slightly decreased.
In most languages, roundedness is a reinforcing feature of mid to high back vowels rather than a distinctive feature. Usually, the higher a back vowel, the more intense is the rounding. However, in some languages, roundedness is independent from backness, such as French and German, most Uralic languages, Turkic languages, and Vietnamese with back unrounded vowels.
Nonetheless, even in those languages there is usually some phonetic correlation between rounding and backness: front rounded vowels tend to be more front-central than front, and back unrounded vowels tend to be more back-central than back. Thus, the placement of unrounded vowels to the left of rounded vowels on the IPA vowel chart is reflective of their position in formant space.
Different kinds of labialization are possible. In mid to high rounded back vowels the lips are generally protruded outward, a phenomenon known as exolabial rounding because the insides of the lips are visible, whereas in mid to high rounded front vowels the lips are generally "compressed" with the margins of the lips pulled in and drawn towards each other, a phenomenon known as endolabial rounding. However, not all languages follow that pattern. Japanese, for example, is an endolabial back vowel, and sounds quite different from an English exolabial. Swedish and Norwegian are the only two known languages in which the feature is contrastive; they have both endo- and exo-labial close front vowels and close central vowels, respectively. In many phonetic treatments, both are considered types of rounding, but some phoneticians do not believe that these are subsets of a single phenomenon and posit instead three independent features of rounded and compressed and unrounded. The lip position of unrounded vowels may also be classified separately as spread and neutral. Others distinguish compressed rounded vowels, in which the corners of the mouth are drawn together, from compressed unrounded vowels, in which the lips are compressed but the corners remain apart as in spread vowels.

Front, raised and retracted

The conception of the tongue moving in two directions, high-low and front-back, is not supported by articulatory evidence and does not clarify how articulation affects vowel quality. Vowels may instead be characterized by the three directions of movement of the tongue from its neutral position: front, raised, and retracted. Front vowels, can be secondarily qualified as close or open, as in the traditional conception, but this refers to jaw rather than tongue position. In addition, rather than there being a unitary category of back vowels, the regrouping posits raised vowels, where the body of the tongue approaches the velum, and retracted vowels, where the root of the tongue approaches the pharynx :
Membership in these categories is scalar, with the mid-central vowels being marginal to any category.


occurs when air escapes through the nose. Vowels are often nasalised under the influence of neighbouring nasal consonants, as in English hand. Nasalised vowels, however, should not be confused with nasal vowels. The latter refers to vowels that are distinct from their oral counterparts, as in French vs..
In nasal vowels, the velum is lowered, and some air travels through the nasal cavity as well as the mouth. An oral vowel is a vowel in which all air escapes through the mouth. Polish and Portuguese also contrast nasal and oral vowels.


describes whether the vocal cords are vibrating during the articulation of a vowel. Most languages have only voiced vowels, but several Native American languages, such as Cheyenne and Totonac, contrast voiced and devoiced vowels. Vowels are devoiced in whispered speech. In Japanese and in Quebec French, vowels that are between voiceless consonants are often devoiced.
Modal voice, creaky voice, and breathy voice are phonation types that are used contrastively in some languages. Often, they co-occur with tone or stress distinctions; in the Mon language, vowels pronounced in the high tone are also produced with creaky voice. In such cases, it can be unclear whether it is the tone, the voicing type, or the pairing of the two that is being used for phonemic contrast. The combination of phonetic cues is known as register or register complex.


is used to describe the opposition of tense vowels vs. lax vowels. This opposition has traditionally been thought to be a result of greater muscular tension, though phonetic experiments have repeatedly failed to show this.
Unlike the other features of vowel quality, tenseness is only applicable to the few languages that have this opposition, whereas the vowels of the other languages cannot be described with respect to tenseness in any meaningful way.
One may distinguish the English tense vs. lax vowels roughly, with its spelling. Tense vowels usually occur in words with the final silent e, as in mate. Lax vowels occur in words without the silent e, such as mat. In American English, lax vowels do not appear in stressed open syllables.
In traditional grammar, long vowels vs. short vowels are more commonly used, compared to tense and lax. The two sets of terms are used interchangeably by some because the features are concomitant in some varieties of English. In most Germanic languages, lax vowels can only occur in closed syllables. Therefore, they are also known as checked vowels, whereas the tense vowels are called free vowels since they can occur in any kind of syllable.

Tongue root position

Advanced tongue root is a feature common across much of Africa, the Pacific Northwest, and scattered other languages such as Modern Mongolian. The contrast between advanced and retracted tongue root resembles the tense-lax contrast acoustically, but they are articulated differently. Those vowels involve noticeable tension in the vocal tract.

Secondary narrowings in the vocal tract

occur in some languages like Sedang and the Tungusic languages. Pharyngealisation is similar in articulation to retracted tongue root but is acoustically distinct.
A stronger degree of pharyngealisation occurs in the Northeast Caucasian languages and the Khoisan languages. They might be called epiglottalized since the primary constriction is at the tip of the epiglottis.
The greatest degree of pharyngealisation is found in the strident vowels of the Khoisan languages, where the larynx is raised, and the pharynx constricted, so that either the epiglottis or the arytenoid cartilages vibrate instead of the vocal cords.
Note that the terms pharyngealized, epiglottalized, strident, and sphincteric are sometimes used interchangeably.

Rhotic vowels

are the "R-colored vowels" of American English and a few other languages.

Reduced vowels

Some languages, such as English and Russian, have what are called 'reduced', 'weak' or 'obscure' vowels in some unstressed positions. These do not correspond one-to-one with the vowel sounds that occur in stressed position, and they tend to be mid-centralized in comparison, as well as having reduced rounding or spreading. The IPA has long provided two letters for obscure vowels, mid and lower, neither of which are defined for rounding. Dialects of English may have up to four phonemic reduced vowels:,, and higher unrounded and rounded.


The acoustics of vowels are fairly well understood. The different vowel qualities are realized in acoustic analyses of vowels by the relative values of the formants, acoustic resonances of the vocal tract which show up as dark bands on a spectrogram. The vocal tract acts as a resonant cavity, and the position of the jaw, lips, and tongue affect the parameters of the resonant cavity, resulting in different formant values. The acoustics of vowels can be visualized using spectrograms, which display the acoustic energy at each frequency, and how this changes with time.
The first formant, abbreviated "F1", corresponds to vowel openness. Open vowels have high F1 frequencies, while close vowels have low F1 frequencies, as can be seen in the accompanying spectrogram: The and have similar low first formants, whereas has a higher formant.
The second formant, F2, corresponds to vowel frontness. Back vowels have low F2 frequencies, while front vowels have high F2 frequencies. This is very clear in the spectrogram, where the front vowel has a much higher F2 frequency than the other two vowels. However, in open vowels, the high F1 frequency forces a rise in the F2 frequency as well, so an alternative measure of frontness is the difference between the first and second formants. For this reason, some people prefer to plot as F1 vs. F2 – F1.
In the third edition of his textbook, Peter Ladefoged recommended using plots of F1 against F2 – F1 to represent vowel quality. However, in the fourth edition, he changed to adopt a simple plot of F1 against F2, and this simple plot of F1 against F2 was maintained for the fifth edition of the book. Katrina Hayward compares the two types of plots and concludes that plotting of F1 against F2 – F1 "is not very satisfactory because of its effect on the placing of the central vowels", so she also recommends use of a simple plot of F1 against F2. In fact, this kind of plot of F1 against F2 has been used by analysts to show the quality of the vowels in a wide range of languages, including RP, the Queen's English, American English, Singapore English, Brunei English, North Frisian, Turkish Kabardian, and various indigenous Australian languages.
R-colored vowels are characterized by lowered F3 values.
Rounding is generally realized by a decrease of F2 that tends to reinforce vowel backness. One effect of this is that back vowels are most commonly rounded while front vowels are most commonly unrounded; another is that rounded vowels tend to plot to the right of unrounded vowels in vowel charts. That is, there is a reason for plotting vowel pairs the way they are.

Prosody and intonation

In addition to variation in vowel quality as described above, vowels vary as a result of differences in prosody. The most important prosodic variables are pitch, loudness and length. However, the features of prosody are usually considered to apply not to the vowel itself, but to the syllable in which the vowel occurs. In other words, the domain of prosody is the syllable, not the segment. We can list briefly the effect of prosody on the vowel component of a syllable.
A vowel sound whose quality does not change over the duration of the vowel is called a monophthong. Monophthongs are sometimes called "pure" or "stable" vowels. A vowel sound that glides from one quality to another is called a diphthong, and a vowel sound that glides successively through three qualities is a triphthong.
All languages have monophthongs and many languages have diphthongs, but triphthongs or vowel sounds with even more target qualities are relatively rare cross-linguistically. English has all three types: the vowel sound in hit is a monophthong, the vowel sound in boy is in most dialects a diphthong, and the vowel sounds of flower,, form a triphthong or disyllable, depending on dialect.
In phonology, diphthongs and triphthongs are distinguished from sequences of monophthongs by whether the vowel sound may be analyzed into different phonemes or not. For example, the vowel sounds in a two-syllable pronunciation of the word flower phonetically form a disyllabic triphthong, but are phonologically a sequence of a diphthong and a monophthong. Some linguists use the terms diphthong and triphthong only in this phonemic sense.

Written vowels

The name "vowel" is often used for the symbols that represent vowel sounds in a language's writing system, particularly if the language uses an alphabet. In writing systems based on the Latin alphabet, the letters A, E, I, O, U, Y, W and sometimes others can all be used to represent vowels. However, not all of these letters represent the vowels in all languages that use this writing, or even consistently within one language. Some of them, especially W and Y, are also used to represent approximant consonants. Moreover, a vowel might be represented by a letter usually reserved for consonants, or a combination of letters, particularly where one letter represents several sounds at once, or vice versa; examples from English include igh in "thigh" and x in "x-ray". In addition, extensions of the Latin alphabet have such independent vowel letters as Ä, Ö, Ü, Å, Æ, and Ø.
The phonetic values vary considerably by language, and some languages use I and Y for the consonant, e.g., initial I in Italian or Romanian and initial Y in English. In the original Latin alphabet, there was no written distinction between V and U, and the letter represented the approximant and the vowels and. In Modern Welsh, the letter W represents these same sounds. Similarly, in Creek, the letter V stands for. There is not necessarily a direct one-to-one correspondence between the vowel sounds of a language and the vowel letters. Many languages that use a form of the Latin alphabet have more vowel sounds than can be represented by the standard set of five vowel letters. In English spelling, the five letters A E I O and U can represent a variety of vowel sounds, while the letter Y frequently represents vowels ; W is used in representing some diphthongs and to represent a monophthong in the borrowed words "" and "".
Other languages cope with the limitation in the number of Latin vowel letters in similar ways. Many languages make extensive use of combinations of letters to represent various sounds. Other languages use vowel letters with modifications, such as ä in Swedish, or add diacritical marks, like umlauts, to vowels to represent the variety of possible vowel sounds. Some languages have also constructed additional vowel letters by modifying the standard Latin vowels in other ways, such as æ or ø that are found in some of the Scandinavian languages. The International Phonetic Alphabet has a set of 28 symbols to represent the range of basic vowel qualities, and a further set of diacritics to denote variations from the basic vowel.
The writing systems used for some languages, such as the Hebrew alphabet and the Arabic alphabet, do not ordinarily mark all the vowels, since they are frequently unnecessary in identifying a word. Technically, these are called abjads rather than alphabets. Although it is possible to construct simple English sentences that can be understood without written vowels, extended passages of English lacking written vowels can be difficult to understand; consider dd, which could be any of dad, dada, dado, dead, deed, did, died, diode, dodo, dud, dude, odd, add, or aided. The Masoretes devised a vowel notation system for Hebrew Jewish scripture that is still widely used, as well as the trope symbols used for its cantillation; both are part of oral tradition and still the basis for many bible translations—Jewish and Christian.


The differences in pronunciation of vowel letters between English and its related languages can be accounted for by the Great Vowel Shift. After printing was introduced to England, and therefore after spelling was more or less standardized, a series of dramatic changes in the pronunciation of the vowel phonemes did occur, and continued into recent centuries, but were not reflected in the spelling system. This has led to numerous inconsistencies in the spelling of English vowel sounds and the pronunciation of English vowel letters.

Audio samples


The importance of vowels in distinguishing one word from another varies from language to language. Nearly all languages have at least three phonemic vowels, usually as in Classical Arabic and Inuktitut, though Adyghe and many Sepik languages have a vertical vowel system of. Very few languages have fewer, though some Arrernte, Circassian, Ndu languages have been argued to have just two, and, with being epenthetic.
It is not straightforward to say which language has the most vowels, since that depends on how they are counted. For example, long vowels, nasal vowels, and various phonations may or may not be counted separately; indeed, it may sometimes be unclear if phonation belongs to the vowels or the consonants of a language. If such things are ignored and only vowels with dedicated IPA letters are considered, then very few languages have more than ten. The Germanic languages have some of the largest inventories: Standard Danish has 11 to 13 short vowels, while the Amstetten dialect of Bavarian has been reported to have thirteen long vowels:. The situation can be quite disparate within a same family language: Spanish and French are two closely related Romance languages but Spanish has only five pure vowel qualities,, while classical French has eleven: and four nasal vowels. The Mon–Khmer languages of Southeast Asia also have some large inventories, such as the eleven vowels of Vietnamese:. Wu dialects have the largest inventories of Chinese; the Jinhui dialect of Wu has also been reported to have eleven vowels: ten basic vowels,, plus restricted ; this does not count the seven nasal vowels.
One of the most common vowels is ; it is nearly universal for a language to have at least one open vowel, though most dialects of English have an and a —and often an, all open vowels—but no central. Some Tagalog and Cebuano speakers have rather than, and Dhangu Yolngu is described as having, without any peripheral vowels. is also extremely common, though Tehuelche has just the vowels with no close vowels. The third vowel of Arabic-type three-vowel system,, is considerably less common. A large fraction of the languages of North America happen to have a four-vowel system without : ; Nahuatl and Navajo are examples.
In most languages, vowels serve mainly to distinguish separate lexemes, rather than different inflectional forms of the same lexeme as they commonly do in the Semitic languages. For example, while English man becomes men in the plural, moon is a completely different word.

Words without vowels

In rhotic dialects of English, as in Canada and the United States, there are many words such as bird, learn, girl, church, worst, wyrm, myrrh that some phoneticians analyze as having no vowels, only a syllabic consonant. However, others analyze these words instead as having a rhotic vowel,. The difference may be partially one of dialect.
There are a few such words that are disyllabic, like , curtain, and turtle:, and , and even a few that are trisyllabic, at least in some accents, such as purpler, hurdler, gurgler, and certainer.
The word and frequently contracts to a simple nasal ’n, as in lock 'n key. Words such as will, have, and is regularly contract to ’ll, ’ve, and s. However, none of them are pronounced alone without vowels, so they are not phonological words. Onomatopoeic words that can be pronounced alone, and that have no vowels or ars, include hmm, pst!, shh!, tsk!, and zzz. As in other languages, onomatopoeiae stand outside the normal phonotactics of English.
There are other languages that form lexical words without vowel sounds. In Serbo-Croatian, for example, the consonants and can act as a syllable nucleus and carry rising or falling tone; examples include the tongue-twister
na vrh brda vrba mrda and geographic names such as Krk. In Czech and Slovak, either or can stand in for vowels: vlk "wolf", krk "neck". A particularly long word without vowels is čtvrthrst, meaning "quarter-handful", with two syllables. Whole sentences can be made from such words, such as Strč prst skrz krk, meaning "stick a finger through your neck", and Smrž pln skvrn zvlhl z mlh "A morel full of spots wetted from fogs". In Russian, there are also prepositions that consist of a single consonant letter, like k "to", v "in", and s "with". However, these forms are actually contractions of ko, vo, and so respectively, and these forms are still used in modern Russian before words with certain consonant clusters for ease of pronunciation.
In Kazakh and certain other Turkic languages, words without vowel sounds may occur due to reduction of weak vowels. A common example is the Kazakh word for one:
bir, pronounced. Among careful speakers, however, the original vowel may be preserved, and the vowels are always preserved in the orthography.
In Southern varieties of Chinese, such as Cantonese and Minnan, some monosyllabic words are made of exclusively nasals, such as "no" and "five".
So far, all of these syllabic consonants, at least in the lexical words, have been sonorants, such as,,, and, which have a voiced quality similar to vowels. However, there are languages with lexical words that not only contain no vowels, but contain no sonorants at all, like
shh! in English. These include some Berber languages and some languages of the American Pacific Northwest, such as Nuxalk. An example from the latter is scs "seal fat", and a longer one is clhp'xwlhtlhplhhskwts
"he had had in his possession a bunchberry plant". Berber examples include "you took it off" and "you gave it". Some words may contain one or two consonants only: "be", "feed on". In the Japonic language Miyako, there are words with no voiced sounds, such as ss 'dust', kss 'breast/milk', pss 'day', ff 'a comb', kff 'to make', fks 'to build', ksks 'month', sks 'to cut', psks 'to pull'.

Words consisting of only vowels

It is not uncommon for short grammatical words to consist of only vowels, such as a and I in English. Lexical words are somewhat rarer in English and are generally restricted to a single syllable: eye, awe, owe, and in non-rhotic accents air, ore, err. Vowel-only words of more than one syllable are generally foreign loans, such as ai for the maned sloth, or proper names, such as Iowa.
However, vowel sequences in hiatus are more freely allowed in some other languages, most famously perhaps in Bantu and Polynesian languages, but also in Japanese and Finnic languages. In such languages there tends to be a larger variety of vowel-only words. In Swahili, for example, there is ' 'to survey' and ' 'to purify' ; in Japanese, 青い 'blue/green' and oioi 追々 'gradually' ; and in Finnish, aie 'intention' and auo 'open!', although some dialects pronounce them as aije and auvo. Hawaiian, and the Polynesian languages generally, have unusually large numbers of such words, such as aeāea, which is three syllables: ae.āe.a. Most long words involve reduplication, which is quite productive in Polynesian: ioio 'grooves', eaea 'breath', uaua 'tough', auēuē 'crying', uoa or uouoa 'false mullet'. The longest continuous vowel sequence is in the Finnish word hääyöaie.