Tatar language
Tatar[] is a Turkic language spoken by the Tatars mainly located in modern Republic of Tatarstan, wider Volga-Ural region, as well as many other regions of Russia. Tatar belongs to the same branch of Turkic languages such as Bashkort, Kazakh, Nogai and Kyrgyz.
The two main dialects of Tatar are the Central Dialect, and the Western Dialect. The literary Tatar language is based on the Central Dialect and on a local variant of Türki. Tatar should not be confused with Crimean Tatar or Siberian Tatar, which are different languages, although also part of the Kipchak language group.
Like other Turkic languages, Tatar was traditionally written in the Arabic script for most of its history. Since 1939, the alphabet has been Cyrillic, though a number of Latin-based versions have also been used over the years.
Geographic distribution
The Tatar language is spoken in Russia by about 5.3 million people, and also by communities in Azerbaijan, China, Finland, Georgia, Israel, Kazakhstan, Latvia, Lithuania, Romania, Turkey, Ukraine, the United States, Uzbekistan, and several other countries. Globally, there are more than 7 million speakers of Tatar.Tatar is also the mother tongue for several thousand Mari, a Finnic people; Mordva's Qaratay group also speak a variant of Kazan Tatar.
In the 2010 census, 69% of Russian Tatars claimed at least some knowledge of the Tatar language. In Tatarstan, 93% of Tatars and 3.6% of Russians claimed to have at least some knowledge of the Tatar language. In neighbouring Bashkortostan, 67% of Tatars, 27% of Bashkirs, and 1.3% of Russians claimed to understand basic Tatar language.
Official status
Tatar, along with Russian, is the official language of the Republic of Tatarstan. The official script of the Tatar language is based on the Cyrillic script with some additional letters. The Republic of Tatarstan passed a law in 1999, which came into force in 2001, establishing an official Tatar Latin alphabet. A Russian federal law overrode it in 2002, making Cyrillic the sole official script in Tatarstan since. Unofficially, other scripts are used as well, mostly Latin and Arabic. All official sources in Tatarstan must use Cyrillic on their websites and in publishing. In other cases, where Tatar has no official status, the use of a specific alphabet depends on the preference of the author.The Tatar language was made a de facto official language in Russia in 1917, but only within the Tatar Autonomous Soviet Socialist Republic. Tatar is also considered to have been the official language in the short-lived Idel-Ural State, briefly formed during the Russian Civil War.
The usage of Tatar declined during the 20th century. By the 1980s, the study and teaching of Tatar in the public education system was limited to rural schools. However, Tatar-speaking pupils had little chance of entering university because higher education was available in Russian almost exclusively.
As of 2001, Tatar was considered a potentially endangered language while Siberian Tatar received "endangered" and "seriously endangered" statuses, respectively. Higher education in Tatar can only be found in Tatarstan, and is restricted to the humanities. In other regions Tatar is primarily a spoken language and the number of speakers as well as their proficiency tends to decrease. Tatar is popular as a written language only in Tatar-speaking areas where schools with Tatar language lessons are situated. On the other hand, Tatar is the only language in use in rural districts of Tatarstan.
Since 2017, Tatar language classes are no longer mandatory in the schools of Tatarstan. According to the opponents of this change, it will further endanger the Tatar language and is a violation of the Tatarstan Constitution which stipulates the equality of Russian and Tatar languages in the republic.
Dialects
There are two main dialects of Tatar:- Central or Middle
- Western
Spoken idioms of Siberian Tatars, which differ significantly from the above two, are often considered as the third dialect group of Tatar by some, but as an independent language on its own by others.
Central or Middle
The Central or Middle dialectal group is spoken in Kazan and most of Tatarstan and is the basis of the standard literary Tatar language. Middle Tatar includes the Nagaibak dialect.Mishar
The Western dialect is distinguished from the Central dialect most clearly by the absence of the uvular q and ğ and the rounded å of the first syllable. Letters ç and c are pronounced as affricates. Regional differences exist also.Mishar Dialect, and especially its regional variant in Sergachsky district, is said to be "faithfully close" to the ancient Kipchak language. Some linguists, such as Radlov, Samoylovich, think that Mishar traditionally belongs to the Kipchak-Cuman group of languages, rather than to the Kipchak-Bulgar group.
Mishar is the dialect spoken by the Tatar minority of Finland.
Siberian Tatar
Two main isoglosses that characterize Siberian Tatar are ç as and c as, corresponding to standard and. There are also grammatical differences within the dialect, scattered across Siberia.Many linguists claim the origins of Siberian Tatar dialects are actually independent of Volga–Ural Tatar; these dialects are quite remote both from Standard Tatar and from each other, often preventing mutual comprehension. The claim that this language is part of the modern Tatar language is typically supported by linguists in Kazan, Moscow and by Siberian Tatar linguists and denounced by some Russian and Tatar ethnographs.
Over time, some of these dialects were given distinct names and recognized as separate languages after detailed linguistic study. However, the Chulym language was never classified as a dialect of Tatar language. Confusion arose because of the endoethnonym "Tatars" used by the Chulyms. The question of classifying the Chulym language as a dialect of the Khakass language was debatable. A brief linguistic analysis shows that many of these dialects exhibit features which are quite different from the Volga–Ural Tatar varieties, and should be classified as Turkic varieties belonging to several sub-groups of the Turkic languages, distinct from Kipchak languages to which Volga–Ural Tatar belongs.
Phonology
Vowels
There exist several interpretations of the Tatar vowel phonemic inventory. In total Tatar has nine or ten native vowels, and three or four loaned vowels.According to Baskakov Tatar has only two vowel heights, high and low. There are two low vowels, front and back, while there are eight high vowels: front and back, round and unround, normal and short.
Poppe proposed a similar yet slightly different scheme with a third, higher mid, height, and with nine vowels.
According to Makhmutova Tatar has three vowel heights: high, mid and low, and four tongue positions: front, front-central, back-central and back.
The mid back unrounded vowel 'ë is usually transcribed as ı, though it differs from the corresponding Turkish vowel.
The tenth vowel ï'' is realized as the diphthong ëy, which only occurs word-finally, but it has been argued to be an independent phoneme.
Phonetically, the native vowels are approximately thus :
In polysyllabic words, the front-back distinction is lost in reduced vowels: all become mid-central. The mid reduced vowels in an unstressed position are frequently elided, as in кеше keşe > 'person', or кышы qışı > ' winter'. Low back is rounded in the first syllable and after, but not in the last, as in бала bala 'child', балаларга balalarğa 'to children'. In Russian loans there are also,,, and, written the same as the native vowels: ы, е/э, о, а''' respectively.
Historical shifts
Historically, the Old Turkic mid vowels have raised from mid to high, whereas the Old Turkic high vowels have become the Tatar reduced mid series.| Vowel | Old Turkic | Kazakh | Tatar | Bashkir | Gloss |
| *e | *et | et | it | it | 'meat' |
| *ö | *söz | söz | süz | hüź | 'word' |
| *o | *sol | sol | sul | hul | 'left' |
| *i | *it | it | et | et | 'dog' |
| *ï | *qïz | qız | qız | qıź | 'girl' |
| *u | *qum | qum | qom | qom | 'sand' |
| *ü | *kül | kül | köl | köl | 'ash' |
Consonants
;Notes:Palatalization
Tatar consonants usually undergo slight palatalization before front vowels. However, this allophony is not significant and does not constitute a phonemic status. This differs from Russian where palatalized consonants are not allophones but phonemes on their own. There are a number of Russian loanwords which have palatalized consonants in Russian and are thus written the same in Tatar. The Tatar standard pronunciation also requires palatalization in such loanwords; however, some Tatar may pronounce them non-palatalized.Syllables
In native words there are six types of syllables :- V
- VC
- CV
- CVC
- VSC
- CVSC
Prosody
Stress is usually on the final syllable. However, some suffixes cannot be stressed, so the stress shifts to the syllable before that suffix, even if the stressed syllable is the third or fourth from the end. A number of Tatar words and grammatical forms have the natural stress on the first syllable. Loanwords, mainly from Russian, usually preserve their original stress.Phonetic alterations
Tatar phonotactics dictate many pronunciation changes which are not reflected in the orthography.- Unrounded vowels ı and e become rounded after o or ö:
- Nasals are assimilated to the following stops:
- Stops are assimilated to the preceding nasals :
- Voicing may also undergo assimilation:
- Unstressed vowels may be syncopated or reduced:
- Vowels may also be elided:
- In consonant clusters longer than two phones, ı or e is inserted into speech as an epenthetic vowel.
- Final consonant clusters are simplified:
- Final devoicing is also frequent:
Grammar
Like other Turkic languages, Tatar is an agglutinative language.Nouns
Tatar nouns are inflected for cases and numbers. Case suffixes change depending on the last consonants of the noun, while nouns ending in for example p/k are voiced to b/g when a possessive suffix is added. Suffixes below are in back vowel, with front variant can be seen at #Phonology section.The declension of possessive suffixes is even more irregular, with the dative suffix -а used in 1st singular and 2nd singular suffixes, and the accusative, dative, locative, and ablative endings -н, -на, -нда, -ннан is used after 3rd person possessive suffix. Nouns ending in -и, -у, or -ү, although phonologically vowels, take consonantic endings.
| Person | After consonants | After vowels |
| 1st singular | -ым -ım | -м -m |
| 2nd singular | -ың -ıñ | -ң -ñ |
| 3rd | -ы -ı | -сы -sı |
| 1st plural | -ыбыз -ıbız | -быз -bız |
| 2nd plural | -ыгыз -ığız | -гыз -ğız |
Declension of pronouns
The declension of personal and demonstrative pronouns tends to be irregular. Irregular forms are in bold.| Case | Who? | What? |
| Nominative | кем kem | нәрсә närsä |
| Accusative | кемне kemne | нәрсәне närsäne |
| Genitive | кемнең kemneñ | нәрсәнең närsäneñ |
| Dative | кемгә kemgä | нәрсәгә närsägä |
| Locative | кемдә kemdä | нәрсәдә närsädä |
| Ablative | кемнән kemnän | нәрсәдән närsädän |
Verbs
The distribution of present tense suffixes is complicated, with the former is used with verb stems ending in consonants, and the latter is used with verb stem ending in vowels. The distribution of indefinite future tense is more complicated in consonant-ending stems, it is resolved by -арга/-ырга infinitives. However, because some have verb citation forms in verbal noun, this rule becomes somewhat unpredictable.Tenses are negated with -ма, however in the indefinite future tense and the verbal participle they become -mas / -мас and -mıyça / -мыйча instead, respectively. Alongside vowel-ending stems, the suffix also becomes -мый when negates the present tense. To form interrogatives, the suffix -мы is used.
| Type | 1st singular | 2nd singular | 3rd singular | 1st plural | 2nd plural | 3rd plural |
| I | -мын/-м -mın/-m | -сың -sıñ | -∅ | -быз -bız | -сыз -sız | -лар/-нар -lar/-nar |
| II | -м -m | -ң -ñ | -∅ | -к -q, -k | -гыз -ğız | -лар/-нар -lar/-nar |
| Imperative | -ыйм -ıym | -∅ | -сын -sın | -ыйк -ıyq | -гыз -ığız | -сыннар -sınnar |
Definite past and conditional tenses use type II personal inflections instead. When in the case of present tense, short ending is used. After vowels, the first person imperative forms deletes the last vowel, similar to the present tense does. Like plurals of nouns, the suffix -лар change depending the preceding consonants.
Anomalous verbs
Some verbs, however, fall into this category. Dozens of them have irregular stems with a final mid vowel, but obscured on the infinitive. The verbs qoru / кору "to build", tanu / тану "to disclaim", taşu / ташу "to spill" have contrastive meanings with verbs with their final vowelled counterparts, meaning "to dry", "to know", "to carry".The verb "to say" is significantly more irregular than any other verbs: its 2nd person singular imperative is digen, while its expected regular form is repurposed as the present tense forms.
Predicatives
These predicative suffixes have now fallen into disuse, or rarely used.Writing system
During its history, Tatar has been written in Arabic, Latin and Cyrillic scripts.Before 1928, Tatar was mostly written in Arabic script.
During the 19th century, Russian Christian missionary Nikolay Ilminsky devised the first Cyrillic alphabet for Tatar. This alphabet is still used by Christian Tatars.
In the Soviet Union after 1928, Tatar was written with a Latin alphabet called Jaꞑalif.
In 1939, in Tatarstan and all other parts of the Soviet Union, a Cyrillic script was adopted and is still used to write Tatar. It is also used in Kazakhstan.
The Republic of Tatarstan passed a law in 1999 that came into force in 2001 establishing an official Tatar Latin alphabet. A Russian federal law overrode it in 2002, making Cyrillic the sole official script in Tatarstan since. In 2004, an attempt to introduce a Latin-based alphabet for Tatar was further abandoned when the Constitutional Court ruled that the federal law of 15 November 2002 mandating the use of Cyrillic for the state languages of the republics of the Russian Federation does not contradict the Russian constitution. In accordance with this Constitutional Court ruling, on 28 December 2004, the Tatar Supreme Court overturned the Tatarstani law that made the Latin alphabet official.
In 2012 the Tatarstan government adopted a new Latin alphabet but with limited usage.
In 2024, the modified Common Turkic Alphabet replaced letter ä with ə, which was already in use in Azerbaijani, as well as among Tatar activists using the Latin alphabet.
- Tatar Perso-Arabic alphabet :
- Tatar Old Latin alphabet :
- Tatar Old Cyrillic alphabet :
- Tatar Cyrillic alphabet :
- 1999 Tatar Latin alphabet, made official by a law adopted by Tatarstani authorities but annulled by the Tatar Supreme Court in 2004:
- 2012 Tatar Latin alphabet
History
The ancestors of Tatar are the extinct Turkic Bulgar and Kipchak languages.The literary Tatar language is based on the Central Tatar dialect and on Türki, also known as Old Tatar Language. Both are members of the Volga-Ural subgroup of the Kipchak group of Turkic languages, although they also partly derive from the ancient Volga Bulgar language.
Crimean Tatar, although similar by name, belongs to another subgroup of the Kipchak languages. Unlike Kazan Tatar, Crimean Tatar is heavily influenced by Turkish and Nogai languages.
Influences in Tatar
Most of the Uralic languages in the Volga River area have strongly influenced the Tatar language, as have the Arabic, Persian and Russian languages.Arabic and Persian
The Arabic and Persian influence on Tatar can be seen most clearly in loan words but also in specific sounds. For example, Tatar ğ / г is the Arabic ghayn غ. However, in Arabic words and names where there is an ayin ع, Tatar adds the ghayn instead. In the Mishar Tatar Dialect, ğ is not pronounced, and thus, a word like şiğır is şigır or şiyır for Mishars.When it comes to Arabic and Persian loanwords, in the Tatar Latin script, alif is realised as the letter a, and when there is no alif, it is ä . When the alif has hamza on top, it is also ä, but Tatar İske imlâ spells it without. Vowel harmony as well is a deciding factor. Similarly with ö/o. However, this rule is often inconsistent when transliterating from Cyrillic to Latin.
During the Golden Horde, the ancestors of modern Tatars used Persian in addition to their Turkic language to a relatively significant extent, especially in poetry and even after the Golden Horde. For example, the long-serving Khan of the Kazan Khanate, Möxəmməd-Əmin, wrote poetry in Persian. In religious and legal matters Arabic was used. Many Persian and Arabic works are considered part of Tatar literature today.
Sample text
Article 1 of the Universal Declaration of Human Rights in Tatar :Article 1 of the Universal Declaration of Human Rights in Tatar :
International Phonetic Alphabet transcription:
Article 1 of the Universal Declaration of Human Rights in English: