Turkic languages


The Turkic languages are a language family of more than 35 documented languages spoken by the Turkic peoples of Eurasia, from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia, and West Asia. The Turkic languages originated in a region of East Asia spanning from Mongolia to Northwest China, where Proto-Turkic is thought to have been spoken, and from where they expanded to Central Asia and farther west during the first millennium. They are characterized as a dialect continuum.
Turkic languages are spoken by some 200 million people. The Turkic language with the greatest number of speakers is Turkish, spoken mainly in Anatolia and the Balkans; its native speakers account for about 38% of all Turkic speakers, followed by Uzbek.
Characteristic features such as vowel harmony, agglutination, subject-object-verb order, and lack of grammatical gender are almost universal within the Turkic family. There is a high degree of mutual intelligibility, upon moderate exposure, among the various Oghuz languages, which include Turkish, Azerbaijani, Turkmen, Qashqai, Chaharmahali Turkic, Gagauz, and Balkan Gagauz, as well as Oghuz-influenced Crimean Tatar. Other Turkic languages demonstrate varying amounts of mutual intelligibility within their subgroups as well. Although methods of classification vary, the Turkic languages are usually considered to be divided into two branches: Oghur, of which the only surviving member is Chuvash, and Common Turkic, which includes all other Turkic languages.
Turkic languages show many similarities with the Mongolic, Tungusic, Koreanic, and Japonic languages. These similarities have led some linguists to propose an Altaic language family, though this proposal is widely rejected by historical linguists. Similarities with the Uralic languages even caused these families to be regarded as one for a long time under the Ural-Altaic hypothesis. However, there has not been sufficient evidence to conclude the existence of either of these macrofamilies. The shared characteristics between the languages are attributed presently to extensive prehistoric language contact.
UNESCO has declared December 15 "World Turkic Language Family Day". On 15 December 1893, the Orkhon inscriptions, among the first Turkic texts, were deciphered.

Characteristics

Turkic languages are null-subject languages with vowel harmony, converbs, and extensive agglutination by means of suffixes and postpositions; they lack grammatical articles, noun classes, and grammatical gender. Subject–object–verb word order is universal within the family. In terms of the level of vowel harmony in the Turkic language family, Tuvan is characterized as almost fully harmonic, whereas Uzbek is the least harmonic or not harmonic at all. Taken as a whole, the documented historical development of the Turkic languages, both inscriptional and textual, provides over a millennium of attested stages and scenarios in the evolution of vowel harmony, which in turn show harmony evolving along a confidently definable trajectory. Though vowel harmony is a common characteristic of the major language families spoken in Inner Eurasia, the type of harmony differs among them: Uralic and Turkic share one type of vowel harmony, whereas Mongolic and Tungusic represent a different type.
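As a minimal illustration of how vowel harmony interacts with suffixation, the sketch below picks between the two forms of the Turkish plural suffix, -lar and -ler, according to whether the last vowel of the stem is back or front. The function name `plural` and the vowel sets are assumptions made for this example. It covers only regular, lowercase native stems and ignores loanwords that break the pattern.

```python
# Turkish vowels grouped by backness (illustrative; lowercase only).
BACK_VOWELS = set("aıou")
FRONT_VOWELS = set("eiöü")


def plural(noun: str) -> str:
    """Attach the plural suffix, choosing -lar or -ler by the stem's last vowel.

    Assumes a lowercase stem that follows regular front/back harmony.
    """
    # Scan from the end of the stem to find the last vowel.
    for ch in reversed(noun):
        if ch in BACK_VOWELS:
            return noun + "lar"
        if ch in FRONT_VOWELS:
            return noun + "ler"
    raise ValueError(f"no vowel found in {noun!r}")


if __name__ == "__main__":
    for word in ["ev", "kitap", "göz", "okul"]:
        print(word, "->", plural(word))
```

The same front/back choice recurs in most Turkish suffixes, which is why a single lookup on the last stem vowel is enough for this simplified case.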

History

Pre-history

The homeland of the Turkic peoples and their language is suggested to be somewhere between the Transcaspian steppe and Northeastern Asia, with genetic evidence pointing to the region near South Siberia and Mongolia as the "Inner Asian Homeland" of the Turkic ethnicity. Similarly several linguists, including Juha Janhunen, Roger Blench and Matthew Spriggs, suggest that modern-day Mongolia is the homeland of the early Turkic language. Relying on Proto-Turkic lexical items about the climate, topography, flora, fauna, people's modes of subsistence, Turkologist Peter Benjamin Golden locates the Proto-Turkic Urheimat in the southern, taiga-steppe zone of the Sayan-Altay region.
Extensive contact took place between Proto-Turks and Proto-Mongols approximately during the first millennium BC; the shared cultural tradition between the two Eurasian nomadic groups is called the "Turco-Mongol" tradition. The two groups shared a similar religious system, Tengrism, and there are many evident loanwords between the Turkic and Mongolic languages. Although the loans were bidirectional, today Turkic loanwords constitute the largest foreign component in Mongolian vocabulary.
Italian historian and philologist Igor de Rachewiltz noted a significant distinction of the Chuvash language from other Turkic languages. According to him, the Chuvash language does not share certain common characteristics with Turkic languages to such a degree that some scholars consider it an independent Chuvash family similar to Uralic and Turkic languages. The Turkic classification of Chuvash was seen as a compromise solution for classification purposes.
Some lexical and extensive typological similarities between Turkic and the nearby Tungusic and Mongolic families, as well as the Korean and Japonic families, have in more recent years instead been attributed to prehistoric contact amongst the group, sometimes referred to as the Northeast Asian sprachbund. A more recent contact among the "core Altaic" families (Turkic, Mongolic, and Tungusic) is distinguished from this, owing to the existence of definitive common words that appear to have been mostly borrowed from Turkic into Mongolic, and later from Mongolic into Tungusic: Turkic borrowings into Mongolic significantly outnumber Mongolic borrowings into Turkic, and Turkic and Tungusic do not share any words that do not also exist in Mongolic.
[Figure: Old Turkic Kul-chur inscription in the Old Turkic alphabet. Töv Province, Mongolia.]
Turkic languages also show some Chinese loanwords that point to early contact during the time of Proto-Turkic.

Early written records

The first established records of the Turkic languages are the eighth century AD Orkhon inscriptions by the Göktürks, recording the Old Turkic language, which were discovered in 1889 in the Orkhon Valley in Mongolia. The Compendium of the Turkic Dialects, written during the 11th century AD by Kaşgarlı Mahmud of the Kara-Khanid Khanate, constitutes an early linguistic treatment of the family. The Compendium is the first comprehensive dictionary of the Turkic languages and also includes the first known map of the Turkic speakers' geographical distribution. It mainly pertains to the Southwestern branch of the family.
The Codex Cumanicus, concerning the Northwestern branch, is another early linguistic manual, between the Kipchak language and Latin, used by the Catholic missionaries sent to the Western Cumans inhabiting a region corresponding to present-day Hungary and Romania. The earliest records of the language spoken by the Volga Bulgars, debatably the parent or a distant relative of the Chuvash language, are dated to the 13th–14th centuries AD.

Geographical expansion and development

With the Turkic expansion during the Early Middle Ages, Turkic languages, in the course of just a few centuries, spread across Central Asia, from Siberia to the Mediterranean. Various terms from the Turkic languages have passed into Persian, Urdu, Ukrainian, Russian, Chinese, Mongolian, Hungarian and, to a lesser extent, Arabic.
The geographical distribution of Turkic-speaking peoples across Eurasia since the Ottoman era ranges from the North-East of Siberia to Turkey in the West.
For centuries, the Turkic-speaking peoples have migrated extensively and intermingled continuously, and their languages have been influenced mutually and through contact with the surrounding languages, especially the Iranian, Slavic, and Mongolic languages.
This has obscured the historical developments within each language and/or language group, and as a result, there exist several systems to classify the Turkic languages. The modern genetic classification schemes for Turkic are still largely indebted to Samoilovich.
The Turkic languages may be divided into six branches:
  • Turkic
    • Common Turkic
      • Oghuz Turkic (Southwestern)
      • Kipchak Turkic (Northwestern)
      • Karluk Turkic (Southeastern)
      • Siberian Turkic (Northeastern)
      • Arghu Turkic
    • Oghur Turkic
In this classification, Oghur Turkic is also referred to as Lir-Turkic, and the other branches are subsumed under the title of Shaz-Turkic or Common Turkic. It is not clear when these two major types of Turkic can be assumed to have diverged.
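The two-level scheme described above can be held in a small nested mapping. The sketch below is illustrative only: the names `CLASSIFICATION`, `leaf_branches`, and `path_to` are assumptions for this example, and the tree encodes just the branches listed in this section, not individual languages.

```python
from typing import Dict, List, Optional, Tuple

Tree = Dict[str, "Tree"]

# Branches as listed in this section; an empty dict marks a terminal branch.
CLASSIFICATION: Tree = {
    "Turkic": {
        "Common Turkic": {  # also called Shaz-Turkic
            "Oghuz Turkic": {},
            "Kipchak Turkic": {},
            "Karluk Turkic": {},
            "Siberian Turkic": {},
            "Arghu Turkic": {},
        },
        "Oghur Turkic": {},  # also called Lir-Turkic
    }
}


def leaf_branches(tree: Tree) -> List[str]:
    """Return the terminal branches of the tree in listing order."""
    result: List[str] = []
    for name, children in tree.items():
        if children:
            result.extend(leaf_branches(children))
        else:
            result.append(name)
    return result


def path_to(tree: Tree, target: str,
            trail: Tuple[str, ...] = ()) -> Optional[List[str]]:
    """Return the chain of nodes from the root down to `target`, or None."""
    for name, children in tree.items():
        here = trail + (name,)
        if name == target:
            return list(here)
        found = path_to(children, target, here)
        if found is not None:
            return found
    return None


if __name__ == "__main__":
    print(leaf_branches(CLASSIFICATION))
    print(path_to(CLASSIFICATION, "Karluk Turkic"))
```

Counting the terminal nodes gives the six branches named in the list: five under Common Turkic plus Oghur.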
With less certainty, the Southwestern, Northwestern, Southeastern and Oghur groups may further be summarized as West Turkic, and the Northeastern, Kyrgyz-Kipchak, and Arghu groups as East Turkic.
Geographically and linguistically, the languages of the Northwestern and Southeastern subgroups belong to the central Turkic languages, while the Northeastern and Khalaj languages are the so-called peripheral languages.
Hruschka et al. use computational phylogenetic methods to calculate a tree of Turkic based on phonological sound changes.

Schema

The following isoglosses are traditionally used in the classification of the Turkic languages:
  • Rhotacism, e.g. in the last consonant of the word for "nine" *tokkuz. This separates the Oghur branch, which exhibits /r/, from the rest of Turkic, which exhibits /z/. In this case, rhotacism refers to the development of *-/r/, *-/z/, and *-/d/ to /r/, *-/k/, *-/kh/ in this branch. See Antonov and Jacques on the debate concerning rhotacism and lambdacism in Turkic.
  • Intervocalic *d, e.g. the second consonant in the word for "foot" *hadaq
  • Suffix-final -G, e.g. in the suffix *lIG, in e.g. *tāglïg
Additional isoglosses include:
  • Preservation of word initial *h, e.g. in the word for "foot" *hadaq. This separates Khalaj as a peripheral language.
  • Denasalisation of palatal *ń, e.g. in the word for "moon", *āń
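To show how such isoglosses act as classification tests, the sketch below applies two of them in sequence: the final consonant of the reflex of *tokkuz ("nine") and the preservation of initial *h in the reflex of *hadaq ("foot"). The class `Reflexes`, the function `classify`, and the romanized word forms are assumptions chosen for illustration. A real classification would weigh many more features than these two.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Reflexes:
    """Romanized reflexes of two Proto-Turkic words in one language."""
    nine: str  # reflex of *tokkuz
    foot: str  # reflex of *hadaq


def classify(r: Reflexes) -> str:
    """Apply the rhotacism test first, then the initial *h test."""
    # Rhotacism: Oghur shows /r/ where the rest of Turkic shows /z/.
    if r.nine.endswith("r"):
        return "Oghur"
    # Preserved word-initial *h marks Khalaj as a peripheral language.
    if r.foot.startswith("h"):
        return "Common Turkic (peripheral, Khalaj-type)"
    return "Common Turkic"


# Illustrative romanizations (assumed spellings, not a vetted dataset).
SAMPLES = {
    "Turkish": Reflexes(nine="dokuz", foot="ayak"),
    "Chuvash": Reflexes(nine="tăxăr", foot="ura"),
    "Khalaj": Reflexes(nine="toqquz", foot="hadaq"),
}

if __name__ == "__main__":
    for language, reflexes in SAMPLES.items():
        print(language, "->", classify(reflexes))
```

The order of the checks mirrors the text: rhotacism separates Oghur from everything else, and the initial *h test then singles out Khalaj within Common Turkic.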


*In the standard Istanbul dialect of Turkish, the ğ in dağ and dağlı is not realized as a consonant, but as a slight lengthening of the preceding vowel.