Indo-Aryan languages
The Indo-Aryan languages are a branch of the Indo-Iranian languages in the Indo-European language family. As of 2024, there are more than 1.5 billion speakers, primarily concentrated east of the Indus River in South Asia, spread across Eastern Pakistan, Northern India, southern Nepal, Bangladesh, Sri Lanka, and Maldives. Moreover, apart from the Indian subcontinent, large immigrant and expatriate Indo-Aryan–speaking communities live in Northwestern Europe, Western Asia, North America, the Caribbean, Southeast Africa, Polynesia and Australia, along with several million speakers of Romani languages primarily concentrated in Southeastern Europe. There are over 200 known Indo-Aryan languages.
Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early Vedic Sanskrit, Sanskrit through Middle Indo-Aryan languages. The largest such languages in terms of first-speakers are Hindustani , Bengali, Punjabi, Marathi, and Gujarati. A 2005 estimate placed the total number of native speakers of the Indo-Aryan languages at nearly 900 million people. Other estimates are higher, suggesting a figure of 1.5 billion speakers of Indo-Aryan languages.
Classification
Theories
The Indo-Aryan family as a whole is thought to represent a dialect continuum, where languages are often transitional towards neighbouring varieties. Because of this, the division into languages vs. dialects is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a tree model is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the wave model.Subgroups
The following table of proposals is expanded from , and also includes subsequent classification proposals. The table lists only some modern Indo-Aryan languages.Anton I. Kogan, in 2016, conducted a lexicostatistical study of the New Indo-Aryan languages based on a 100-word Swadesh list, using techniques developed by the glottochronologist and comparative linguist Sergei Starostin. That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies showing low lexical similarity to Indo-Aryan and negligible difference with similarity to Iranian. He also calculated Sinhala–Dhivehi to be the most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features.
Inner–Outer hypothesis
The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with the newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by Rudolf Hoernlé in 1880 and refinement by George Grierson it has undergone numerous revisions and a great deal of debate, with the most recent iteration by Franklin Southworth and Claus Peter Zoller based on robust linguistic evidence. Some of the theory's sceptics include Suniti Kumar Chatterji and Colin P. Masica.Groups
The below classification follows, and.Dardic
The Dardic languages are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by George Abraham Grierson in his Linguistic Survey of India but he did not consider it to be a subfamily of Indo-Aryan. The Dardic group as a genetic grouping has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among may result from subsequent convergence".The Dardic languages are thought to be transitional with Punjabi and Pahari, as well as non-Indo-Aryan Nuristani; and are renowned for their relatively conservative features in the context of Proto-Indo-Aryan.
- Kashmiri: Kashmiri, Kishtwari, Poguli;
- Shina: Brokskad, Kundal Shahi, Shina, Ushojo, Kalkoti, Palula, Savi;
- Chitrali: Kalasha, Khowar;
- Kohistani: Bateri, Chilisso, Gowro, Indus Kohistani, Kalami, Tirahi, Torwali, Wotapuri-Katarqalai;
- Pashayi
- Kunar: Dameli, Gawar-Bati, Nangalami, Shumashti.
Northern Zone
- Eastern Pahari: Nepali, Jumli, Doteli;
- Central Pahari: Garhwali, Kumaoni;
- Western Pahari: Dogri, Kangri, Bhadarwahi, Churahi, Bhateali, Bilaspuri, Chambeali, Gaddi, Pangwali, Mandeali, Mahasu Pahari, Jaunsari, Kullui, Pahari Kinnauri, Hinduri, Sarazi, Sirmauri.
Northwestern Zone
- Punjabi
- * Central: Majhi
- * Eastern: Doabi, Malwai, Puadhi
- * Western: Inku, Saraiki, Dhanni, Jhangvi, Shahpuri, Thali
- * North-Western: Hindko, Pahari-Pothwari
- Sindhi: Sindhi, Jadgali, Kutchi, Luwati, Memoni, Khetrani, Kholosi.
Western Zone
- Rajasthani: Bagri, Marwari, Mewati, Dhundari, Harauti, Mewari, Shekhawati, Dhatki, Malvi, Nimadi, Gujari, Goaria, Loarki, Bhoyari/Pawari, Kanjari, Od, Lambadi;
- Gujarati: Gujarati, Jandavra, Saurashtra, Aer, Vaghri, Parkari Koli, Kachi Koli, Wadiyara Koli;
- Bhil: Kalto, Vasavi, Wagdi, Gamit, Vaagri Booli;
- * Northern Bhil: Bauria, Bhilori, Magari;
- * Central Bhil: Bhili proper, Bhilali, Chodri, Dhodia, Dhanki, Dubli;
- * Bareli: Palya Bareli, Pauri Bareli, Rathwi Bareli, Pardhi;
- Khandeshi
- Domaaki
- Domari
- Romani: Carpathian Romani, Balkan Romani, Vlax Romani, Baltic Romani;
- * Northern Romani
- ** British Romani: Angloromani, Welsh Romani
- ** Northwestern Romani: Sinte Romani, Finnish Kalo
Zone
- Western Hindi: Hindustani, Khariboli, Braj, Haryanvi, Bundeli, Kannauji, Parya, Sansi.
- Eastern Hindi: Bagheli, Chhattisgarhi, Surgujia, Awadhi.
Eastern Zone
- Bihari:
- * Bhojpuri, Caribbean Hindustani, Fiji Hindi;
- * Magahi, Khortha;
- * Maithili, Angika, Bajjika, Thethi, Dehati;
- * Sadanic: Nagpuri, Kurmali ;
- * Tharu: Kochila Tharu, Rana Tharu, Kathariya Tharu, Sonha Tharu, Dangaura Tharu, Chitwania Buksa, Majhi, Musasa;
- * Kumhali, Kuswaric: Danwar, Bote-Darai;
- Halbic: Halbi, Kamar, Bhunjia, Nahari;
- Odia: Baleswari, Kataki, Ganjami, Sundargadi, Sambalpuri, Desia;
- * Bodo Parja, Bhatri, Reli, Kupia;
- Bengali–Assamese
- * Bengali-Gauda: Bengali, Noakhali, Chittagonian, Sylheti, Bishnupriya Manipuri, Hajong, Chakma, Tanchangya, Thar, Rohingya;
- * Kamarupic: Assamese, Kamtapuri, Surjapuri, Rajbanshi;
Southern Zone