Haplogroup P (Y-DNA)
Haplogroup P also known as P-P295 or K2b2 is a Y-chromosome DNA haplogroup in human genetics, it forms a clade within Haplogroup K2b (K-P331). Its sister clade within K2b is K2b1.
It was proposed to redefine the root of Haplogroup P through the novel Single-nucleotide polymorphism PF5850, yet this is so far not reflected in the International Society of Genetic Genealogy Y-DNA Haplogroup Tree nor in scientific publications.
Basal PF5850* is found in Southeast Asia. Basal P-P295* is found among South and Southeast Asians as well as Oceanians, P-FT292000 with unknown distribution, and P-M45* commonly found among Siberians and Central Asians. P-M45 is the parent node of [haplogroup haplogroup Q (Y-DNA)|Q (Y-DNA)|Haplogroup Q] and [haplogroup haplogroup R (Y-DNA)|R (Y-DNA)|Haplogroup R].
The major subclades of Haplogroups P-M45, Q and R now include most males among Europeans, Native Americans, South Asians, North Africans, and Central Asians.
Origin and dispersal
Karafet et al. 2015 suggests an origin and dispersal of haplogroup P and its ancestral clade K from either South Asia or Southeast Asia as part of the early human dispersal, based on the distribution of subclades of P-P295 and more ancient clades such as K1 and K2. However, Karafet, et al. mentions that this hypothesis is "parsimonious" and K may have alternatively originated elsewhere in Eurasia and later went extinct there. According to a geneticist Spencer Wells, haplogroup K, from which haplogroup P descended, originated most likely in the Middle East or Central Asia. According to Bergstorm et al, haplogroup K2b1 found in Indigenous Australians and ancestors of haplogroup R and Q split in Southeast Asia near Sahul.The highest frequency and diversity of haplogroup P clades is observed in Southeast Asia, specifically on the Malay Peninsula and the Philippines. To date, the ancestral clade K2b has only been confirmed among the 39,000 year old Tianyuan man.
Structure
The subclades of Haplogroup P with their defining mutation:K2b
Distribution
PF5850*
PF5850* was found among a Jehai sample in Malaysia. Basal P1* was also found in one historical 19th-century Andaman islander.P*(xP1~)
This paraphyletic group comprises all subclades of P-P295 except for the main clade P-M45. The position of P1~ alias P-M45 within P is not confirmed, thus the tilde suffix.Because P2 was discovered relatively recently, it is not always clear if older studies have screened for it. Therefore, cases reported as P-P295* or K2b2* in older literature likely include P-B253.
P* exists at low to moderate levels among various groups in Island South East Asia, the South West Pacific and East Asia.
P* is found at its highest rate among members of the Aeta, a people indigenous to Luzon who formed from various ancient groups, such as Oceanians and Austronesian peoples from Taiwan. P1* is most common among individuals in Siberia and Central Asia, as well as in Southern Asian at lower frequency.
| Population | P* % | Notes |
| Papua New Guinea | 0.69 | assumed from Kayser et al. 2006 1 P* found |
| New Zealand | 0 | |
| Fiji | 0 | |
| Solomon Islands | 0 | |
| French Polynesia | 0 | |
| Vanuatu | 0 | |
| New Caledonia | ||
| Guam | 0 | |
| Samoa | 0 | |
| Kiribati | ||
| Tonga | 0 | |
| Micronesia FDR | 0 | |
| Marshall Islands | 0 | |
| American Samoa | ||
| Northern Mariana Islands | ||
| Palau | ||
| Cook Islands | 0 | |
| Wallis and Futuna | 0 | |
| Tuvalu | 0 | |
| Nauru | ||
| Norfolk Island | ||
| Niue | 0 | small sample size |
| Tokelau | 0 | small sample size |
| Hawaii | 0 | small sample size from FTDNA |
| Australia | 0 | |
| Timor | 10.8 | |
| Aeta | 28 | |
| Austronesians | 0 | |
| Malaysia | 0 | |
| Flores | 0 | |
| Sulawesi | 0.6 | |
| East Indonesia | 0 | |
| Java Indonesia | 0 | |
| Bali Indonesia | 0 | |
| Sumatra Indonesia | 0 | |
| Borneo Indonesia | 0 | |
| West Papua Province | 0 | |
| Papua Province | 0 | |
| Sumba Indonesia | 3.2 |
P-M45 (P1~)
Derived P1~ has been found among the Ancient North Eurasian Yana specimens, which carried around 29–47% ancestry from an East Eurasian source represented by the Tianyuan man.Many modern ethnic groups with high frequencies of P1~, also known as P-M45 and K2b2a, are located in Central Asia and Siberia: 35.4% among Tuvans, 28.3% among Altai-Kizhi, and 35% among Nivkh males.
| Modern population | Modern ethnolinguistic affiliation | Reference | n | Percentage | Notes/SNPs tested |
| Tuvinian | Turkic | 113 | 35.40 | P-M45 | |
| Nivkh | isolate | 17 | 35 | P-M45 | |
| Altai-Kizhi | Turkic | 92 | 28.3 | P-M45 | |
| Todjin | Turkic | 36 | 22.2 | P-M45 | |
| Chukchi | Chukotkan | 24 | 20.8 | P-M45 | |
| Koryak | Chukotkan | 27 | 18.5 | P-M45 | |
| Yupik | Eskimo-Aleut | 33 | 18.2 | P-M45 | |
| Uighur | Turkic | 70 | 17.1 | P-M45 | |
| Kalmyk | Mongolic | 68 | 11.8 | P-M45 | |
| Turkmen | Turkic | 30 | 10 | P-M45 | |
| Soyot | Turkic | 34 | 8.8 | P-M45 | |
| Uriankhai | Mongolic | 60 | 8.3 | P-M45 | |
| Khakas | Turkic | 53 | 7.6 | P-M45 | |
| Kazakh | Turkic | 54 | 5.6 | P-M45 | |
| Uzbek | Turkic | 366 | 5.5 | P-M45 | |
| Khasi-Khmuic | Austro-Asiatic | 353 | 5.40 | P-M45 § | |
| Munda | Austro-Asiatic | 64 | 10.90 | P-M45 § | |
| Nicobarese | Austro-Asiatic | 11 | 0.00 | P-M45 § | |
| Southeast Asia | Austro-Asiatic | 257 | 1.60 | P-M45 § | |
| Garo | Tibeto-Burman | 71 | 1.40 | P-M45 § | |
| India | Tibeto-Burman | 226 | 3.10 | P-M45 § | |
| East Asia | Tibeto-Burman | 214 | 0.00 | P-M45 § | |
| Eastern India | unclear/various | 54 | 18.50 | P-M45 § | |
| Southern Talysh, Iran | Iranian | 50 | 4.00 | P-M45 | |
| Northern Talysh, Azerbaijan | Iranian | 40 | 5.00 | P-M45 | |
| Mazandarani | Iranian | 50 | 4.00 | P-M45 | |
| Gilaki | Iranian | 50 | 0.00 | P-M45 | |
| Tehran | Iranian | 80 | 4.00 | P-M45 | |
| Isfahan | Iranian | 50 | 6.00 | P-M45 | |
| Bakhtiari | Iranian | 53 | 2.00 | P-M45 | |
| Iranian Arabs | Arabic | 47 | 2.00 | P-M45 | |
| North Iran | Iranian | 33 | 9.00 | P-M45 | |
| South Iran | Iranian | 117 | 3.00 | P-M45 | |
| South Caucacus | Georgian | 77 | 3.00 | P-M45 | |
| South Caucacus | Armenian | 100 | 2.00 | P-M45 | |
| Hvar | Croatian | 14 | - | ||
| Korčula | Croatian | 6 | - |
§ May include members of haplogroup [Haplogroup Haplogroup R2 (Y-DNA)|R2 (Y-DNA)|R2].
| Population group | N | P | Q | R | |||
| Count | % | Count | % | Count | % | ||
| Gope | 16 | 1 | 6.4 | ||||
| Oriya Brahmin | 24 | 1 | 4.2 | ||||
| Mahishya | 17 | 3 | 17.6 | ||||
| Bhumij | 15 | 2 | 13.3 | ||||
| Saora | 13 | 3 | 23.1 | ||||
| Nepali | 7 | 2 | 28.6 | ||||
| Muslims of Manipur | 9 | 3 | 33.3 | ||||
| Himachal Pradesh Rajput | 15 | 1 | 6.7 | ||||
| Lambadi | 18 | 4 | 22.2 | ||||
| Gujarati Patel | 9 | 2 | 22.2 | ||||
| Katkari | 19 | 1 | 5.3 | ||||
| Madia Gond | 14 | 1 | 7.1 | ||||
| Kamma Chowdary | 15 | 0 | 0 | 1 | 6.7 | 12 | 80 |
Q
Near universal in the Kets of Siberia. Very common in pre-modern Native American populations and Selkups, except for the Na-Dene peoples, where it reaches 50-90%. Also common, at 25-50% in Siberian populations such as the Siberian Tatars, Nivkh, Tuvans, Chukchi, Siberian Eskimos, Northern Altaians, and in 70% of Turkmens.R
The only discovered case of basal R* is the Mal'ta Boy in the Upper Paleolithic on the upper Angara River in the area west of Lake Baikal in the Irkutsk Oblast, Siberia, Russian Federation.R1
| Continent | Population | #No. | Total% | R-P25* | R-V88 | R-M269 | R-M73 |
| Africa | Northern Africa | 691 | 5.9% | 0.0% | 5.2% | 0.7% | 0.0% |
| Africa | Central Sahel Region | 461 | 23.0% | 0.0% | 23.0% | 0.0% | 0.0% |
| Africa | Western Africa | 123 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| Africa | Eastern Africa | 442 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| Africa | Southern Africa | 105 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| Europe | Western Europeans | 465 | 57.8% | 0.0% | 0.0% | 57.8% | 0.0% |
| Europe | North western Europeans | 43 | 55.8% | 0.0% | 0.0% | 55.8% | 0.0% |
| Europe | Central Europeans | 77 | 42.9% | 0.0% | 0.0% | 42.9% | 0.0% |
| Europe | North Eastern Europeans | 74 | 1.4% | 0.0% | 0.0% | 1.4% | 0.0% |
| Europe | Russians | 60 | 6.7% | 0.0% | 0.0% | 6.7% | 0.0% |
| Europe | Eastern Europeans | 149 | 20.8% | 0.0% | 0.0% | 20.8% | 0.0% |
| Europe | South eastern Europeans | 510 | 13.1% | 0.0% | 0.2% | 12.9% | 0.0% |
| Asia | Western Asians | 328 | 5.8% | 0.0% | 0.3% | 5.5% | 0.0% |
| Asia | Southern Asians | 288 | 4.8% | 0.0% | 0.0% | 1.7% | 3.1% |
| Asia | South eastern Asians | 10 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| Asia | North eastern Asians | 30 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| Asia | Eastern Asians | 156 | 0.6% | 0.0% | 0.0% | 0.6% | 0.0% |
| TOTAL | 5326 |
R2
Haplogroup R2 is most common in South Asia and south Central Asia, as well as diaspora populations, such as the Romanis.| Tibeto-Burman | Austro-Asiatic | Dravidian | Indo-European | |
| Tribe | 5.75% | 10.94% | 5.00% | - |
| Lower Caste | - | - | 13.79% | 10.00% |
| Middle Caste | - | - | 3.53% | 18.75% |
| Upper Caste | - | - | 10.17% | 16.28% |
| Count | Sample Size | R-M124 Frequency % | |
| UAE | 8 | 217 | 3.69% |
| Qatar | 1 | 72 | 1.39% |
| Kuwait | 1 | 153 | 0.65% |
| Yemen | 1 | 104 | 0.96% |
| Jordan | 2 | 146 | 1.37% |
| Lebanon | 2 | 935 | 0.21% |
| Palestine | 1 | 49 | 2.04% |
| Egypt | 1 | 147 | 0.68% |
| R2b% | Population |
| 10.3% | Burusho |
| 6.8% | Kalash |
| 1.0% | Pashtuns |
| 3.4% | Gujarat |
| 0.63% | Pakistan |