Tag Archives: harappaworld - Page 2

Gujaratis HarappaWorld Admixture

Someone asked for the individual HarappaWorld Admixture results for the Gujarati B from HapMap.

To refresh your memory, the Gujarati B are those individuals who do not form part of the big closely clustered Gujarati cluster.

I decided to include HGDP Sindhis as well as Gujaratis, Rajasthanis, Maharashtrians, etc from Harappa Ancestry Project in the list so the Gujaratis can be compared to the people of neighboring regions.

You can check the spreadsheet too.

UPDATE: I have added the Thathai Bhatia and Halai Bhatia participants that I had forgotten.

Pathan/Pashtun Admixture Results

Someone asked for the individual HarappaWorld Admixture results for the Pathans and Pashtuns from HGDP (23) and Harappa Ancestry Project (3). So here they are.

You can check the spreadsheet too.

HarappaWorld Oracle Update

I have updated HarappaWorld Oracle with the latest group averages.

You can download it and use the instructions here and here.

Also, please note the limitations of the Oracle.

HarappaWorld HRP0298-HRP0311

I have added the HarappaWorld Admixture results for HRP0298-HRP0311 to the individual spreadsheet.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

I have also updated the group averages.

We have an Indian adoptee participant, HRP0303. Their results seem closest to non-Brahmin Tamils.

HarappaWorld HRP0289-HRP0297

I have added the HarappaWorld Admixture results for HRP0289-HRP0297 to the individual spreadsheet.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

I also updated the results for HRP0274 using FTDNA Family Finder data instead of the Genographic 2.0 data that was originally submitted. As the Geno2 data has only 14,000 SNPs in common with my HarappaWorld calculator, it's interesting to see HRP0274's admxiture results change:

Component Geno2 FTDNA
South Indian 48.68% 46.00%
Baloch 34.22% 32.99%
Caucasian 4.33% 5.02%
Northeast Euro 3.89% 3.57%
Southeast Asian 2.75% 1.06%
Siberian 1.25% 1.87%
Northeast Asian 1.16% 1.69%
Papuan 1.14% 1.85%
American 0.87% 1.23%
Beringian 0.01% 1.23%
Mediterranean 0.00% 0.39%
Southwest Asian 1.69% 3.10%
San 0.00% 0.00%
East African 0.00% 0.00%
Pygmy 0.00% 0.00%
West African 0.00% 0.00%

The only differences greater than 1% are South Indian (2.68%), Southeast Asian (1.69%), Southwest Asian (1.41%), Baloch (1.23%), and Beringian (1.22%). It's remarkable that only 14,000 SNPs could provide us a decent result.

We have two new Gujarati participants. HRP0292, a Gujarati Jain, seems to be more similar to somewhat southern populations. HRP0294, a Gujarati Sunni Vohra, has results somewhat similar to HRP0265 (Gujarati Patel Muslim) and more north-oriented. Therefore, I have separated a new ethnic category of Gujarati Muslims in my ethnic spreadsheet. I'll have averages when I compute them next time.

We have two Indian adoptee participants as well. HRP0297 has results which match well with the Bengalis (other than the Brahmins) in this project. HRP0290's results are somewhat harder to figure out. The closest groups, not too close, are probably Tharu from Uttarakhand and Satnami from Chhattisgarh (Reich et al dataset). A ChromoPainter analysis would be more useful here.

HarappaWorld HRP0284-HRP0288

I have added the HarappaWorld Admixture results for HRP0284-HRP0288 to the individual spreadsheet.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

I have also updated the group averages (weighted) spreadsheet.

HarappaWorld HRP0273-HRP0283

I have added the HarappaWorld Admixture results for HRP0273-HRP0283 to the individual spreadsheet.

I got two participants from the Geno 2.0 Project. While I have calculated their HarappaWorld Admixture results, please note that Geno2 has only about 14,000 SNPs in common with HarappaWorld. Thus these results are very noisy.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

We got our first Pashtun participants, one Afghan and one Pakistani. Both have very similar results and are not much different than the HGDP Pathan sample average in their South Indian component.

HRP0278, a Bengali (mostly), is more East Asian components than any other Bengali participants (including my friend Razib.)

HarappaWorld HRP0253-HRP0272

I have added the HarappaWorld Admixture results for HRP0253-HRP0272 to the individual spreadsheet.

I got two participants from the Geno 2.0 Project. While I have calculated their HarappaWorld Admixture results, please note that Geno2 has only about 14,000 SNPs in common with HarappaWorld. Thus these results are very noisy.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

HarappaWorld HRP0250-HRP0252

I have added the HarappaWorld Admixture results for HRP0250-HRP0252 to the individual spreadsheet.

However, I have not recomputed the weighted averages for the Kashmiris or Bengali Brahmins. Also, I am not sure about Tamil Gounder. Wikipedia says they are Vellalars, but I don't know if I should report separate Gounder results or include in the Tamil Vellalar average.

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

HarappaWorld HRP0245-HRP0249

I have added the HarappaWorld Admixture results for HRP0245-HRP0249 to the individual spreadsheet.

I have also recomputed the weighted averages for Kurds (from 6 to 10 now).

Do note that the admixture components do not necessarily represent real ancestral populations. Also, the names I have chosen for the components should be thought of as mnemonics to ease discussion. I chose them based on which populations in my data these components peaked in. They do not tell anything directly about ancestral populations. The best way to look at these admixture results is by comparing individuals and populations. Finally, the standard error estimates on these results can be about 1%. Therefore, it is entirely possible that your 1% exotic admixture result is just noise.

Let's look at the Kurdish results from Yunusbayev (prefix: kurd), Xing (prefix: F) and Harappa (prefix: HRP). Do note that the Xing results were computed with a smaller number of SNPs and thus might be noisy.