Admixture K=10-12, HRP0001 to HRP0010

Let's continue our admixture analysis of the first batch of Harappa participants.

Here are their ethnic backgrounds and their admixture analysis results.

You might want to refer to the admixture analysis of the reference dataset.

At K=10,

Batch 1 Admixture K=10

C1 South Asian C2 Kalash
C3 Southwest Asian C4 Southeast Asian
C5 European C6 Papuan
C7 Northeast Asian C8 Siberian
C9 West African C10 East African

At K=11,

Batch 1 Admixture K=11

C1 South Asian C2 Balochistan/Caucasus
C3 Kalash C4 Southeast Asian
C5 Southwest Asian C6 European
C7 Papuan C8 Northeast Asian
C9 Siberian C10 West African
C11 East African

Note the C2 component, it sounds a bit like ANI (Ancestral North Indian) of Reich et al, though hold off on your conclusions and your excitement for now.

Also, note that this split is different from the results of Reference I K=11 admixture run where the East African split happened. However, at K=12 we get similar components.

At K=12,

Batch 1 Admixture K=12

C1 South Asian C2 Balochistan/Caucasus
C3 Kalash C4 Southeast Asian
C5 Southwest Asian C6 European
C7 Papuan C8 Northeast Asian
C9 Siberian C10 East African Bantus
C11 West African C12 East African

I am going to explore even higher values of K since the crossvalidation errors are still decreasing.

5 Comments.

  1. HP0007 and HP0009 don't look nearly as simlar as they did in some of the low K analyses. In particular, at K=12, HP0009 has C2+C3 = 20% and HP0007 has C2+C3 = 10%.

    Anyway, great work, as always. Should we next expect to see a new batch at lower values of K, this first batch at even higher values of K, or intermediate batches at the current values of K?

  2. South Asian is mostly undented at K=12 - the anticipated split did not happen. It is all the other stuff being redistributed in intriguing ways. Very interesting!

    • While the South Asian component percentages are about the same for K=12 as they were for K=9, they have changed from K=7 for Punjabis etc. Let's see what happens later.

  3. Great getting more and more interesting now.

    Here is my Charts Participant K12 and the Reference pop K12