More Reference Admixture Runs

In addition to the removals and changes in the previous set of runs, I removed the Onge, Great Andamanese and Kalash for this set.

The admixture results of this dataset are in a spreadsheet as usual and the bar chart is below.

K=10, 11, 12 are the ones with the lowest cross-validation error.

I wonder if anyone is going to mind my calling C2 at K=9 Pakistani instead of Balochistan/Caucasus? ;-)

I like K=12 here and K=12 or 13 in the previous run. So the question is which one of all these K runs with two different datasets should I use to replace the old reference I K=12 admixture runs?

Related Reading:

Related Posts:

3 Comments.

  1. Thank you Zack, I found this run particularly interesting especially achieving the essential components with the lowest number of Ks. It seems like the Kalash component is just an instance of the Balochistan/Caucasus component and by removing Kalash samples' member register with Balochistan/Caucasus. So I was wondering if the same holds for the Gujarati component. That is by removing the Gujarati-a samples a south Asian related component still emerges. I think probably not but I'm still curious to see if this happens.

  2. Ref4C Admixture | Harappa Ancestry Project - pingback on May 24, 2011 at 9:40 am
  3. Ref4C Admixture | Harappa Ancestry Project - pingback on May 24, 2011 at 9:40 am

Trackbacks and Pingbacks: