Reference 3 + Yunusbayev + HAP PCA and Mclust
Posted by Zack
on December 19, 2011
I ran Principal Component Analysis (PCA) on reference 3 along with Yunusbayev et al Caucasus dataset and Harappa Ancestry Project participants (up to HRP0200).
Then I ran mclust on the first 70 dimensions. The resulting 156 clusters can be seen in a spreadsheet.
For individuals belonging to Harappa Ancestry Project, the value in a column shows that person's probability of being in that cluster. So if there is a 1 in CL15 for example, then that person has a 100% probability of being in Cluster CL15.
For the reference population groups, I have added up the probabilities for all the individuals belonging to that group.