HapMap Gujaratis

Razib is wondering what's going on with the HapMap Houston Gujaratis.

As you can see, the Chinese simply do not vary much, and are a tight cluster. But, there is a somewhat equivalent Gujarati cluster too! The HapMap sample was collected from Gujaratis in Houston. To me, it looks like that Houston population can be divided into two groups: one of the tight cluster, and the rest of the population, which is all over the place. [...] What’s more interesting is to try and understanding what’s going on with Houston Gujaratis. Anyone in the audience know?

And his 3-dimensional PCA plot: (Those on the right are Gujaratis)
PCA Plot of Gujaratis and Chinese

So I thought I would share the admixture results for the Gujaratis for K=8. Here's the spreadsheet of the admixture proportions for Gujaratis. And here is the plot:

Gujaratis Admixture K=8

The ancestral components and their statistics are as follows:

Population Range Mean Median
C1 South Asian 64-89% 81.9% 85.8%
C2 West Asian 0-13% 2.3% 1.6%
C3 European 2-22% 7.6% 5.0%
C4 Southeast Asian 0-9% 4.9% 5.0%
C5 Austronesian 1-6% 2.8% 2.9%
C6 Northeast Asian 0-3% 0.4% 0.0%
C7 West African 0-1% 0.0% 0.0%
C8 East African 0-0% 0.0% 0.0%

It looks like a majority of the Gujarati samples have mostly South Asian ancestral component with small amounts of West Asian, European and Southeast Asian, but some Gujarati samples have much larger West Asian and/or European ancestral components.


  1. zack, yeah, that's in line with the PCA. you see a tight cluster, and then a line where there's variance of euro-south asian. what i'm curious about isn't the genetics, but the anthropology. i assume that the cluster is one caste, while the rest are many castes. something like that.

  2. Razib,

    I have no idea if this applies to your parents case, but there was a significant connection in the past between Bengal and Gujarat. Nevertheless, a 1000 year+ connection should not show on autosomal maps.

    The Gujarati script is based loosely on the Bengali one ( http://books.google.com/books?id=y3KdxBqjg5cC&pg=PA185 ) and the so called Vallabhi era of Gujarat is the same as the Gupta era and some consider the Guptas to be from Bengal. The last names of people in Gujarat today looks very different from Bengali ones but in the past they had similar names to those you find in modern Bengal. If you look at the titles of Nagar Brahmans according to a manuscripts kept by Nagars (late Vallabji Acharya of Rajkot), in 1226AD there were the thirteen titles of Gujarati Nagars – Datta, Gupta, Nanda, Ghosh, Shraman, Das, Varman, Nagadatta, Trat, Bhatt, Mitra, Dev, and Bhav. At least three of these titles – Mitra, Trat, and Datta, are also found on copperplate grants of Vallabhi from 509-766AD thus confirming the manuscripts. These titles are still current in Bengal but have disappeared from Gujarat though they are still recited in some marriage ceremonies.

    Another thing I have seen mentioned the high incidence of brachycephaly in Bengalis & Gujaratis ( http://books.google.com/books?id=Wba-EZhZcfgC&pg=PA318 ) .

    • parasar, well, my parents are being placed *into* the gujarati variance. if you added other indian pops they wouldn't show up like this. the spatial organization is dominated by gujaratis, and my parents are basically just slotted into it. if there were lots of bengalis and tamils you'd see those clusters, and i bet my parents would be in the bengali cluster.

      • Yes, I understand, and what you suggest is correct that once Bengalis are put in your parents will cluster with them. I was just a little surprised by your mother clustering with that tight Gujarati group.

        On your hitting the wall comment, one factor that could potentially be resolved further is our East Asian component. The ones shared with southern Indians I believe on future runs should be separable from the Shan and Munda kind. The same will be the case for me too though I have little less of E Asian since I am from just to the west of Bengal.

        • my mom seems a little less "euro" and more "asian" than my dad.

          On your hitting the wall comment, one factor that could potentially be resolved further is our East Asian component. The ones shared with southern Indians I believe on future runs should be separable from the Shan and Munda kind.

          2/3 to 3/4, of the east asian is not found in southern indians i think. that's what dienekes found. my ratio of east/southeast asian is consistent with burmese. i presume that munda would have more southeast asian, so that might be intermixed with the general south asian component.

          see the breakdown here:


  3. Any idea what is the actual makeup of the gujuratis in houston samples? Not exactly sure of the various groups that inhabit the gujrat, someone more familiar with gujratis perhaps can enlighten us. I can say for punjab itself , that there are a myriad of ethnic groups that would call themselves punjabi.( Kambojs( Kambujia, Cambodia) , Jatts ,Arains , Tharkhans , Khatris(e.g your kapoors etc) , Rajputs, Gujjars and many more)