Latino Dataset

Razib mentioned a Latino/Hispanic dataset to me a few days ago.

The relevant paper is "Genome-wide patterns of population structure and admixture among Hispanic/Latino populations" by Katarzyna Bryc, Christopher Velez, Tatiana Karafet, Andres Moreno-Estrada, Andy Reynolds, Adam Auton, Michael Hammer, Carlos D. Bustamante, and Harry Ostrer. The data is available on the GEO Accession viewer.

The dataset has 100 samples from Colombia, Dominican Republic, Ecuador, and Puerto Rico.

It's in the same format and uses the same chip as Behar et al. and Rasmussen et al., so it was really easy to download and convert to Plink PED format.
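For readers who have not seen the PED layout, here is a rough sketch of what one converted row looks like. It is not the conversion script I used: the function name `ped_line`, the dict of two-letter genotype calls, and the sample and marker IDs are all made up for illustration. The only fixed part is the PED convention itself: six leading columns (family ID, individual ID, father, mother, sex, phenotype) followed by two alleles per marker, with 0 for a missing allele.

```python
def ped_line(family_id, sample_id, calls, marker_order):
    """Build one PLINK PED row from a dict of marker -> two-letter call.

    The input layout is an assumption for this sketch, not the GEO format.
    """
    # Unknown parents (0 0), unknown sex (0), missing phenotype (-9).
    fields = [family_id, sample_id, "0", "0", "0", "-9"]
    for marker in marker_order:
        call = calls.get(marker, "")
        if len(call) == 2 and all(base in "ACGT" for base in call):
            fields.extend([call[0], call[1]])
        else:
            # No-calls and absent markers become PLINK's missing genotype.
            fields.extend(["0", "0"])
    return " ".join(fields)


# Hypothetical sample with one good call, one no-call, one absent marker.
row = ped_line("FAM1", "S1", {"rs1": "AG", "rs2": "--"}, ["rs1", "rs2", "rs3"])
print(row)
```

The marker order has to match the accompanying MAP file, which is why it is passed in explicitly rather than taken from the dict.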

Now what does a Hispanic dataset have to do with a South Asian genetics project? Nothing, for now. But I am collecting all genotyping data. I am also hoping that we get more participants of South Asian origin from the Caribbean and other countries of the region where there has been a longer presence of South Asians. In that case, it would be interesting to compare them against other populations of the Americas.

In keeping with my effort to clean the data of any relatives, here are the IBD/IBS analysis results. The 2nd sheet shows the two samples I removed.
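As an illustration of this kind of relatedness check, here is a minimal sketch that pulls closely related pairs out of a pairwise IBD report. It assumes the report has the column layout of a PLINK `--genome` output file (with `IID1`, `IID2`, and `PI_HAT` columns). The function name `related_pairs`, the 0.2 cutoff, and the sample rows are assumptions for the example, not the values or data behind the spreadsheet.

```python
def related_pairs(genome_text, min_pi_hat=0.2):
    """Return (IID1, IID2, PI_HAT) for pairs at or above the cutoff.

    The 0.2 default is an illustrative assumption, not a recommended value.
    """
    lines = [ln.split() for ln in genome_text.splitlines() if ln.strip()]
    header, rows = lines[0], lines[1:]
    # Look columns up by name so the code does not depend on column order.
    i1 = header.index("IID1")
    i2 = header.index("IID2")
    ip = header.index("PI_HAT")
    return [
        (r[i1], r[i2], float(r[ip]))
        for r in rows
        if float(r[ip]) >= min_pi_hat
    ]


# Made-up report: one first-degree-like pair and one unrelated pair.
example = """\
FID1 IID1 FID2 IID2 RT EZ Z0 Z1 Z2 PI_HAT PHE DST PPC RATIO
F1 A F2 B UN NA 0.0 1.0 0.0 0.5 -1 0.85 1.0 NA
F1 A F3 C UN NA 1.0 0.0 0.0 0.0 -1 0.70 0.5 2.0
"""
pairs = related_pairs(example)
print(pairs)
```

From each flagged pair, one sample would then be dropped before any further analysis; which one to drop is a judgment call that this sketch does not make.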
