Tag Archives: baloch

June Update

Posted by Zack on June 4, 2011 Comments Off

I have a total of 123 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analyses, other than individual admixture percentages etc., where I divide participants into small groups.

The following groups are represented:

Most are 23andme data while 4 are from FTDNA.

We are getting close to 100 South Asian participants.

April Update

Posted by Zack on May 1, 2011 5 comments

I have a total of 97 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analyses, other than individual admixture percentages etc., where I divide participants into small groups.

The following groups are represented:

Let's try to get to a hundred soon.

And yes, I am accepting FTDNA Family Finder (new Illumina chip) now.

End of March Update

Posted by Zack on March 27, 2011 10 comments

I have a total of 67 participants in the project right now who have sent me their raw data. This is not counting those who have relatives participating and thus have to be filtered out for most analyses, other than individual admixture percentages etc., where I divide participants into small groups.

The following groups are represented:

I need to post analyses of Tamils, Bengalis and Punjabis soon.

Another Update

Posted by Zack on March 12, 2011 28 comments

I have a total of 51 participants in the project right now who have sent me their raw data. This is not counting three people who have relatives participating and thus have to be filtered out for most analyses, other than individual admixture percentages etc., where I divide participants into small groups.

The following groups are represented:

Punjab: 7
Iran: 7
Tamil: 6
Bengal: 5
Andhra Pradesh: 2
Bihar: 2
Karnataka: 2
Caribbean Indian: 2
Kashmir: 2
Uttar Pradesh: 2
Sri Lankan: 2
Kerala: 2
Iraqi Arab: 2
Anglo-Indian: 1
Roma: 1
Goa: 1
Rajasthan: 1
Baloch: 1
Unknown: 1
Egyptian/Iraqi Jew: 1
Maharashtra: 1

I haven't received data from any new participants for more than a week, which is the longest lull since I started the Harappa Ancestry Project. So go out there and get people to send me their 23andme raw data.

Also, does anyone know if there are a significant number of South Asians who have done FamilyTreeDNA's Family Finder test? Is there a good overlap of SNPs between their test and 23andme's?

We have enough Punjabis, Iranians, Tamils and Bengalis that each group deserves its own analysis post.

Project Update

Posted by Zack on February 20, 2011 16 comments

I have a total of 42 participants in the project right now who have sent me their raw data. This is not counting two people who have relatives participating and thus have to be filtered out for most analyses, other than individual admixture percentages etc., where I divide participants into small groups.

The following groups are represented:

Punjab: 7
Iran: 6
Tamil: 5
Andhra Pradesh: 2
Bengal: 2
Bihar: 2
Karnataka: 2
Caribbean Indian: 2
Kashmir: 2
Anglo-Indian: 1
Roma: 1
Goa: 1
Uttar Pradesh: 1
Sri Lankan: 1
Rajasthan: 1
Kerala: 1
Baloch: 1
Unknown: 1

The unknown is Manu Sporny, who has put his genetic data in the public domain and whom I have drafted into our project.

In addition, out of curiosity, I have accepted data from the following:

Iraqi Arab: 2
Egyptian/Iraqi Jew: 1

I know a bunch of you have done a lot to make this project known and gotten people to submit their data. But we really do need more participants of every ethnicity and geographic region in and around South Asia. So keep on!

I am working on K=12 admixture runs for the batches we have already done. In addition, the reference I dataset will be used for even higher values of K admixture components to see where the limit is.

Also, I am looking into doing chromosome by chromosome admixture (and other analysis). I have done some experimental runs and once I have pored over that data, I'll have something to report.

As we have seen, even with the removal of the San and Pygmy, the Africans take up 3 ancestral components and most South Asians (excepting me of course) do not have any African admixture. So I am working on a reference dataset without any Africans. I have my own take on how to do that which I'll share in the next few days.

In short, my home computer is running admixture, plink, eigensoft, etc. 24x7.

HGDP

Posted by Zack on January 25, 2011 3 comments

The Human Genome Diversity Project (HGDP) is the best resource for a diverse set of genomic data. It has 1050 individuals from 52 different populations.

I got the Stanford University data, which covers 660,918 SNPs from 1,043 samples. It is claimed that the forward strand is given, but that turned out not to be true, so I had to flip strands and make sure I didn't include any strand-ambiguous A/T or C/G SNPs in my dataset.
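To illustrate why A/T and C/G SNPs have to go, here is a minimal Python sketch of the strand check. It is not the actual procedure used for the project (that work was done with the usual tools), and the function names `is_strand_ambiguous`, `flip_strand` and `harmonize` are made up for this example. The idea: complementing an A/T or C/G pair gives back the same two alleles, so there is no way to tell which strand was reported; every other SNP can be matched to a reference either directly or after flipping.

```python
# Hypothetical helper names; a sketch of the idea, not the project's pipeline.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def is_strand_ambiguous(a1, a2):
    """True for A/T and C/G SNPs: flipping the strand yields the same allele pair."""
    return COMPLEMENT.get(a1) == a2


def flip_strand(a1, a2):
    """Return the alleles as they would read on the opposite strand."""
    return COMPLEMENT[a1], COMPLEMENT[a2]


def harmonize(alleles, reference):
    """Align a SNP's alleles to a reference allele pair.

    Returns the (possibly flipped) allele pair, or None if the SNP should be
    dropped: non-ACGT alleles, strand-ambiguous pairs, or no match either way.
    """
    a1, a2 = (x.upper() for x in alleles)
    ref = frozenset(x.upper() for x in reference)
    if a1 not in COMPLEMENT or a2 not in COMPLEMENT:
        return None  # indels, missing calls, etc.
    if is_strand_ambiguous(a1, a2):
        return None  # A/T or C/G: strand cannot be determined from alleles
    if frozenset((a1, a2)) == ref:
        return a1, a2  # already on the reference strand
    flipped = flip_strand(a1, a2)
    if frozenset(flipped) == ref:
        return flipped  # reported on the opposite strand
    return None  # alleles disagree with the reference on both strands
```

With a reference pair of A/G, a SNP reported as T/C comes back flipped to A/G, while an A/T SNP is dropped regardless of the reference.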

I followed the recommendations of Rosenberg (spreadsheet) in excluding some atypical samples and relatives, leaving me with 940 samples.

I also excluded the Native American samples because we are not interested in them and they are very closely related either due to recent endogamy or ancient bottlenecks. (yeah I had the nerve to write that.)

Of the total of 876 samples, here are the numbers for our populations of interest:

Total South Asians	190
Balochi	24
Brahui	25
Burusho	25
Hazara	22
Kalash	23
Makrani	25
Pathan	22
Sindhi	24

These samples have about 541,560 SNPs in common with 23andme v2.
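A count like that comes from intersecting the SNP IDs of the two datasets. Here is a rough Python sketch of the idea, assuming the HGDP data has been converted to PLINK format (the SNP ID is the second column of a .bim file) and that the other side is a 23andme raw-data file (tab-separated, rsid first, `#` lines are comments). The function names are made up for this example, and this is not necessarily how the number above was computed.

```python
# Hypothetical helper names; a sketch under the file-format assumptions above.

def bim_snp_ids(lines):
    """Collect SNP IDs from PLINK .bim lines (whitespace-separated, ID in column 2)."""
    ids = set()
    for line in lines:
        fields = line.split()
        if len(fields) >= 2:
            ids.add(fields[1])
    return ids


def raw_23andme_snp_ids(lines):
    """Collect SNP IDs from 23andme raw-data lines (tab-separated, ID in column 1)."""
    ids = set()
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header comments and blank lines
        ids.add(line.split("\t")[0].strip())
    return ids


def shared_snp_ids(bim_lines, raw_lines):
    """Return the sorted list of SNP IDs present in both datasets."""
    return sorted(bim_snp_ids(bim_lines) & raw_23andme_snp_ids(raw_lines))
```

Both readers take any iterable of lines, so an open file handle works as well as a list; `len(shared_snp_ids(bim_file, raw_file))` then gives the overlap count. Matching on ID alone ignores SNPs that were renamed between releases, so matching on chromosome and position is a reasonable alternative.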

Harappa Ancestry Project

Genetics and South Asia
