A fun application of genetic testing is inferring ancestry: Which ancestral group are you descended from? Can we estimate the admixture of the different population groups you are descended from?
Most DNA testing companies provide information about ancestry and genetic genealogy has taken off. With several genome databases (HapMap, HGDP, etc) and software (like plink, admixture, Structure) publicly available, the days of the genome bloggers are here. And I am trying to be the latest one.
What is Harappa Ancestry Project?
It is a project to analyze (autosomal) genetic data of participants of South Asian origin for the purpose of providing detailed ancestry information. So the focus of the project is on South Asians: Indians, Pakistanis, Bangladeshis and Sri Lankans.
The project will collect 23andme and FTDNA Family Finder raw genetic data from participants to better understand the ancestry relationships of different South Asian ethnicities.
Participation
People of South Asian origin, or from neighboring countries, are eligible to participate. The list of countries of origin I am accepting are as follows:
Afghanistan
Bangladesh
Bhutan
Burma
India
Iran
Maldives
Nepal
Pakistan
Sri Lanka
Tibet
Right now, I am only accepting raw data samples from people who have tested with 23andme or FTDNA Family Finder.
Please do not send samples from close relatives. I define close relatives as 2nd cousins or closer. If you have data from yourself and your parents, it might be better to send the samples from your parents (assuming they are not related to each other) and not send your own sample.
If you are unsure if you are eligible to participate, please send me an email (harappa@zackvision.com) to inquire about it before sending off your raw data.
What to send?
Please send your All DNA raw data text file (zipped is better) downloaded from 23andme to harappa@zackvision.com along with ancestral background information about you and all four of your grandparents. Background information would include where they were born, mother tongue, caste/community to which they belonged, etc. Please provide as much ancestry information as possible and try to be specific. Do especially include information about any ancestry from outside South Asia.
Data Privacy
The raw genetic data and ancestry information that you send me will not be shared with anyone.
Your data will be used only for ancestry analysis. No analysis of physical or health/medical traits will be performed.
The individual ancestry analysis published on this blog will be done using an ID of the form HRPnnnn known to only you and me.
What do you get?
All results of ancestry analysis (individual and group) will be posted on this blog. This will include admixture analysis as well as clustering into population groups etc.
About
This is a project by Zack Ajmal.
Most DNA testing companies provide information about ancestry and genetic genealogy has taken off. With several genome databases (HapMap, HGDP, etc) and software (like plink, admixture, Structure) publicly available, the days of the genome bloggers are here. And I am trying to be the latest one.
In starting this project, I have been inspired by the Dodecad Ancestry Project by Dienekes Pontikos and Eurogenes Ancestry Project by David Wesolowski. The catalyst for this project was my friend Razib who I bug whenever I need to talk genetics.
What is Harappa Ancestry Project?
It is a project to analyze (autosomal) genetic data of participants of South Asian origin for the purpose of providing detailed ancestry information. So the focus of the project is on South Asians: Indians, Pakistanis, Bangladeshis and Sri Lankans.
The project will collect 23andme and FTDNA Family Finder raw genetic data from participants to better understand the ancestry relationships of different South Asian ethnicities.
I have named it after Harappa, an archaeological site of the Indus Valley Civilization in Punjab, Pakistan.
Participation
People of South Asian origin, or from neighboring countries, are eligible to participate. The list of countries of origin I am accepting are as follows:
Right now, I am only accepting raw data samples from people who have tested with 23andme or FTDNA Family Finder.
Please do not send samples from close relatives. I define close relatives as 2nd cousins or closer. If you have data from yourself and your parents, it might be better to send the samples from your parents (assuming they are not related to each other) and not send your own sample.
If you are unsure if you are eligible to participate, please send me an email (harappa@zackvision.com) to inquire about it before sending off your raw data.
What to send?
Please send your All DNA raw data text file (zipped is better) downloaded from 23andme to harappa@zackvision.com along with ancestral background information about you and all four of your grandparents. Background information would include where they were born, mother tongue, caste/community to which they belonged, etc. Please provide as much ancestry information as possible and try to be specific. Do especially include information about any ancestry from outside South Asia.
Data Privacy
The raw genetic data and ancestry information that you send me will not be shared with anyone.
Your data will be used only for ancestry analysis. No analysis of physical or health/medical traits will be performed.
The individual ancestry analysis published on this blog will be done using an ID of the form HRPnnnn known to only you and me.
What do you get?
All results of ancestry analysis (individual and group) will be posted on this blog. This will include admixture analysis as well as clustering into population groups etc.
You can see admixture results, PCA plots and more of current participants.
Related Reading: