Tag Archives: baloch

June Update

I have a total of 123 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • South Asian: 90
    • Tamil: 15
    • Punjab: 13
    • Bengal: 9
    • Karnataka: 7
    • Andhra Pradesh: 5
    • Uttar Pradesh: 5
    • Kerala: 5
    • Bihar: 5
    • Gujarati: 4
    • Sindhi: 4
    • Maharashtra: 3
    • Sri Lankan: 3
    • Caribbean Indian: 2
    • Kashmir: 2
    • Romani: 2
    • Goa: 1
    • Rajasthan: 1
    • Baloch: 1
    • Orissa: 1
    • Anglo-Indian: 1
    • Unknown: 1
  • Others: 33
    • Iran: 8
    • Assyrian: 3
    • Kurd: 2
    • Mexican: 2
    • Ashkenazi: 2
    • Northwest European: 2
    • Iraqi Arab: 2
    • Georgian: 1
    • Azeri: 1
    • Kazakh: 1
    • Brazilian: 1
    • Yemen: 1
    • Irish: 1
    • Egypt: 1
    • Gagauz Turk: 1
    • Afro-Belizean: 1
    • Iraqi Mandaean: 1
    • Egyptian/Iraqi Jew: 1
    • French/Madagascar/Indian: 1

Most are 23andme data while 4 are from FTDNA.

We are getting close to 100 South Asian participants.

Related Reading:

Fatal Invention: How Science, Politics, and Big Business Re-create Race in the Twenty-First Century
The Dynamics of Indian Political Factions: A Study of District Councils in the State of Maharashtra (Cambridge South Asian Studies)
The Ayatollah Begs to Differ: The Paradox of Modern Iran
Sri  Lanka - Culture Smart!: the essential guide to customs & culture

April Update

I have a total of 97 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Tamil: 14
  • Punjab: 10
  • Bengal: 7
  • Iran: 7
  • Karnataka: 6
  • Andhra Pradesh: 4
  • Uttar Pradesh: 4
  • Gujarati: 3
  • Kerala: 3
  • Maharashtra: 3
  • Assyrian: 3
  • Bihar: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Sindhi: 2
  • Sri Lankan: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Egyptian/Iraqi Jew: 1
  • Baloch: 1
  • Iraqi Kurd: 1
  • Georgian: 1
  • Azeri: 1
  • French/Madagascar/Indian: 1
  • Kazakh: 1
  • Ashkenazi: 1
  • Brazilian: 1
  • Mexican: 1
  • Unknown: 2

Let's try to get to hundred soon.

And yes, I am accepting FTDNA Family Finder (new Illumina chip) now.

Related Reading:

Understanding Iran: Everything You Need to Know, from Persia to the Islamic Republic, from Cyrus to Ahmadinejad
A history of mediaeval Jewish philosophy
The Kurds of Iraq: Building a State Within a State
Bengal's Heart (Breeds, No 7)

End of March Update

I have a total of 67 participants in the project right now who have sent me their raw data. This is not counting those who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Tamil: 11
  • Punjab: 9
  • Iran: 7
  • Bengal: 5
  • Uttar Pradesh: 4
  • Andhra Pradesh: 3
  • Kerala: 3
  • Gujarati: 3
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Sri Lankan: 2
  • Maharashtra: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Baloch: 1
  • Sindhi: 1
  • Iraqi Kurd: 1
  • Egyptian/Iraqi Jew: 1

I need to post analyses of Tamils, Bengalis and Punjabis soon.

Related Reading:

GitaSaar-Gujarati
The Sikhs of the Punjab (The New Cambridge History of India)
Rajasthan
Daughters of Kerala: Twenty-Five Short Stories by Award-Winning Authors
Circle of Bones: a Caribbean Thriller

Another Update

I have a total of 51 participants in the project right now who have sent me their raw data. This is not counting three people who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Punjab: 7
  • Iran: 7
  • Tamil: 6
  • Bengal: 5
  • Andhra Pradesh: 2
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Uttar Pradesh: 2
  • Sri Lankan: 2
  • Kerala: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Baloch: 1
  • Unknown: 1
  • Egyptian/Iraqi Jew: 1
  • Maharashtra: 1

I haven't received data from any new participants for more than a week which is the longest lull since I started Harappa Ancestry Project. So go out there and get people to send me their 23andme raw data.

Also, does anyone know if there are a significant number of South Asians who have done FamilyTreeDNA's Family Finder test? Is there a good overlap of SNPs between their test and 23andme's?

We have enough Punjabis, Iranians, Tamil and Bengalis that they deserve separate analysis posts.

Related Reading:

Embodied Working Lives: Manual Laboring in Maharashtra, India
Lonely Planet Goa and Mumbai (Regional Travel Guide)
A Concise History of the Caribbean (Cambridge Concise Histories)
A Literary History of the Arabs
FLAVOURS OF THE PAST (BRIDGET'S ANGLO-INDIAN RECIPE BOOKS)

Project Update

I have a total of 42 participants in the project right now who have sent me their raw data. This is not counting two people who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Punjab: 7
  • Iran: 6
  • Tamil: 5
  • Andhra Pradesh: 2
  • Bengal: 2
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Uttar Pradesh: 1
  • Sri Lankan: 1
  • Rajasthan: 1
  • Kerala: 1
  • Baloch: 1
  • Unknown: 1

The unknown is Manu Sporny who has put his genetic data in the public domain and I have drafted him into our project.

In addition, out of curiosity, I have accepted data from the following:

  • Iraqi Arab: 2
  • Egyptian/Iraqi Jew: 1

I know a bunch of you have done a lot to make this project known and gotten people to submit their data. But we really do need more participants of every ethnicity and geographic region in and around South Asia. So keep on!

I am working on K=12 admixture runs for the batches we have already done. In addition, the reference I dataset will be used for even higher values of K admixture components to see where the limit is.

Also, I am looking into doing chromosome by chromosome admixture (and other analysis). I have done some experimental runs and once I have pored over that data, I'll have something to report.

As we have seen, even with the removal of the San and Pygmy, the Africans take up 3 ancestral components and most South Asians (excepting me of course) do not have any African admixture. So I am working on a reference dataset without any Africans. I have my own take on how to do that which I'll share in the next few days.

In short, my home computer is running admixture, plink, eigensoft, etc. 24x7.

Related Reading:

The Big Jewish Book for Jews: Everything You Need to Know to Be a Really Jewish Jew
Molecular Genetics and Personalized Medicine (Molecular and Translational Medicine)
Merchants, Politics and Society in Early Modern India: Bihar: 1733-1820 (Brill's Indological Library, Vol 10)
Rajasthan-Delhi-Agra. An Indo-Muslim Lifestyle
Menus and Memories from Punjab: Meals to Nourish Body and Soul (Hippocrene Cookbooks)

HGDP

Human Genome Diversity Project (HGDP) is the best resource for a diverse set of genomic data. It has 1050 individuals from 52 different populations.

I got the Stanford University data which has data for 660,918 SNPs from 1,043 samples. It is claimed that the forward strand is given but that turned out not to be true and I had to flip strands and make sure I didn't include any ambiguous A/T or C/G strands in my dataset.

I followed the recommendations of Rosenberg (spreadsheet) in excluding some atypical samples and relatives, leaving me with 940 samples.

I also excluded the Native American samples because we are not interested in them and they are very closely related either due to recent endogamy or ancient bottlenecks. (yeah I had the nerve to write that.)

Of the total of 876 samples, here are the numbers for our populations of interest:

Balochi 24
Brahui 25
Burusho 25
Hazara 22
Kalash 23
Makrani 25
Pathan 22
Sindhi 24
Total South Asians 190

These samples have about 541,560 SNPs in common with 23andme v2.

Related Reading:

A Genetic and Cultural Odyssey: The Life and Work of L. Luca Cavalli-Sforza
Genome