Tag Archives: punjab

June Update

I have a total of 123 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • South Asian: 90
    • Tamil: 15
    • Punjab: 13
    • Bengal: 9
    • Karnataka: 7
    • Andhra Pradesh: 5
    • Uttar Pradesh: 5
    • Kerala: 5
    • Bihar: 5
    • Gujarati: 4
    • Sindhi: 4
    • Maharashtra: 3
    • Sri Lankan: 3
    • Caribbean Indian: 2
    • Kashmir: 2
    • Romani: 2
    • Goa: 1
    • Rajasthan: 1
    • Baloch: 1
    • Orissa: 1
    • Anglo-Indian: 1
    • Unknown: 1
  • Others: 33
    • Iran: 8
    • Assyrian: 3
    • Kurd: 2
    • Mexican: 2
    • Ashkenazi: 2
    • Northwest European: 2
    • Iraqi Arab: 2
    • Georgian: 1
    • Azeri: 1
    • Kazakh: 1
    • Brazilian: 1
    • Yemen: 1
    • Irish: 1
    • Egypt: 1
    • Gagauz Turk: 1
    • Afro-Belizean: 1
    • Iraqi Mandaean: 1
    • Egyptian/Iraqi Jew: 1
    • French/Madagascar/Indian: 1

Most are 23andme data while 4 are from FTDNA.

We are getting close to 100 South Asian participants.

Related Reading:

The Georgian Phrasebook: Fully Transliterated (Georgian Edition)
Madagascar: A Short History
British Language & Culture (Lonely Planet Language & Culture) (Language Reference)
Breve historia de Roma (Breve Historia (nowtilus)) (Spanish Edition)
Roman Blood: A Novel of Ancient Rome (Novels of Ancient Rome)

April Update

I have a total of 97 participants in the project right now who have sent me their raw data. Six of those have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Tamil: 14
  • Punjab: 10
  • Bengal: 7
  • Iran: 7
  • Karnataka: 6
  • Andhra Pradesh: 4
  • Uttar Pradesh: 4
  • Gujarati: 3
  • Kerala: 3
  • Maharashtra: 3
  • Assyrian: 3
  • Bihar: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Sindhi: 2
  • Sri Lankan: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Egyptian/Iraqi Jew: 1
  • Baloch: 1
  • Iraqi Kurd: 1
  • Georgian: 1
  • Azeri: 1
  • French/Madagascar/Indian: 1
  • Kazakh: 1
  • Ashkenazi: 1
  • Brazilian: 1
  • Mexican: 1
  • Unknown: 2

Let's try to get to hundred soon.

And yes, I am accepting FTDNA Family Finder (new Illumina chip) now.

Related Reading:

The Making of Southern Karnataka: Society Polity and Culture in the Early Medieval Period, AD 400-1030
Colloquial Tamil: The Complete Course for Beginners (Colloquial Series)
Mexico
Le tour du monde en quatre-vingts jours (French Edition)
Azeri Folksongs: At The Fountain-head Of Music

End of March Update

I have a total of 67 participants in the project right now who have sent me their raw data. This is not counting those who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Tamil: 11
  • Punjab: 9
  • Iran: 7
  • Bengal: 5
  • Uttar Pradesh: 4
  • Andhra Pradesh: 3
  • Kerala: 3
  • Gujarati: 3
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Sri Lankan: 2
  • Maharashtra: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Baloch: 1
  • Sindhi: 1
  • Iraqi Kurd: 1
  • Egyptian/Iraqi Jew: 1

I need to post analyses of Tamils, Bengalis and Punjabis soon.

Related Reading:

Ancient Rights and Future Comfort: Bihar, the Bengal Tenancy Act of 1885, and British Rule in India (London Studies on South Asia)
The Secret Keeper
Kerala: Tropical Beaches & Backwater Villages (Lonely Planet Travel Guide)
The Dynamics of Indian Political Factions: A Study of District Councils in the State of Maharashtra (Cambridge South Asian Studies)

Another Update

I have a total of 51 participants in the project right now who have sent me their raw data. This is not counting three people who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Punjab: 7
  • Iran: 7
  • Tamil: 6
  • Bengal: 5
  • Andhra Pradesh: 2
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Uttar Pradesh: 2
  • Sri Lankan: 2
  • Kerala: 2
  • Iraqi Arab: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Rajasthan: 1
  • Baloch: 1
  • Unknown: 1
  • Egyptian/Iraqi Jew: 1
  • Maharashtra: 1

I haven't received data from any new participants for more than a week which is the longest lull since I started Harappa Ancestry Project. So go out there and get people to send me their 23andme raw data.

Also, does anyone know if there are a significant number of South Asians who have done FamilyTreeDNA's Family Finder test? Is there a good overlap of SNPs between their test and 23andme's?

We have enough Punjabis, Iranians, Tamil and Bengalis that they deserve separate analysis posts.

Related Reading:

Choosing a Jewish Life: A Handbook for People Converting to Judaism and for Their Family and Friends
The Poison Tree, A Tale of Hindu Life in Bengal
Goa Travel Guide - What To See & Do In 2012
Rajasthan (India Travel Guides)
Menus and Memories from Punjab: Meals to Nourish Body and Soul (Hippocrene Cookbooks)

Project Update

I have a total of 42 participants in the project right now who have sent me their raw data. This is not counting two people who have relatives participating and thus have to be filtered out for most analysis other than individual admixture percentages etc where I divide participants into small groups.

The following groups are represented:

  • Punjab: 7
  • Iran: 6
  • Tamil: 5
  • Andhra Pradesh: 2
  • Bengal: 2
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Kashmir: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Goa: 1
  • Uttar Pradesh: 1
  • Sri Lankan: 1
  • Rajasthan: 1
  • Kerala: 1
  • Baloch: 1
  • Unknown: 1

The unknown is Manu Sporny who has put his genetic data in the public domain and I have drafted him into our project.

In addition, out of curiosity, I have accepted data from the following:

  • Iraqi Arab: 2
  • Egyptian/Iraqi Jew: 1

I know a bunch of you have done a lot to make this project known and gotten people to submit their data. But we really do need more participants of every ethnicity and geographic region in and around South Asia. So keep on!

I am working on K=12 admixture runs for the batches we have already done. In addition, the reference I dataset will be used for even higher values of K admixture components to see where the limit is.

Also, I am looking into doing chromosome by chromosome admixture (and other analysis). I have done some experimental runs and once I have pored over that data, I'll have something to report.

As we have seen, even with the removal of the San and Pygmy, the Africans take up 3 ancestral components and most South Asians (excepting me of course) do not have any African admixture. So I am working on a reference dataset without any Africans. I have my own take on how to do that which I'll share in the next few days.

In short, my home computer is running admixture, plink, eigensoft, etc. 24x7.

Related Reading:

Ambush Alley: The Most Extraordinary Battle of the Iraq War
Essential Andhra Cookbook with Hyderabadi and....
The Politics of Ethnicity in Pakistan: The Baloch, Sindhi and Mohajir Ethnic Movements (Routledge Contemporary South Asia Series)
The Ascendancy of the Congress in Uttar Pradesh: Class, Community and Nation in Northern India, 1920-1940 (Anthem World History)
Dry Grain Farming Families: Hausalund (Nigeria) and Karnataka (India) Compared

Latest on Participants

I have a total of 31 participants in the project right now who have sent me their raw data. The following groups are represented:

  • Punjab: 7
  • Tamil: 4
  • Iran: 4
  • Andhra Pradesh: 2
  • Bengal: 2
  • Bihar: 2
  • Karnataka: 2
  • Caribbean Indian: 2
  • Anglo-Indian: 1
  • Roma: 1
  • Kashmir: 1
  • Goa: 1
  • Uttar Pradesh: 1
  • Sri Lankan: 1

Keep them coming!

I am going to get some admixture analysis on the second batch (HRP0011 to HRP0020) done this week.

Related Reading:

Time Out Mumbai and Goa (Time Out Guides)
Bengal Breed Profile (Your Cat Magazine Breed Profiles)
Bengal Tigers (Asian Animals)
The Sri Lanka Reader: History, Culture, Politics (The World Readers)

Participation Update

I have a total of 23 participants in the project right now who have sent me their raw data. The following groups are represented:

  • Punjab: 7
  • Tamil: 4
  • Iran: 3
  • Bengal: 2
  • Andhra Pradesh: 2
  • Bihar: 1
  • Anglo-Indian: 1
  • Roma: 1
  • Karnataka: 1
  • Kashmir: 1

There is still a lot of ethnicities and regions missing. Uttar Pradesh comes to mind as the biggest one.

Related Reading:

Family Tree Pocket Reference
A Time to Betray: The Astonishing Double Life of a CIA Agent Inside the Revolutionary Guards of Iran
Tamil for Beginners
Pre- and Protohistoric Andhra Pradesh Up to 500 B.C.
The Coolest Startups in America (Volume 1)

Xing et al Data

The data for Xing et al's paper "Toward a more uniform sampling of human genetic diversity: a survey of worldwide populations by high-density genotyping" is available online.

This dataset consists of 850 individuals, but 259 of them overlap with the HapMap. Another 15 samples had to be removed because they were too similar to others. I also removed Native American samples. This leaves us with 529 samples.

Ethnic group Count
Slovenian 25
Punjabi Arain 25
N. European 25
Nepalese 25
Kyrgyzstani 25
Iban 25
Buryat 25
Bambaran 25
Andhra Pradesh Brahmin 25
Kurd 24
Dogon 24
Irula 23
Thai 22
Pygmy 22
Urkarah 18
Tamil Nadu Brahmin 14
Hema 14
Tongan 13
Tamil Nadu Dalit 13
Samoan 13
!Kung 13
Japanese 13
Andhra Pradesh Mala 11
Pedi 10
Andhra Pradesh Madiga 10
Alur 10
Nguni 9
Sotho/Tswana 8
Vietnamese 7
Stalskoe 5
Chinese 5
Khmer Cambodian 3

This dataset is valuable because it contains several South Asian, Central Asian, Southeast Asian and Caucasian groups. However, it does not have a good SNP overlap with 23andme and the other datasets. It has only about 29,000 SNPs in common with 23andme v2 data. Combining HapMap, HGDP, SGVP, Behar et al and Xing et al with 23andme data leaves us with 25,000 SNPs. Due to that, I'll be using Xing et al data for only a few analyses.

Related Reading:

Lonely Planet Trekking in the Nepal Himalaya (Walking)
Arain
The Case for Sanctions Against Israel
Thirukkural

Participants So Far

While I am analyzing the data, checking for errors and making sure the results I am getting are valid, here is some information about participants till now.

So far I have got 11 participants send me their raw data. Of these eleven, ten have some South Asian ancestry.

The regions/ethnicities they cover are:

  • Punjab
  • Bengal
  • Bihar
  • Tamil Nadu
  • Telegu
  • Anglo-Indian

Of these, Punjabis are the only ones I have multiple samples of. So I definitely need more samples of the other ethnicities. And there are lots of ethnicities/regions I haven't gotten any participants in.

It would be great for this project if we got a few participants from each state/province of India and Pakistan. So if you know someone who is from our target regions and has tested with 23andme, please spread the word.

If you tested with 23andme during their Christmas sale, I am hearing that results are going to start coming in starting today.

Related Reading:

Deep Ancestry: Inside The Genographic Project
Bengal Tigers (Asian Animals)
Genome: The Autobiography of a Species in 23 Chapters (P.S.)
The Family Tree Problem Solver: Tried-and-True Tactics for Tracing Elusive Ancestors