Tag Archives: ibd - Page 2

Behar Redo

As part of my effort to create one big reference dataset for my use, I have been going over all the datasets I have and make sure there's no duplicates or relatives or any other strange things that could cause issues with my analysis.

So I went back to the Behar et al dataset, which you can download from the GEO Accession website.

I found three set of duplicates and two pairs with very high identity-by-descent values, which I calculated using Plink. You can see the samples with PI_HAT greater than 0.5 in this spreadsheet. PI_HAT is the proportion IBD estimated by plink. Notice also that all these pairs also have high IBS similarity (the DSC column), more than 83% similar.

The five samples I have removed as a result of this are listed in this spreadsheet.

Related Reading:

The How to Make Money in Stocks Complete Investing System: Your Ultimate Guide to Winning in Good Times and Bad
How to Make Money in Stocks Success Stories: New and Advanced Investors Share Their Winning Secrets
Reference and Information Services: An Introduction, Third Edition
Recipes for the Specific Carbohydrate Diet: The Grain-Free, Lactose-Free, Sugar-Free Solution to IBD, Celiac Disease, Autism, Cystic Fibrosis, and Other Health Conditions (Healthy Living Cookbooks)
Student Lab Notebook: 100 Top Bound Carbonless Duplicate Sets

HapMap Redo

As part of my effort to create one big reference dataset for my use, I have been going over all the datasets I have and make sure there's no duplicates or relatives or any other strange things that could cause issues with my analysis.

So I went back to HapMap, which you can download from their website. I am using HapMap 3 public release #3 from May 28, 2010.

I found one set of duplicates, NA21344 is identical to NA21737. And a whole bunch of pairs with high identity-by-descent values, which I calculated using Plink. You can see the samples with PI_HAT greater than 0.5 in this spreadsheet. PI_HAT is the proportion IBD estimated by plink. Notice also that all these pairs also have high IBS similarity (the DSC column), more than 85% similar in fact.

All the 41 samples I have removed as a result of this are listed in this spreadsheet.

Related Reading:

WIN At Duplicate Bridge: Bid Difficult Bridge Hands Like An Expert
The IBD Healing Plan and Recipe Book: Using Whole Foods to Relieve Crohn's Disease and Colitis
Race Decoded: The Genomic Fight for Social Justice
How to Make Money in Stocks Success Stories: New and Advanced Investors Share Their Winning Secrets
How to Make Money in Stocks Getting Started: A Guide to Putting CAN SLIM Concepts into Action