Pan-Asian to PED Conversion

Posted by Zack on April 16, 2011

Even though the Pan-Asian dataset is not public, there was a request for my script to convert the data to Plink's PED format.

Here is how I convert the Pan-Asian data to Plink's transposed file format.

#!/usr/bin/perl -w
 
$file="Genotypes_All.txt";
 
open(INFILE,"<",$file);
open(TFAM,">","panasian.tfam");
open(TPED,">","panasian.tped");
 
$line = <INFILE>;
chomp $line;
@first = split('\t',$line);
foreach my $sample (5..$#first) {
        print TFAM "0 $first[$sample] 0 0 0 -9\n";
}
 
my $alleles;
 
while(<INFILE>) {
        chomp;
        @lines = split('\t',$_);
        my ($major,$minor) = split('/',$lines[4]);
        print TPED "$lines[2] $lines[1] 0 $lines[3]";
        foreach my $snp (5..$#lines) {
                if ($lines[$snp] == 0) {
                        $alleles = "$major $major";}
                elsif ($lines[$snp] == 1) {
                        $alleles = "$major $minor";}
                elsif ($lines[$snp] == 2) {
                        $alleles = "$minor $minor";}
                else {
                        $alleles = "0 0";}
                print TPED " $alleles";
        }
        print TPED "\n";
}
 
close(INFILE);
close(TFAM);
close(TPED);

Again, no guarantees! It's Perl though, so it should be more stable across various operating systems.

Codeconversion, pan-asian, plink

← Reference 3 Admixture

Behar Paniya →

8 Comments.

sarabjeet April 16, 2011 at 12:05 pm

hats off to you!
mallu April 16, 2011 at 6:07 pm

you are the man. keep it up.
Davidski April 19, 2011 at 6:58 am

Hey Zack, do you know of a way to output a list of samples in a particular order when using the --keep flag in PLINK?
- Zack April 19, 2011 at 8:05 am
  
  Not that I know of.
edg May 18, 2012 at 6:16 am

How about the .map file?
- Zack May 18, 2012 at 6:39 pm
  
  Simply use plink to convert from tped/tfam to ped/map or bed/bim/fam.
  - edg June 22, 2012 at 6:21 am
    
    I cant find a command in plink that does that... do you know what it is?
    - edg June 22, 2012 at 6:38 am
      
      nevermind, itÂ´s done 🙂 Thanks for the good script!

Harappa Ancestry Project

Genetics and South Asia

Pan-Asian to PED Conversion

Related

8 Comments.

Contact

My Sites

Data

Affiliate DNA Tests

Categories

Archives

Recent Comments

Blogroll

Harappa Ancestry Project

Genetics and South Asia

Pan-Asian to PED Conversion

Share this:

Related

8 Comments.

Contact

My Sites

Data

Affiliate DNA Tests

Categories

Tags

Archives

Recent Comments

Blogroll