TDT<sub>P</sub>

Users' documentation for TDT_P in c++

Installation

Note: Please, substitute <ver> by the current version in all explantions below.

Compiled version:

Compiled version for linux.-After you have succesfully obtained the archive you can execute the following Unix commands to install the program:

gzip TDTP<ver>.gz

Source code version:

Source code version for linux.-After you have succesfully obtained the archive you can execute the following Unix commands to install the program:

gzip TDTP<ver>.tar.gz
tar xvf TDTP<ver>.tar
cd TDTP<ver>
make

Execution

To use TDTP, use the following command line:

./TDTP<ver> <filename> <solve phase> <window width> <window offset>

where <filename> is the input file with format as shown below.

A second argument can be used to change the default value (0) which considers input data already ordered by haplotypes (i.e., the first allele at each snp come from the father, the second from the mother). If 1 is used, the E-M algorithm considering families will be used to estimate haplotype frequencies, and haplotypes will be weighted by their frequencies (Zhang et al. 2003, "TDT based on haplotype sharing for tightly linked markers", AJHG 73, pp.566-69).
A third argument can be used to change the default window width which is set to 10.
A third argument can be used to change the default window offset which is set to 1.

Format of the input file

A text file of SNP data set, in post-makeped format as described here:

One row per individual, 7 + 2xTotalSNPs columns, with TotalSNPs being the number of SNPs. Columns must be separared by a blank or tab character.Each column must contain an integer. See an example of input file.

Description of columns:

Column 1: family code
Column 2: individual code
Column 3: father code (0 if the individual is not a child)
Column 4: mother code (0 if the individual is not a child)
Column 5: gender (not used by rTDT)
Column 6: affection status (not used by rTDT)
Column 7: another individual id (not used by rTDT)
Columns 8 until 7+2*TotalSNPs: SNPs values (each SNP uses two contiguous columns). Example: if SNP1 value for an individual is 12 (heterozygous), column8 will have the value 1 and column9 will have the 2.

Output for the program TDTP

The output will be written at a file with the same name as the input file but with the extension ".cal". The output file is a csv text file with one line for each SNPs and one column for each method (mTDT_P, mTDT_2G, mTDT, mTDT_1T and mTDT_1U).