Users' documentation for TDTP in c++


Installation

Note: Please, substitute <ver> by the current version in all explantions below.

Compiled version:

Compiled version for linux.-After you have succesfully obtained the archive you can execute the following Unix commands to install the program:

Source code version:

Source code version for linux.-After you have succesfully obtained the archive you can execute the following Unix commands to install the program:

Execution

To use TDTP, use the following command line:

./TDTP<ver> <filename> <solve phase> <window width> <window offset>

where <filename> is the input file with format as shown below.

A second argument can be used to change the default value (0) which considers input data already ordered by haplotypes (i.e., the first allele at each snp come from the father, the second from the mother). If 1 is used, the E-M algorithm considering families will be used to estimate haplotype frequencies, and haplotypes will be weighted by their frequencies (Zhang et al. 2003, "TDT based on haplotype sharing for tightly linked markers", AJHG 73, pp.566-69).
A third argument can be used to change the default window width which is set to 10.
A third argument can be used to change the default window offset which is set to 1.

 

Format of the input file

A text file of SNP data set, in post-makeped format as described here:

One row per individual, 7 + 2xTotalSNPs columns, with TotalSNPs being the number of SNPs. Columns must be separared by a blank or tab character.Each column must contain an integer. See an example of input file.

Description of columns:

Output for the program TDTP

The output will be written at a file with the same name as the input file but with the extension ".cal". The output file is a csv text file with one line for each SNPs and one column for each method (mTDTP, mTDT2G, mTDT, mTDT1T and mTDT1U).