Related Tools

MOSAIK
GigaBayes

Other Links

MDACC BCB
MDACC Biostatistics

Author

Xiaoping Su

Introduction


Tumor purity is an estimate of how much of the tumor genome sequence differs from that of the matched normal tissue as a result of mutation. A sample consisting only of tumor cells (containing no normal tissue) should show a mean mutant allele frequency of 0.5 at heterozygous loci with somatic mutations. Contamination of tumor tissue with normal tissue, the level of which is sample-dependent, lowers the observed mutant allele frequencies for both homozygous and heterozygous somatic mutations. We developed a novel algorithm, PurityEst (tumor purity estimation), to infer tumor purity from the differential allelic representation at heterozygous loci with somatic mutations in a tumor sample with a matched normal tissue sample.
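The relationship between contamination and mutant allele frequency can be illustrated with a small Python sketch. This is not the PurityEst algorithm. It is a simplified model that assumes diploid, copy-neutral tumor and normal cells, under which a heterozygous somatic mutation in a sample of purity p has an expected mutant allele frequency of p / 2. The function names and the example frequencies are hypothetical.

```python
from statistics import mean


def expected_het_vaf(purity):
    """Expected mutant allele frequency at a heterozygous somatic locus.

    Assumption (not from PurityEst): tumor and normal cells are both
    diploid and copy-neutral, so only one of the two tumor alleles
    carries the mutation and normal cells carry none.
    """
    if not 0.0 <= purity <= 1.0:
        raise ValueError("purity must be between 0 and 1")
    return purity / 2.0


def naive_purity(mutant_allele_freqs):
    """Naive purity estimate: twice the mean mutant allele frequency.

    The result is capped at 1.0 because sampling noise can push the
    mean frequency above 0.5.
    """
    freqs = list(mutant_allele_freqs)
    if not freqs:
        raise ValueError("need at least one heterozygous somatic locus")
    return min(1.0, 2.0 * mean(freqs))


# Hypothetical mutant allele frequencies at four heterozygous somatic loci.
example_freqs = [0.30, 0.35, 0.25, 0.30]
print(f"estimated purity: {naive_purity(example_freqs):.2f}")
```

Under this model a pure tumor sample gives an expected frequency of 0.5, matching the statement above, and any admixture of normal tissue pulls the frequency below 0.5.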

For help with PurityEst, please email xsu1@mdanderson.org

Downloading PurityEst


The source code and sample input files can be downloaded here.

File                 Link
PurityEst.tar.gz     download

Running PurityEst


PurityEst takes GFF files as input. GFF is a widely used format, and the files used here are a customized variant of it. The GFF files for somatic mutation detection are generated by GigaBayes.
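For readers unfamiliar with GFF, the following Python sketch reads one record in the standard nine-column, tab-separated layout. It assumes GFF3-style `key=value` attributes separated by semicolons. The customized fields that PurityEst expects are not described here, so the function name, the example record, and its attribute names (`alleles`, `depth`) are hypothetical and should be checked against the sample input files in the download.

```python
def parse_gff_line(line):
    """Split one GFF record into its nine standard columns.

    Assumption: the attribute column uses 'key=value' pairs separated
    by ';'. The real GigaBayes output may differ.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    seqid, source, feature, start, end, score, strand, phase, attributes = fields

    attrs = {}
    for item in attributes.split(";"):
        item = item.strip()
        if not item:
            continue  # skip empty entries, e.g. after a trailing ';'
        key, sep, value = item.partition("=")
        attrs[key] = value if sep else ""

    return {
        "seqid": seqid,
        "source": source,
        "type": feature,
        "start": int(start),
        "end": int(end),
        "score": score,
        "strand": strand,
        "phase": phase,
        "attributes": attrs,
    }


# Made-up record for illustration only; not taken from real GigaBayes output.
example = "chr1\tGigaBayes\tSNP\t12345\t12345\t0.99\t+\t.\talleles=A,G;depth=40"
record = parse_gff_line(example)
print(record["seqid"], record["start"], record["attributes"])
```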