[-] Show simple item record

dc.contributor.authorLin, Guan Ning, 1978-eng
dc.contributor.authorCai, Zhipengeng
dc.contributor.authorLin, Guohuieng
dc.contributor.authorChakraborty, Sounakeng
dc.contributor.authorXu, Dong, 1965-eng
dc.contributor.otherUniversity of Missouri-Columbia. College of Arts and Sciences. Department of Statisticseng
dc.date.issued2009eng
dc.descriptiondoi:10.1186/1471-2105-10-S1-S5eng
dc.description.abstractWith the increasing availability of whole genome sequences, it is becoming more and more important to use complete genome sequences for inferring species phylogenies. We developed a new tool ComPhy, 'Composite Distance Phylogeny', based on a composite distance matrix calculated from the comparison of complete gene sets between genome pairs to produce a prokaryotic phylogeny. The composite distance between two genomes is defined by three components: Gene Dispersion Distance (GDD), Genome Breakpoint Distance (GBD) and Gene Content Distance (GCD). GDD quantifies the dispersion of orthologous genes along the genomic coordinates from one genome to another; GBD measures the shared breakpoints between two genomes; GCD measures the level of shared orthologs between two genomes. The phylogenetic tree is constructed from the composite distance matrix using a neighbor joining method. We tested our method on 9 datasets from 398 completely sequenced prokaryotic genomes. We have achieved above 90% agreement in quartet topologies between the tree created by our method and the tree from the Bergey's taxonomy. In comparison to several other phylogenetic analysis methods, our method showed consistently better performance. ComPhy is a fast and robust tool for genome-wide inference of evolutionary relationship among genomes.eng
dc.description.sponsorship"This work was supported in part by NSF/ITR-IIS-0407204."eng
dc.identifier.citationBMC Bioinformatics 2009, 10(Suppl 1):S5.eng
dc.identifier.urihttp://hdl.handle.net/10355/9126eng
dc.publisherBioMed Centraleng
dc.relation.ispartofStatistics publications (MU)eng
dc.subjectbacteria classificationeng
dc.subjectGene Composite Distanceeng
dc.subjectphylogeny construction toolseng
dc.subject.lcshProkaryotes -- Phylogenyeng
dc.subject.lcshBacteria -- Phylogenyeng
dc.subject.lcshGene mappingeng
dc.subject.lcshProkaryotes -- Geneome mappingeng
dc.subject.lcshBacteria -- Genome mappingeng
dc.titleComPhy: Prokaryotic Composite Distance Phylogenies Inferred from Whole-Genome Gene Setseng
dc.typeArticleeng


Files in this item

[PDF]

This item appears in the following Collection(s)

  • Statistics publications (MU)
    The items in this collection are the scholarly output of the faculty, staff, and students of the Department of Statistics.

[-] Show simple item record