Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Heuristics for the Gene-Duplication Problem: A Θ(n) Speed-Up for the Local Search

Mukul S. BansalContact Information, J. Gordon BurleighContact Information, Oliver EulensteinContact Information and André WeheContact Information

(1)  Department of Computer Science, Iowa State University, Ames, IA, USA
(2)  National Evolutionary Synthesis Center Durham, NC, USA
(3)  Department of Electrical and Computer Engineering, Iowa State University, Ames, IA, USA
Abstract
The gene-duplication problem is to infer a species supertree from a collection of gene trees that are confounded by complex histories of gene duplications. This problem is NP-hard and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. We show how this local search problem can be solved efficiently by reusing previously computed information. This improves the running time of the current solution by a factor of n, where n is the number of species in the resulting supertree solution, and makes the gene-duplication problem more tractable for large-scale phylogenetic analyses. We verify the exceptional performance of our solution in a comparison study using sets of large randomly generated gene trees. Furthermore, we demonstrate the utility of our solution by incorporating large genomic data sets from GenBank into a supertree analysis of plants.
During this research, O.E. and M.S.B. were supported in part by NSF grant no. 0334832 and J.G.B. by NESCent NSF grant no. EF-0423641.

Contact Information Mukul S. Bansal
Email: bansal@cs.iastate.edu

Contact Information J. Gordon Burleigh
Email: jgb12@duke.edu

Contact Information Oliver Eulenstein
Email: oeulenst@cs.iastate.edu

Contact Information André Wehe
Email: awehe@iastate.edu
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Referenced by
2 newer articles

  1. Thomson, R. C. (2009) Sparse Supermatrices for Phylogenetic Inference: Taxonomy, Alignment, Rogue Taxa, and the Phylogeny of Living Turtles. Systematic Biology
    [CrossRef]
  2. Wehe, A. (2008) DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics 24(13)
    [CrossRef]
Remote Address: 38.107.191.110 • Server: mpweb20
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)