ABSTRACT
The Persistent-Phylogeny Model is an extension of the widely studied Perfect-Phylogeny Model, encompassing a broader range of evolutionary phenomena. Biological and algorithmic questions concerning persistent phylogeny have been intensely investigated in recent years. In this paper, we explore two alternative approaches to the persistent-phylogeny problem that grow out of our previous work on perfect phylogeny, and on galled trees. We develop an integer programming solution to the Persistent-Phylogeny Problem; empirically explore its efficiency; and empirically explore the utility of using fast algorithms that recognize galled trees, to recognize persistent phylogeny. The empirical results identify parameter ranges where persistent phylogeny are galled trees with high frequency, and show that the integer programming approach can efficiently identify persistent phylogeny of much larger size than has been previously reported.
- P. Bonizzoni, C. Braghin, R. Dondi, and G. Trucco. The binary perfect phylogeny with persistent characters. Theoretical Computer Science, 454:51--63, 2012. Google ScholarDigital Library
- P. Bonizzoni, A. Carrieri, G. Della Vedova, R. Dondi, and T. Przytycka. When and how the perfect phylogeny model explains evolution. In N. Jonoska and M. Saito, editors, Discrete and Topological Models in Molecular Biology, Natural Computing Series, chapter 4. Springer, 2013.Google Scholar
- P. Bonizzoni, A. P. Carrieri, G. Della Vedova, and G. Trucco. Algorithms for the constrained perfect phylogeny with consistent characters, 2014. arXiv:1405.7497v1.Google Scholar
- P. Bonizzoni, A. P. Carrieri, G. Della Vedova, and G. Trucco. Explaining evolution via constrained persistent perfect phylogeny. BMC Genomics, 15(Suppl 6):S10, 2014.Google ScholarCross Ref
- International HapMap Consortium. Integrating common and rare genetic variation in diverse human populations. Nature, 467:52--58, 2010.Google ScholarCross Ref
- N. C. Crawford and T. C. Glenn et al. More than 1000 ultraconserved elements provide evidence that turtles are the sister group of archosaurs. Biology Letters, 8:783 786, 2012.Google Scholar
- L. Dollo. Le lois de l'évolution. Bulletin de la Societé Belge de Géologie de Paléontologie et d'Hydrologie, 7:164--167, 1893.Google Scholar
- J. Felsenstein. Inferring Phylogenies. Sinauer, 2004.Google Scholar
- G. Gundem and S. Bova et al. The evolutionary history of lethal metestatic prostate concer. Nature, 520:353--357, 2015.Google ScholarCross Ref
- D. Gusfield. wwwcsif.cs.ucdavis.edu/~gusfield/.Google Scholar
- D. Gusfield. Efficient algorithms for inferring evolutionary history. Networks, 21:19--28, 1991.Google ScholarCross Ref
- D. Gusfield. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, 1997. Google ScholarCross Ref
- D. Gusfield. ReCombinatorics: The Algorithmics of Ancestral Recombination Graphs and Explicit Phylogenetic Networks. MIT Press, 2014. Google ScholarCross Ref
- D. Gusfield, S. Eddhu, and C. Langley. Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. Journal of Bioinformatics and Computational Biology, 2(1):173--213, 2004.Google ScholarCross Ref
- D. Gusfield, Y. Frid, and D. Brown. Integer programming formulations and computations solving phylogenetic and population genetic problems with missing or genotypic data. In Proceedings of 13th Annual International Conference on Combinatorics and Computing, pages 51--64. LNCS 4598, Springer, 2007. Google ScholarDigital Library
- A. Heimberg and R. C. Salari et al. MicroRNAs reveal the interrelationships of hagfish, lampreys and gnathostomes and the nature of the ancestral vertebrate. Proceedings of the National Academy of Sciences (USA), 107:19379--19383, 2010.Google Scholar
- D. M. Hillis. SINEs of the perfect character. Proceedings of the National Academy of Sciences (USA), 96:9979--9981, 1999.Google ScholarCross Ref
- R. Hudson. Generating samples under the Wright-Fisher neutral model of genetic variation. Bioinformatics, 18(2):337--338, 2002.Google ScholarCross Ref
- R. Hudson and N. Kaplan. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics, 111:147--164, 1985.Google ScholarCross Ref
- I. Pe'er, T. Pupko, R. Shamir, and R. Sharan. Incomplete directed perfect phylogeny. SIAM J. Comput., 33:590--607, 2004. Google ScholarDigital Library
- T. Przytycka. Stability of characters and construction of phylogenetic trees. Journal of Computational Biology, 14:539--549, 2007.Google ScholarCross Ref
- T. Przytycka, G. Davis, N. Song, and D. Durand. Graph theoretical insights into evolution of multidomain proteins. Journal of Computational Biology, 13:351--363, 2006.Google ScholarCross Ref
- D. A. Ray, J. Xing, A.-H. Salem, and M. A. Batzer. SINEs of the nearly perfect character. Systematic Biology, 55:928--935, 2006.Google ScholarCross Ref
- I. B. Rogozin, Y. I. Wolf, V. N. Babenko, and E. V. Koonin. Dollo parsimony and the reconstruction of genome evolution. In V. A. Albert, editor, Parsimony, Phylogeny, and Genomics. Oxford University Press, 2006.Google ScholarCross Ref
- A. Rokas and P. Holland. Rare genomic changes as a tool for phylogenetics. Trends in Evolution and Ecology, 15:454--459, 2000.Google ScholarCross Ref
- J. Zheng, I. Rogozin, E. Koonin, and T. Przytycka. Support for the coelomata clade of animals from a rigorous analysis of the pattern of intron conservation. Molecular Biology and Evolution, 24:2583--2592, 2007.Google ScholarCross Ref
Index Terms
- Persistent phylogeny: a galled-tree and integer linear programming approach
Recommendations
Recombination and phylogeny: effects and detection
The role of phylogeny in guiding comparative studies is rapidly growing in the post genomic era. Most phylogeny reconstruction methods though, assume a single tree underlying a given alignment of sequences. However, when events such as recombination ...
Mixed Integer Linear Programming for Maximum-Parsimony Phylogeny Inference
Reconstruction of phylogenetic trees is a fundamental problem in computational biology. While excellent heuristic methods are available for many variants of this problem, new advances in phylogeny inference will be required if we are to be able to ...
Constructing the Simplest Possible Phylogenetic Network from Triplets
A phylogenetic network is a directed acyclic graph that visualizes an evolutionary history containing so-called reticulations such as recombinations, hybridizations or lateral gene transfers. Here we consider the construction of a simplest possible ...
Comments