animal evolution; phylogenomics; phylogenetic resolution; systematic error; nonphylogenetic signal
Abstract :
[en] A recent phylogenomic study reported that the animal phylogeny was unresolved despite the use of 50 genes. This lack of resolution was interpreted as "a positive signature of closely spaced cladogenetic events." Here, we propose that this lack of resolution is rather due to the mutual cancellation of the phylogenetic signal (historical) and the nonphylogenetic signal (due to systematic errors) that results from inadequate taxon sampling and/or model of sequence evolution. Starting with a data set of comparable size, we use 3 different strategies to reduce the nonphylogenetic signal: 1) increasing the number of species; 2) replacing a fast-evolving species by a slowly evolving one; and 3) using a better model of sequence evolution. In all cases, the phylogenetic resolution is markedly improved, in agreement with our hypothesis that the originally reported lack of resolution was artifactual.
Disciplines :
Biochemistry, biophysics & molecular biology
Author, co-author :
Baurain, Denis ; Université de Liège - ULiège > Département des Sciences de la Vie
Brinkmann, Henner; Université de Montréal - UdeM > Canadian Institute for Advanced Research and Département de Biochimie
Philippe, Hervé; Université de Montréal - UdeM > Canadian Institute for Advanced Research and Département de Biochimie
Language :
English
Title :
Lack of resolution in the animal phylogeny: Closely spaced cladogeneses or undetected systematic errors?
Adachi J, Hasegawa M. 1995. Phylogeny of whales: dependence of the inference on species sampling. Mol Biol Evol. 12:177-179.
Aguinaldo AM, Turbeville JM, Linford LS, Rivera MC, Garey JR, Raff RA, Lake JA. 1997. Evidence for a clade of nematodes, arthropods and other moulting animals. Nature. 387:489-493.
Blanquart S, Lartillot N. 2006. A bayesian compound stochastic process for modelling non-stationary and non-homogeneous sequence evolution. Mol Biol Evol. 23:2058-2071.
Delsuc F, Brinkmann H, Chourrout D, Philippe H. 2006. Tunicates and not cephalochordates are the closest living relatives of vertebrates. Nature. 439:965-968.
Delsuc F, Brinkmann H, Philippe H. 2005. Phylogenomics and the reconstruction of the tree of life. Nat Rev Genet. 6:361-375.
Felsenstein J. 2004. Inferring phylogenies. Sunderland (MA): Sinauer Associates, Inc.
Galtier N, Gouy M. 1995. Inferring phylogenies from DNA sequences of unequal base compositions. Proc Natl Acad Sci USA. 92:11317-11321.
Goldstein B, Blaxter M. 2002. Tardigrades. Curr Biol. 12:R475.
Graybeal A. 1998. Is it better to add taxa or characters to a difficult phylogenetic problem? Syst Biol. 47:9-17.
Guindon S, Gascuel O. 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 52:696-704.
Halanych KM. 2004. The new view of animal phylogeny. Annu Rev Ecol Evol Syst. 35:229-256.
Hedtke SM, Townsend TM, Hillis DM. 2006. Resolution of phylogenetic conflict in large data sets by increased taxon sampling. Syst Biol. 55:522-529.
Hendy MD, Penny D. 1989. A framework for the quantitative study of evolutionary trees. Syst Zool. 38:297-309.
Ho SY, Jermiin L. 2004. Tracing the decay of the historical signal in biological sequence data. Syst Biol. 53:623-637.
Hordijk W, Gascuel O. 2005. Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood. Bioinformatics. 21:4338-4347.
Jobb G, von Haeseler A, Strimmer K. 2004. TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics. BMC Evol Biol. 4:18.
Lanave C, Preparata G, Saccone C, Serio G. 1984. A new method for calculating evolutionary substitution rates. J Mol Evol. 20:86-93.
Lartillot N, Brinkmann H, Philippe H. Forthcoming 2006. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol Biol.
Lartillot N, Philippe H. 2004. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol. 21:1095-1109.
Lecointre G, Philippe H, Van Le HL, Le Guyader H. 1993. Species sampling has a major impact on phylogenetic inference. Mol Phylogenet Evol. 2:205-224.
Marletaz F, Martin E, Perez Y, et al. (12 co-authors). 2006. Chaetognath phylogenomics: a protostome with deuterostome-like development. Curr Biol. 16:R577-R578.
Martin AP, Burg TM. 2002. Perils of paralogy: using HSP70 genes for inferring organismal phylogenies. Syst Biol. 51:570-587.
Matus DQ, Copley RR, Dunn CW, Hejnol A, Eccleston H, Halanych KM, Martindale MQ, Telford MJ. 2006. Broad taxon and gene sampling indicate that chaetognaths are protostomes. Curr Biol. 16:R575-R576.
Olsen G. 1987. Earliest phylogenetic branching: comparing rRNA-based evolutionary trees inferred with various techniques. Cold Spring Harb Symp Quant Biol. LII:825-837.
Philippe H, Delsuc F, Brinkmann H, Lartillot N. 2005. Phylogenomics. Annu Rev Ecol Evol Syst. 36:541-562.
Philippe H, Lartillot N, Brinkmann H. 2005. Multigene analyses of bilaterian animals corroborate the monophyly of ecdysozoa, lophotrochozoa, and protostomia. Mol Biol Evol. 22:1246-1253.
Philippe H, Telford MJ. 2006. Large-scale sequencing and the new animal phylogeny. Trends Ecol Evol. 21:614-620.
Philippe H, Zhou Y, Brinkmann H, Rodrigue N, Delsuc F. 2005. Heterotachy and long-branch attraction in phylogenetics. BMC Evol Biol. 5:50.
Robinson DM, Jones DT, Kishino H, Goldman N, Thorne JL. 2003. Protein evolution with dependence among codons due to tertiary structure. Mol Biol Evol. 28:28.
Rokas A, Carroll SB. 2005. More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy. Mol Biol Evol. 22:1337-1344.
Rokas A, Kruger D, Carroll SB. 2005. Animal evolution and the molecular signature of radiations compressed in time. Science. 310:1933-1938.
Rosenberg MS, Kumar S. 2001. Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci USA. 98:10751-10756.
Tuffley C, Steel M. 1998. Modeling the covarion hypothesis of nucleotide substitution. Math Biosci. 147:63-91.
Wheeler WC. 1992. Extinction, sampling, and molecular phylogenetics. In: Novacek MJ, Wheeler QD, editors. Extinction and phylogeny. New York: Columbia University Press. p. 205-215.
Whelan S, Goldman N. 2001. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 18:691-699.
Whelan S, Lio P, Goldman N. 2001. Molecular phylogenetics: state-of-the-art methods for looking into the past. Trends Genet. 17:262-272.
Yang Z. 1993. Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Mol Biol Evol. 10:1396-1401.