Skip to Main content Skip to Navigation
Book sections

The Concatenation Question

Abstract : Gene tree discordance is now recognized as a major source of biological heterogeneity. How to deal with this heterogeneity is an unsolved problem, as the accurate inference of individual gene tree topologies is difficult. One solution has been to simply concatenate all of the data together, ignoring the underlying heterogeneity. Another approach infers gene tree topologies separately and combines the individual estimates in order to explicitly model this heterogeneity. Here we discuss the advantages and disadvantages of both approaches-using the gene trees singly or in concatenation-paying special attention to the sources of variance and implicit assumptions. We make it clear that all methods are likely to have their assumptions violated, though the consequence of these violations differs in different parts of parameter space. The main conclusion of our review is that different sources of error are more or less important in different settings, such that phylogenetics researchers should be using the methods most appropriate to their problems rather than stick to one dogmatically.
Complete list of metadata

Cited literature [89 references]  Display  Hide  Download
Contributor : Christine Bibal Connect in order to contact the contributor
Submitted on : Friday, April 10, 2020 - 3:16:10 PM
Last modification on : Wednesday, April 15, 2020 - 10:35:05 AM


Files produced by the author(s)


Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License


  • HAL Id : hal-02535651, version 1




David Bryant, Matthew W. Hahn. The Concatenation Question. Scornavacca, Celine; Delsuc, Frédéric; Galtier, Nicolas. Phylogenetics in the Genomic Era, No commercial publisher | Authors open access book, pp.3.4:1--3.4:23, 2020. ⟨hal-02535651⟩



Record views


Files downloads