TY - JOUR
T1 - Polymorphism-aware species trees with advanced mutation models, bootstrap, and rate heterogeneity
AU - Schrempf, Dominik
AU - Minh, Bui Quang
AU - Von Haeseler, Arndt
AU - Kosiol, Carolin
N1 - Publisher Copyright:
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
PY - 2019/6/1
Y1 - 2019/6/1
N2 - Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Recently, multispecies coalescent based methods have increased in popularity, however, their application is limited to a small number of species and individuals. We introduced a polymorphism-aware phylogenetic model (PoMo), which overcomes this limitation and scales well with the increasing amount of sequence data whereas accounting for present and ancestral polymorphisms. PoMo circumvents handling of gene trees and directly infers species trees from allele frequency data. Here, we extend the PoMo implementation in IQ-TREE and integrate search for the statistically best-fit mutation model, the ability to infer mutation rate variation across sites, and assessment of branch support values. We exemplify an analysis of a hundred species with ten haploid individuals each, showing that PoMo can perform inference on large data sets. While PoMo is more accurate than standard substitution models applied to concatenated alignments, it is almost as fast. We also provide bmm-simulate, a software package that allows simulation of sequences evolving under PoMo. The new options consolidate the value of PoMo for phylogenetic analyses with population data.
AB - Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Recently, multispecies coalescent based methods have increased in popularity, however, their application is limited to a small number of species and individuals. We introduced a polymorphism-aware phylogenetic model (PoMo), which overcomes this limitation and scales well with the increasing amount of sequence data whereas accounting for present and ancestral polymorphisms. PoMo circumvents handling of gene trees and directly infers species trees from allele frequency data. Here, we extend the PoMo implementation in IQ-TREE and integrate search for the statistically best-fit mutation model, the ability to infer mutation rate variation across sites, and assessment of branch support values. We exemplify an analysis of a hundred species with ten haploid individuals each, showing that PoMo can perform inference on large data sets. While PoMo is more accurate than standard substitution models applied to concatenated alignments, it is almost as fast. We also provide bmm-simulate, a software package that allows simulation of sequences evolving under PoMo. The new options consolidate the value of PoMo for phylogenetic analyses with population data.
KW - Boundary mutation model
KW - Incomplete lineage sorting
KW - Phylogenetics
KW - Polymorphism-aware phylogenetic model
KW - Species tree
UR - http://www.scopus.com/inward/record.url?scp=85066425176&partnerID=8YFLogxK
U2 - 10.1093/molbev/msz043
DO - 10.1093/molbev/msz043
M3 - Article
SN - 0737-4038
VL - 36
SP - 1294
EP - 1301
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
IS - 6
ER -