Branch-length estimation bias misleads molecular dating for a vertebrate mitochondrial phylogeny

Matthew J. Phillips*

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    85 Citations (Scopus)

    Abstract

    Despite recent methodological advances in inferring the time-scale of biological evolution from molecular data, the fundamental question of whether our substitution models are sufficiently well specified to accurately estimate branch-lengths has received little attention. I examine this implicit assumption of all molecular dating methods, on a vertebrate mitochondrial protein-coding dataset. Comparison with analyses in which the data are RY-coded (AG → R; CT → Y) suggests that even rates-across-sites maximum likelihood greatly under-compensates for multiple substitutions among the standard (ACGT) NT-coded data, which has been subject to greater phylogenetic signal erosion. Accordingly, the fossil record indicates that branch-lengths inferred from the NT-coded data translate into divergence time overestimates when calibrated from deeper in the tree. Intriguingly, RY-coding led to the opposite result. The underlying NT and RY substitution model misspecifications likely relate respectively to "hidden" rate heterogeneity and changes in substitution processes across the tree, for which I provide simulated examples. Given the magnitude of the inferred molecular dating errors, branch-length estimation biases may partly explain current conflicts with some palaeontological dating estimates.

    Original languageEnglish
    Pages (from-to)132-140
    Number of pages9
    JournalGene
    Volume441
    Issue number1-2
    DOIs
    Publication statusPublished - 15 Jul 2009

    Fingerprint

    Dive into the research topics of 'Branch-length estimation bias misleads molecular dating for a vertebrate mitochondrial phylogeny'. Together they form a unique fingerprint.

    Cite this