Spectral cluster supertree: fast and statistically robust merging of rooted phylogenetic trees

Robert N. McArthur*, Ahad N. Zehmakan, Michael A. Charleston, Yu Lin, Gavin Huttley

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The algorithms for phylogenetic reconstruction are central to computational molecular evolution. The relentless pace of data acquisition has exposed their poor scalability and the conclusion that the conventional application of these methods is impractical and not justifiable from an energy usage perspective. Furthermore, the drive to improve the statistical performance of phylogenetic methods produces increasingly parameter-rich models of sequence evolution, which worsens the computational performance. Established theoretical and algorithmic results identify supertree methods as critical to divide-and-conquer strategies for improving scalability of phylogenetic reconstruction. Of particular importance is the ability to explicitly accommodate rooted topologies. These can arise from the more biologically plausible non-stationary models of sequence evolution. We make a contribution to addressing this challenge with Spectral Cluster Supertree, a novel supertree method for merging a set of overlapping rooted phylogenetic trees. It offers significant improvements over Min-Cut supertree and previous state-of-the-art methods in terms of both time complexity and overall topological accuracy, particularly for problems of large size. We perform comparisons against Min-Cut supertree and Bad Clade Deletion. Leveraging two tree topology distance metrics, we demonstrate that while Bad Clade Deletion generates more correct clades in its resulting supertree, Spectral Cluster Supertree’s generated tree is generally more topologically close to the true model tree. Over large datasets containing 10,000 taxa and (Formula presented.) 500 source trees, where Bad Clade Deletion usually takes (Formula presented.) 2 h to run, our method generates a supertree in on average 20 s. Spectral Cluster Supertree is released under an open source license and is available on the python package index as sc-supertree.

Original languageEnglish
Article number1432495
JournalFrontiers in Molecular Biosciences
Volume11
DOIs
Publication statusPublished - 30 Oct 2024

Fingerprint

Dive into the research topics of 'Spectral cluster supertree: fast and statistically robust merging of rooted phylogenetic trees'. Together they form a unique fingerprint.

Cite this