TY - JOUR
T1 - Selecting taxa to save or sequence
T2 - Desirable criteria and a greedy solution
AU - Bordewich, Magnus
AU - Rodrigo, Allen G.
AU - Semple, Charles
PY - 2008/12
Y1 - 2008/12
N2 - Three desirable properties for any method of selecting a subset of evolutionary units (EUs) for conservation or for genomic sequencing are discussed. These properties are spread, stability, and applicability. We are motivated by a practical case in which the maximization of phylogenetic diversity (PD), which has been suggested as a suitable method, appears to lead to counterintuitive collections of EUs and does not meet these three criteria. We define a simple greedy algorithm (GreedyMMD) as a close approximation to choosing the subset that maximizes the minimum pairwise distance (MMD) between EUs. GreedyMMD satisfies our three criteria and may be a useful alternative to PD in real-world situations. In particular, we show that this method of selection is suitable under a model of biodiversity in which features arise and/or disappear during evolution. We also show that if distances between EUs satisfy the ultrametric condition, then GreedyMMD delivers an optimal subset of EUs that maximizes both the minimum pairwise distance and the PD. Finally, because GreedyMMD works with distances and does not require a tree, it is readily applicable to many data sets.
AB - Three desirable properties for any method of selecting a subset of evolutionary units (EUs) for conservation or for genomic sequencing are discussed. These properties are spread, stability, and applicability. We are motivated by a practical case in which the maximization of phylogenetic diversity (PD), which has been suggested as a suitable method, appears to lead to counterintuitive collections of EUs and does not meet these three criteria. We define a simple greedy algorithm (GreedyMMD) as a close approximation to choosing the subset that maximizes the minimum pairwise distance (MMD) between EUs. GreedyMMD satisfies our three criteria and may be a useful alternative to PD in real-world situations. In particular, we show that this method of selection is suitable under a model of biodiversity in which features arise and/or disappear during evolution. We also show that if distances between EUs satisfy the ultrametric condition, then GreedyMMD delivers an optimal subset of EUs that maximizes both the minimum pairwise distance and the PD. Finally, because GreedyMMD works with distances and does not require a tree, it is readily applicable to many data sets.
KW - Biodiversity conservation
KW - Greedy algorithm
KW - Phylogenetic diversity
UR - http://www.scopus.com/inward/record.url?scp=58149234724&partnerID=8YFLogxK
U2 - 10.1080/10635150802552831
DO - 10.1080/10635150802552831
M3 - Article
SN - 1063-5157
VL - 57
SP - 825
EP - 834
JO - Systematic Biology
JF - Systematic Biology
IS - 6
ER -