TY - GEN
T1 - Evidence of multiple maximum likelihood points for a phylogenetic tree
AU - Zhou, B. B.
AU - Tarawneh, M.
AU - Wang, P.
AU - Chu, D.
AU - Wang, C.
AU - Zomaya, A. Y.
AU - Brent, R. P.
PY - 2006
Y1 - 2006
N2 - An interesting and important, but largely ignored question associated with the ML method is whether there exists only a single maximum likelihood point for a given phylogenetic tree. Mike Steel presented a simple analytical result to argue that the ML point is not unique [11]. However, his view so far attracts only little attention. Though many researchers believe that multiple maximum likelihood points may exist for certain phylogenetic trees, most existing phylogenetic construction programs only produce a single best tree under the ML criterion and in practice many researchers still use only the ML values to make judgment on the quality of different trees for a given problem. In this paper we present some experimental results from a large number of synthetic test data sets and show that it is quite common that certain incorrect trees can have likelihood values at least as large as that of the correct tree. A significant implication of this is that even if we are able to find a truly globally optimal tree under the maximum likelihood criterion, this tree may not necessarily be the correct phylogenetic tree. In the paper we also show that our newly developed algorithm can perform much better in terms of accuracy than well known algorithms such as FASTDNAML and PHYML by constructing only a few more trees for a given problem.
AB - An interesting and important, but largely ignored question associated with the ML method is whether there exists only a single maximum likelihood point for a given phylogenetic tree. Mike Steel presented a simple analytical result to argue that the ML point is not unique [11]. However, his view so far attracts only little attention. Though many researchers believe that multiple maximum likelihood points may exist for certain phylogenetic trees, most existing phylogenetic construction programs only produce a single best tree under the ML criterion and in practice many researchers still use only the ML values to make judgment on the quality of different trees for a given problem. In this paper we present some experimental results from a large number of synthetic test data sets and show that it is quite common that certain incorrect trees can have likelihood values at least as large as that of the correct tree. A significant implication of this is that even if we are able to find a truly globally optimal tree under the maximum likelihood criterion, this tree may not necessarily be the correct phylogenetic tree. In the paper we also show that our newly developed algorithm can perform much better in terms of accuracy than well known algorithms such as FASTDNAML and PHYML by constructing only a few more trees for a given problem.
UR - http://www.scopus.com/inward/record.url?scp=34547450499&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2006.253334
DO - 10.1109/BIBE.2006.253334
M3 - Conference contribution
SN - 0769527272
SN - 9780769527277
T3 - Proceedings - Sixth IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
SP - 193
EP - 197
BT - Proceedings - Sixth IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
T2 - 6th IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
Y2 - 16 October 2006 through 18 October 2006
ER -