Abstract
Choosing the number of components in a finite mixture model is a challenging task. In this article, we study the behaviour of information criteria for selecting the mixture order, based on either the observed likelihood or the complete likelihood including component labels. We propose a new observed likelihood criterion called aicmix, which is shown to be order consistent. We further show that when there is a nontrivial level of classification uncertainty in the true model, complete likelihood criteria asymptotically underestimate the true number of components. A simulation study illustrates the potentially poor finite-sample performance of complete likelihood criteria, while aicmix and the Bayesian information criterion perform strongly regardless of the level of classification uncertainty.
Original language | English |
---|---|
Pages (from-to) | 724-730 |
Number of pages | 7 |
Journal | Biometrika |
Volume | 102 |
Issue number | 3 |
DOIs | |
Publication status | Published - 1 Aug 2015 |
Externally published | Yes |