TY - UNPB
T1 - An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type
AU - Wang, Jiali
AU - Westveld, Anton
AU - Loong, Bronwyn
AU - Welsh, Alan
PY - 2019/10
Y1 - 2019/10
N2 - Copula-based methods provide a flexible approach to build missing data imputation models of multivariate data of mixed types. However, the choice of copula function is an open question. We consider a Bayesian nonparametric approach by using an infinite mixture of elliptical copulas induced by a Dirichlet process mixture to build a flexible copula function. A slice sampling algorithm is used to sample from the infinite dimensional parameter space. We extend the work on prior parallel tempering used in finite mixture models to the Dirichlet process mixture model to overcome the mixing issue in multimodal distributions. Using simulations, we demonstrate that the infinite mixture copula model provides a better overall fit compared to their single component counterparts, and performs better at capturing tail dependence features of the data. Simulations further show that our proposed model achieves more accurate imputation especially for continuous variables and better inferential results in some analytic models. The proposed model is applied to a medical data set of acute stroke patients in Australia
AB - Copula-based methods provide a flexible approach to build missing data imputation models of multivariate data of mixed types. However, the choice of copula function is an open question. We consider a Bayesian nonparametric approach by using an infinite mixture of elliptical copulas induced by a Dirichlet process mixture to build a flexible copula function. A slice sampling algorithm is used to sample from the infinite dimensional parameter space. We extend the work on prior parallel tempering used in finite mixture models to the Dirichlet process mixture model to overcome the mixing issue in multimodal distributions. Using simulations, we demonstrate that the infinite mixture copula model provides a better overall fit compared to their single component counterparts, and performs better at capturing tail dependence features of the data. Simulations further show that our proposed model achieves more accurate imputation especially for continuous variables and better inferential results in some analytic models. The proposed model is applied to a medical data set of acute stroke patients in Australia
UR - https://arxiv.org/abs/1910.05473
U2 - 10.48550/arXiv.1910.05473
DO - 10.48550/arXiv.1910.05473
M3 - Working paper
BT - An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type
ER -