TY - JOUR
T1 - Nonparametric bootstrap tests of conditional independence in two-way contingency tables
AU - Hui, Francis K.C.
AU - Geenens, Gery
PY - 2012/11
Y1 - 2012/11
N2 - When analyzing a two-way contingency table, a preliminary question is often whether the categorical variables under study, say R and S, are independent or not. Suppose now that for each individual in the table, a continuous variable X is also known. It is then worth analyzing the table conditionally on X. Contrasting these "local" results to the global unconditional case allows one to go beyond the initial analysis and provide a better understanding of the underlying phenomenon. Recently, Geenens and Simar (2010) [11] have proposed two nonparametric procedures for testing whether R and S are conditionally independent given X, free of any constraining linearity assumptions. However, based on an average of kernel-based estimators, the asymptotic criterion they suggested shows an inflated Type I error (false positive) for small to moderate sample sizes. In this paper, we address this problem by proposing consistent bootstrap versions of the Geenens-Simar test procedures when testing for local independence. A comprehensive simulation study indeed shows the superiority of the bootstrap rejection criterion as compared to the asymptotic criterion in terms of Type I error. It also highlights the advantage of the flexibility guaranteed by the nonparametric Geenens-Simar tests when compared with parametric competitors, e.g.logistic models. The approach is finally illustrated with a real-data example.
AB - When analyzing a two-way contingency table, a preliminary question is often whether the categorical variables under study, say R and S, are independent or not. Suppose now that for each individual in the table, a continuous variable X is also known. It is then worth analyzing the table conditionally on X. Contrasting these "local" results to the global unconditional case allows one to go beyond the initial analysis and provide a better understanding of the underlying phenomenon. Recently, Geenens and Simar (2010) [11] have proposed two nonparametric procedures for testing whether R and S are conditionally independent given X, free of any constraining linearity assumptions. However, based on an average of kernel-based estimators, the asymptotic criterion they suggested shows an inflated Type I error (false positive) for small to moderate sample sizes. In this paper, we address this problem by proposing consistent bootstrap versions of the Geenens-Simar test procedures when testing for local independence. A comprehensive simulation study indeed shows the superiority of the bootstrap rejection criterion as compared to the asymptotic criterion in terms of Type I error. It also highlights the advantage of the flexibility guaranteed by the nonparametric Geenens-Simar tests when compared with parametric competitors, e.g.logistic models. The approach is finally illustrated with a real-data example.
KW - Binary regression
KW - Conditional probabilities
KW - Likelihood ratio
KW - Nadaraya-Watson estimator
KW - Pearson's chi-squared
UR - http://www.scopus.com/inward/record.url?scp=84863533772&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2012.05.015
DO - 10.1016/j.jmva.2012.05.015
M3 - Article
SN - 0047-259X
VL - 112
SP - 130
EP - 144
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
ER -