TY - JOUR
T1 - Tail density estimation for exploratory data analysis using kernel methods
AU - Béranger, B.
AU - Duong, T.
AU - Perkins-Kirkpatrick, S. E.
AU - Sisson, S. A.
N1 - Publisher Copyright:
© 2018, © American Statistical Association and Taylor & Francis 2018.
PY - 2019/1/2
Y1 - 2019/1/2
N2 - It is often critical to accurately model the upper tail behaviour of a random process. Nonparametric density estimation methods are commonly implemented as exploratory data analysis techniques for this purpose and can avoid model specification biases implied by using parametric estimators. In particular, kernel-based estimators place minimal assumptions on the data, and provide improved visualisation over scatterplots and histograms. However kernel density estimators can perform poorly when estimating tail behaviour above a threshold, and can over-emphasise bumps in the density for heavy tailed data. We develop a transformation kernel density estimator which is able to handle heavy tailed and bounded data, and is robust to threshold choice. We derive closed form expressions for its asymptotic bias and variance, which demonstrate its good performance in the tail region. Finite sample performance is illustrated in numerical studies, and in an expanded analysis of the performance of global climate models.
AB - It is often critical to accurately model the upper tail behaviour of a random process. Nonparametric density estimation methods are commonly implemented as exploratory data analysis techniques for this purpose and can avoid model specification biases implied by using parametric estimators. In particular, kernel-based estimators place minimal assumptions on the data, and provide improved visualisation over scatterplots and histograms. However kernel density estimators can perform poorly when estimating tail behaviour above a threshold, and can over-emphasise bumps in the density for heavy tailed data. We develop a transformation kernel density estimator which is able to handle heavy tailed and bounded data, and is robust to threshold choice. We derive closed form expressions for its asymptotic bias and variance, which demonstrate its good performance in the tail region. Finite sample performance is illustrated in numerical studies, and in an expanded analysis of the performance of global climate models.
KW - Climate extremes
KW - exploratory data analysis
KW - global climate models
KW - histograms
KW - model selection
KW - multivariate kernel density estimation
UR - http://www.scopus.com/inward/record.url?scp=85056105910&partnerID=8YFLogxK
U2 - 10.1080/10485252.2018.1537442
DO - 10.1080/10485252.2018.1537442
M3 - Article
AN - SCOPUS:85056105910
SN - 1048-5252
VL - 31
SP - 144
EP - 174
JO - Journal of Nonparametric Statistics
JF - Journal of Nonparametric Statistics
IS - 1
ER -