TY - JOUR
T1 - Estimating a changepoint, boundary, or frontier in the presence of observation error
AU - Hall, P.
AU - Simar, L.
PY - 2002
Y1 - 2002
N2 - A range of problems in economics and statistics involve calculation of the boundary, or frontier, of the support of a distribution. Several practical and attractive solutions exist if the sampled distribution has a sharp discontinuity at the frontier, but accuracy can be greatly diminished if the data are observed with error. Indeed, if the error is additive and has variance σ2 then inaccuracies are usually of order σ, for small σ. In this article we suggest an elementary method for reducing the effect of error to O(σ2), and show that refinements can improve accuracy still further, to O(σ3) or less. The problem is inherently ill-posed, however, to such an extent that the frontier is generally not even identifiable unless the error distribution is known. The latter assumption is unreasonable in most practical settings, not in the least because the error is often asymmetrically distributed. For example, in the context of productivity analysis the error distribution tends to have a longer tail in the direction of underestimation of production. Nevertheless, even when the error distribution is unknown, it is often tree that error variance is relatively low, and so methods for reducing systematic error in that case are useful in practice.
AB - A range of problems in economics and statistics involve calculation of the boundary, or frontier, of the support of a distribution. Several practical and attractive solutions exist if the sampled distribution has a sharp discontinuity at the frontier, but accuracy can be greatly diminished if the data are observed with error. Indeed, if the error is additive and has variance σ2 then inaccuracies are usually of order σ, for small σ. In this article we suggest an elementary method for reducing the effect of error to O(σ2), and show that refinements can improve accuracy still further, to O(σ3) or less. The problem is inherently ill-posed, however, to such an extent that the frontier is generally not even identifiable unless the error distribution is known. The latter assumption is unreasonable in most practical settings, not in the least because the error is often asymmetrically distributed. For example, in the context of productivity analysis the error distribution tends to have a longer tail in the direction of underestimation of production. Nevertheless, even when the error distribution is unknown, it is often tree that error variance is relatively low, and so methods for reducing systematic error in that case are useful in practice.
KW - Changepoint analysis
KW - Data envelopment analysis
KW - Deconvolution
KW - Frontier estimation
KW - Kernel methods
KW - Nonparametric estimation
KW - Panel data
KW - Production frontier
UR - http://www.scopus.com/inward/record.url?scp=0035998822&partnerID=8YFLogxK
U2 - 10.1198/016214502760047050
DO - 10.1198/016214502760047050
M3 - Article
SN - 0162-1459
VL - 97
SP - 523
EP - 534
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 458
ER -