TY - JOUR
T1 - On properties of functional principal components analysis
AU - Hall, Peter
AU - Hosseini-Nasab, Mohammad
PY - 2006/2
Y1 - 2006/2
N2 - Functional data analysis is intrinsically infinite dimensional; functional principal component analysis reduces dimension to a finite level, and points to the most significant components of the data. However, although this technique is often discussed, its properties are not as well understood as they might be. We show how the properties of functional principal component analysis can be elucidated through stochastic expansions and related results. Our approach quantifies the errors that arise through statistical approximation, in successive terms of orders n-1/2, n-1, n-3/2,. . . , where n denotes sample size. The expansions show how spacings among eigenvalues impact on statistical performance. The term of size n1/2 illustrates first-order properties and leads directly to limit theory which describes the dominant effect of spacings. Thus, for example, spacings are seen to have an immediate, first-order effect on properties of eigen-function estimators, but only a second-order effect on eigenvalue estimators. Our results can be used to explore properties of existing methods, and also to suggest new techniques. In particular, we suggest bootstrap methods for constructing simultaneous confidence regions for an infinite number of eigenvalues, and also for individual eigenvalues and eigenvectors.
AB - Functional data analysis is intrinsically infinite dimensional; functional principal component analysis reduces dimension to a finite level, and points to the most significant components of the data. However, although this technique is often discussed, its properties are not as well understood as they might be. We show how the properties of functional principal component analysis can be elucidated through stochastic expansions and related results. Our approach quantifies the errors that arise through statistical approximation, in successive terms of orders n-1/2, n-1, n-3/2,. . . , where n denotes sample size. The expansions show how spacings among eigenvalues impact on statistical performance. The term of size n1/2 illustrates first-order properties and leads directly to limit theory which describes the dominant effect of spacings. Thus, for example, spacings are seen to have an immediate, first-order effect on properties of eigen-function estimators, but only a second-order effect on eigenvalue estimators. Our results can be used to explore properties of existing methods, and also to suggest new techniques. In particular, we suggest bootstrap methods for constructing simultaneous confidence regions for an infinite number of eigenvalues, and also for individual eigenvalues and eigenvectors.
KW - Confidence interval
KW - Cross-validation
KW - Eigenfunction
KW - Eigenvalue
KW - Linear regression
KW - Operator theory
KW - Principal component analysis
KW - Simultaneous confidence region
UR - http://www.scopus.com/inward/record.url?scp=33645039219&partnerID=8YFLogxK
U2 - 10.1111/j.1467-9868.2005.00535.x
DO - 10.1111/j.1467-9868.2005.00535.x
M3 - Article
SN - 1369-7412
VL - 68
SP - 109
EP - 126
JO - Journal of the Royal Statistical Society. Series B: Statistical Methodology
JF - Journal of the Royal Statistical Society. Series B: Statistical Methodology
IS - 1
ER -