Abstract
High dimension, low sample size data are emerging in various areas of science. We find a common structure underlying many such data sets by using a non-standard type of asymptotics: the dimension tends to ∞ while the sample size is fixed. Our analysis shows a tendency for the data to lie deterministically at the vertices of a regular simplex. Essentially all the randomness in the data appears only as a random rotation of this simplex. This geometric representation is used to obtain several new statistical insights.
Original language | English |
---|---|
Pages (from-to) | 427-444 |
Number of pages | 18 |
Journal | Journal of the Royal Statistical Society. Series B: Statistical Methodology |
Volume | 67 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2005 |