Geometric representation of high dimension, low sample size data

Peter Hall, J. S. Marron*, Amnon Neeman

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    347 Citations (Scopus)

    Abstract

    High dimension, low sample size data are emerging in various areas of science. We find a common structure underlying many such data sets by using a non-standard type of asymptotics: the dimension tends to ∞ while the sample size is fixed. Our analysis shows a tendency for the data to lie deterministically at the vertices of a regular simplex. Essentially all the randomness in the data appears only as a random rotation of this simplex. This geometric representation is used to obtain several new statistical insights.

    Original languageEnglish
    Pages (from-to)427-444
    Number of pages18
    JournalJournal of the Royal Statistical Society. Series B: Statistical Methodology
    Volume67
    Issue number3
    DOIs
    Publication statusPublished - 2005

    Fingerprint

    Dive into the research topics of 'Geometric representation of high dimension, low sample size data'. Together they form a unique fingerprint.

    Cite this