Abstract
We study conditions under which, given a dictionary F = {f1, . . . , fM} and an i.i.d. sample (Xi,Yi )N i=1, the empirical minimizer in span(F ) relative to the squared loss, satisfies that with high probability R(∼f ERM) ≤ inf f span(F )R(f )+ rN(M),where R() is the squared risk and rN(M) is of the order of M/N. Among other results, we prove that a uniform small-ball estimate for functions in span(F) is enough to achieve that goal when the noise is independent of the design.
Original language | English |
---|---|
Pages (from-to) | 1520-1534 |
Number of pages | 15 |
Journal | Bernoulli |
Volume | 22 |
Issue number | 3 |
DOIs | |
Publication status | Published - Aug 2016 |
Externally published | Yes |