Abstract
We study conditions under which, given a dictionary F = {f1, . . . , fM} and an i.i.d. sample (Xi,Yi )N i=1, the empirical minimizer in span(F ) relative to the squared loss, satisfies that with high probability R(∼f ERM) ≤ inf f span(F )R(f )+ rN(M),where R() is the squared risk and rN(M) is of the order of M/N. Among other results, we prove that a uniform small-ball estimate for functions in span(F) is enough to achieve that goal when the noise is independent of the design.
| Original language | English |
|---|---|
| Pages (from-to) | 1520-1534 |
| Number of pages | 15 |
| Journal | Bernoulli |
| Volume | 22 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - Aug 2016 |
| Externally published | Yes |