TY - JOUR
T1 - On aggregation for heavy-tailed classes
AU - Mendelson, Shahar
N1 - Publisher Copyright: © 2016, Springer-Verlag Berlin Heidelberg.
PY - 2017/8/1
Y1 - 2017/8/1
N2 - We introduce an alternative to the notion of ‘fast rate’ in Learning Theory, which coincides with the optimal error rate when the given class happens to be convex and regular in some sense. While it is well known that such a rate cannot always be attained by a learning procedure (i.e., a procedure that selects a function in the given class), we introduce an aggregation procedure that attains that rate under rather minimal assumptions: for example, that the $L_q$ and $L_2$ norms are equivalent on the linear span of the class for some $q > 2$, and the target random variable is square-integrable. The key components in the proof include a two-sided isomorphic estimator on distances between class members, which is based on the median-of-means; and an almost isometric lower bound of the form $N^{-1}\sum_{i=1}^{N} f^2(X_i) \geq (1-\zeta)\mathbb{E}f^2$ which holds uniformly in the class. Both results only require that the $L_q$ and $L_2$ norms are equivalent on the linear span of the class for some $q > 2$.
AB - We introduce an alternative to the notion of ‘fast rate’ in Learning Theory, which coincides with the optimal error rate when the given class happens to be convex and regular in some sense. While it is well known that such a rate cannot always be attained by a learning procedure (i.e., a procedure that selects a function in the given class), we introduce an aggregation procedure that attains that rate under rather minimal assumptions: for example, that the $L_q$ and $L_2$ norms are equivalent on the linear span of the class for some $q > 2$, and the target random variable is square-integrable. The key components in the proof include a two-sided isomorphic estimator on distances between class members, which is based on the median-of-means; and an almost isometric lower bound of the form $N^{-1}\sum_{i=1}^{N} f^2(X_i) \geq (1-\zeta)\mathbb{E}f^2$ which holds uniformly in the class. Both results only require that the $L_q$ and $L_2$ norms are equivalent on the linear span of the class for some $q > 2$.
KW - 60G25
KW - 62G99
KW - 68T01
UR - http://www.scopus.com/inward/record.url?scp=84975524920&partnerID=8YFLogxK
U2 - 10.1007/s00440-016-0720-6
DO - 10.1007/s00440-016-0720-6
M3 - Article
SN - 0178-8051
VL - 168
SP - 641
EP - 674
JO - Probability Theory and Related Fields
JF - Probability Theory and Related Fields
IS - 3-4
ER -