Generalization performance of regularization networks and support vector machines via entropy numbers of compact operators

Robert C. Williamson*, Alex J. Smola, Bernhard Schölkopf

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    94 Citations (Scopus)

    Abstract

    We derive new bounds for the generalization error of kernel machines, such as support vector machines and related regularization networks by obtaining new bounds on their covering numbers. The proofs make use of a viewpoint that is apparently novel in the field of statistical learning theory. The hypothesis class is described in terms of a linear operator mapping from a possibly infinite-dimensional unit ball in feature space into a finite-dimensional space. The covering numbers of the class are then determined via the entropy numbers of the operator. These numbers, which characterize the degree of compactness of the operator, can be bounded in terms of the eigenvalues of an integral operator induced by the kernel function used by the machine. As a consequence, we are able to theoretically explain the effect of the choice of kernel function on the generalization performance of support vector machines.

    Original languageEnglish
    Pages (from-to)2516-2532
    Number of pages17
    JournalIEEE Transactions on Information Theory
    Volume47
    Issue number6
    DOIs
    Publication statusPublished - Sept 2001

    Fingerprint

    Dive into the research topics of 'Generalization performance of regularization networks and support vector machines via entropy numbers of compact operators'. Together they form a unique fingerprint.

    Cite this