TY - GEN
T1 - Measuring statistical dependence with Hilbert-Schmidt norms
AU - Gretton, Arthur
AU - Bousquet, Olivier
AU - Smola, Alex
AU - Scḧlkopf, Bernhard
PY - 2005
Y1 - 2005
N2 - We propose an independence criterion based on the eigen-spectrum of covariance operators in reproducing kernel Hilbert spaces (RKHSs), consisting of an empirical estimate of the Hilbert-Schmidt norm of the cross-covariance operator (we term this a Hilbert-Schmidt Independence Criterion, or HSIC). This approach has several advantages, compared with previous kernel-based independence criteria. First, the empirical estimate is simpler than any other kernel dependence test, and requires no user-defined regularisation. Second, there is a clearly defined population quantity which the empirical estimate approaches in the large sample limit, with exponential convergence guaranteed between the two: this ensures that independence tests based on HSIC do not suffer from slow learning rates. Finally, we show in the context of independent component analysis (ICA) that the performance of HSIC is competitive with that of previously published kernel-based criteria, and of other recently published ICA methods.
AB - We propose an independence criterion based on the eigen-spectrum of covariance operators in reproducing kernel Hilbert spaces (RKHSs), consisting of an empirical estimate of the Hilbert-Schmidt norm of the cross-covariance operator (we term this a Hilbert-Schmidt Independence Criterion, or HSIC). This approach has several advantages, compared with previous kernel-based independence criteria. First, the empirical estimate is simpler than any other kernel dependence test, and requires no user-defined regularisation. Second, there is a clearly defined population quantity which the empirical estimate approaches in the large sample limit, with exponential convergence guaranteed between the two: this ensures that independence tests based on HSIC do not suffer from slow learning rates. Finally, we show in the context of independent component analysis (ICA) that the performance of HSIC is competitive with that of previously published kernel-based criteria, and of other recently published ICA methods.
UR - http://www.scopus.com/inward/record.url?scp=33646528415&partnerID=8YFLogxK
U2 - 10.1007/11564089_7
DO - 10.1007/11564089_7
M3 - Conference contribution
SN - 354029242X
SN - 9783540292425
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 63
EP - 77
BT - Algorithmic Learning Theory - 16th International Conference, ALT 2005, Proceedings
T2 - 16th International Conference on Algorithmic Learning Theory, ALT 2005
Y2 - 8 October 2005 through 11 October 2005
ER -