TY - GEN
T1 - Kernel methods for missing variables
AU - Smola, Alex J.
AU - Vishwanathan, S. V.N.
AU - Hofmann, Thomas
PY - 2005
Y1 - 2005
N2 - We present methods for dealing with missing variables in the context of Gaussian Processes and Support Vector Machines. This solves an important problem which has largely been ignored by kernel methods: How to systematically deal with incomplete data? Our method can also be applied to problems with partially observed labels as well as to the transductive setting where we view the labels as missing data. Our approach relies on casting kernel methods as an estimation problem in exponential families. Hence, estimation with missing variables becomes a problem of computing marginal distributions, and finding efficient optimization methods. To that extent we propose an optimization scheme which extends the Concave Convex Procedure (CCP) of Yuille and Rangarajan, and present a simplified and intuitive proof of its convergence. We show how our algorithm can be specialized to various cases in order to efficiently solve the optimization problems that arise. Encouraging preliminary experimental results on the USPS dataset are also presented.
AB - We present methods for dealing with missing variables in the context of Gaussian Processes and Support Vector Machines. This solves an important problem which has largely been ignored by kernel methods: How to systematically deal with incomplete data? Our method can also be applied to problems with partially observed labels as well as to the transductive setting where we view the labels as missing data. Our approach relies on casting kernel methods as an estimation problem in exponential families. Hence, estimation with missing variables becomes a problem of computing marginal distributions, and finding efficient optimization methods. To that extent we propose an optimization scheme which extends the Concave Convex Procedure (CCP) of Yuille and Rangarajan, and present a simplified and intuitive proof of its convergence. We show how our algorithm can be specialized to various cases in order to efficiently solve the optimization problems that arise. Encouraging preliminary experimental results on the USPS dataset are also presented.
UR - http://www.scopus.com/inward/record.url?scp=84862623799&partnerID=8YFLogxK
M3 - Conference contribution
SN - 097273581X
SN - 9780972735810
T3 - AISTATS 2005 - Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics
SP - 325
EP - 332
BT - AISTATS 2005 - Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics
T2 - 10th International Workshop on Artificial Intelligence and Statistics, AISTATS 2005
Y2 - 6 January 2005 through 8 January 2005
ER -