TY - GEN
T1 - Correcting sample selection bias by unlabeled data
AU - Huang, Jiayuan
AU - Smola, Alexander J.
AU - Gretton, Arthur
AU - Borgwardt, Karsten M.
AU - Schölkopf, Bernhard
PY - 2007
Y1 - 2007
N2 - We consider the scenario where training and test data are drawn from different distributions, commonly referred to as sample selection bias. Most algorithms for this setting try to first recover sampling distributions and then make appropriate corrections based on the distribution estimate. We present a nonparametric method which directly produces resampling weights without distribution estimation. Our method works by matching distributions between training and testing sets in feature space. Experimental results demonstrate that our method works well in practice.
AB - We consider the scenario where training and test data are drawn from different distributions, commonly referred to as sample selection bias. Most algorithms for this setting try to first recover sampling distributions and then make appropriate corrections based on the distribution estimate. We present a nonparametric method which directly produces resampling weights without distribution estimation. Our method works by matching distributions between training and testing sets in feature space. Experimental results demonstrate that our method works well in practice.
UR - http://www.scopus.com/inward/record.url?scp=84864031047&partnerID=8YFLogxK
M3 - Conference contribution
SN - 9780262195683
T3 - Advances in Neural Information Processing Systems
SP - 601
EP - 608
BT - Advances in Neural Information Processing Systems 19 - Proceedings of the 2006 Conference
T2 - 20th Annual Conference on Neural Information Processing Systems, NIPS 2006
Y2 - 4 December 2006 through 7 December 2006
ER -