Abstract
We present the heirarchical Dirichlet scaling process (HDSP), a Bayesian nonparametric mixed membership model for multi-labeled data. We construct the HDSP based on the gamma representation of the hierarchical Dirichlet process (HDP) which allows scaling the micture components. With such construction, HDSP allocates a latent location to each label and micture component in a space, and uses the distance between them to guide membership probabilities. We develop a variational Bayes algorithm for the approximate posterior inference of the HDSP. Though experiments on synthetic datasets as well as datasets of newswire, medical journal articles and Wikipedia, we show that the HDSP results in better predictive performance that HDP, labeled LDA and partially labeled LDA.
Original language | English |
---|---|
Title of host publication | 31st International Conference on Machine Learning, ICML 2014 |
Place of Publication | Online |
Publisher | JMLR - Journal of Machine Learning |
Edition | Peer Reviewed |
ISBN (Print) | 9781634393973 |
DOIs | |
Publication status | Published - 2014 |
Event | 31st International Conference on Machine Learning, ICML 2014 - Beijing, China, China Duration: 1 Jan 2014 → … |
Conference
Conference | 31st International Conference on Machine Learning, ICML 2014 |
---|---|
Country/Territory | China |
Period | 1/01/14 → … |
Other | June 21-26 2014 |