Abstract
We describe a nonparametric topic model for labeled data. The model uses a mixture of random measures (MRM) as a base distribution of the Dirichlet process (DP) of the HDP framework, so we call it the DPMRM. To model labeled data, we de ne a DP distributed random measure for each la- bel, and the resulting model generates an unbounded number of topics for each label. We apply DP-MRM on single-labeled and multi-labeled corpora of documents and com- pare the performance on label prediction with MedLDA, LDA-SVM, and Labeled-LDA. We further enhance the model by incorporating ddCRP and modeling multi-labeled images for image segmentation and object labeling, comparing the performance with nCuts and rddCRP.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 29th International Conference on Machine Learning |
| Place of Publication | unknown |
| Publisher | Association for Computational Linguistics (ACL) |
| Edition | Peer Reviewed |
| Publication status | Published - 2012 |
| Event | 29th International Conference on Machine Learning (ICML2012) - Edinburgh, Scotland, United Kingdom Duration: 1 Jan 2012 → … |
Conference
| Conference | 29th International Conference on Machine Learning (ICML2012) |
|---|---|
| Country/Territory | United Kingdom |
| Period | 1/01/12 → … |
| Other | 26 June - 1 July 2012 |