Dirichlet Process with Mixed Random Measures: A Nonparametric Topic Model for Labeled Data

Dongwoo Kim, Suin Kim, Alice Oh

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    18 Citations (Scopus)

    Abstract

    We describe a nonparametric topic model for labeled data. The model uses a mixture of random measures (MRM) as a base distribution of the Dirichlet process (DP) of the HDP framework, so we call it the DPMRM. To model labeled data, we de ne a DP distributed random measure for each la- bel, and the resulting model generates an unbounded number of topics for each label. We apply DP-MRM on single-labeled and multi-labeled corpora of documents and com- pare the performance on label prediction with MedLDA, LDA-SVM, and Labeled-LDA. We further enhance the model by incorporating ddCRP and modeling multi-labeled images for image segmentation and object labeling, comparing the performance with nCuts and rddCRP.
    Original languageEnglish
    Title of host publicationProceedings of the 29th International Conference on Machine Learning
    Place of Publicationunknown
    PublisherAssociation for Computational Linguistics
    EditionPeer Reviewed
    Publication statusPublished - 2012
    Event29th International Conference on Machine Learning (ICML2012) - Edinburgh, Scotland, United Kingdom
    Duration: 1 Jan 2012 → …

    Conference

    Conference29th International Conference on Machine Learning (ICML2012)
    Country/TerritoryUnited Kingdom
    Period1/01/12 → …
    Other26 June - 1 July 2012

    Fingerprint

    Dive into the research topics of 'Dirichlet Process with Mixed Random Measures: A Nonparametric Topic Model for Labeled Data'. Together they form a unique fingerprint.

    Cite this