Twitter opinion topic model: Extracting product opinions from tweets by leveraging hashtags and sentiment lexicon

Kar Wai Lim, Wray Buntine

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    62 Citations (Scopus)

    Abstract

    Aspect-based opinion mining is widely applied to review data to aggregate or summarize opinions of a product, and the current state-of-the-art is achieved with Latent Dirichlet Allocation (LDA)-based model. Although social media data like tweets are laden with opinions, their "dirty" nature (as natural language) has discouraged researchers from applying LDA-based opinion model for product review mining. Tweets are often informal, unstructured and lacking labeled data such as categories and ratings, making it challenging for product opinion mining. In this paper, we propose an LDA-based opinion model named Twitter Opinion Topic Model (TOTM) for opinion mining and sentiment analysis. TOTM leverages hashtags, mentions, emoticons and strong sentiment words that are present in tweets in its discovery process. It improves opinion prediction by modeling the target-opinion interaction directly, thus discovering target specific opinion words, neglected in existing approaches. Moreover, we propose a new formulation of incorporating sentiment prior information into a topic model, by utilizing an existing public sentiment lexicon. This is novel in that it learns and updates with the data. We conduct experiments on 9 million tweets on electronic products, and demonstrate the improved performance of TOTM in both quantitative evaluations and qualitative analysis. We show that aspect-based opinion analysis on massive volume of tweets provides useful opinions on products.

    Original languageEnglish
    Title of host publicationCIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management
    PublisherAssociation for Computing Machinery (ACM)
    Pages1319-1328
    Number of pages10
    ISBN (Electronic)9781450325981
    DOIs
    Publication statusPublished - 3 Nov 2014
    Event23rd ACM International Conference on Information and Knowledge Management, CIKM 2014 - Shanghai, China
    Duration: 3 Nov 20147 Nov 2014

    Publication series

    NameCIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management

    Conference

    Conference23rd ACM International Conference on Information and Knowledge Management, CIKM 2014
    Country/TerritoryChina
    CityShanghai
    Period3/11/147/11/14

    Fingerprint

    Dive into the research topics of 'Twitter opinion topic model: Extracting product opinions from tweets by leveraging hashtags and sentiment lexicon'. Together they form a unique fingerprint.

    Cite this