SentiCap: Generating image descriptions with sentiments

Alexander Mathews, Lexing Xie, Xuming He

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    163 Citations (Scopus)

    Abstract

    The recent progress on image recognition and language modeling is making automatic description of image content a reality. However, stylized, non-factual aspects of the written description are missing from the current systems. One such style is descriptions with emotions, which is commonplace in everyday communication, and influences decision-making and interpersonal relationships. We design a system to describe an image with emotions, and present a model that automatically generates captions with positive or negative sentiments. We propose a novel switching recurrent neural network with word-level regularization, which is able to produce emotional image captions using only 2000+ training sentences containing sentiments. We evaluate the captions with different automatic and crowd-sourcing metrics. Our model compares favourably in common quality metrics for image captioning. In 84.6% of cases the generated positive captions were judged as being at least as descriptive as the factual captions. Of these positive captions 88% were confirmed by the crowd-sourced workers as having the appropriate sentiment.

    Original languageEnglish
    Title of host publication30th AAAI Conference on Artificial Intelligence, AAAI 2016
    PublisherAAAI Press
    Pages3574-3580
    Number of pages7
    ISBN (Electronic)9781577357605
    Publication statusPublished - 2016
    Event30th AAAI Conference on Artificial Intelligence, AAAI 2016 - Phoenix, United States
    Duration: 12 Feb 201617 Feb 2016

    Publication series

    Name30th AAAI Conference on Artificial Intelligence, AAAI 2016

    Conference

    Conference30th AAAI Conference on Artificial Intelligence, AAAI 2016
    Country/TerritoryUnited States
    CityPhoenix
    Period12/02/1617/02/16

    Fingerprint

    Dive into the research topics of 'SentiCap: Generating image descriptions with sentiments'. Together they form a unique fingerprint.

    Cite this