Picture tags and world knowledge: Learning tag relations from visual semantic sources

Lexing Xie, Xuming He

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    20 Citations (Scopus)

    Abstract

    This paper studies the use of everyday words to describe images. The common saying has it that a picture is worth a thousand words, here we ask which thousand? The proliferation of tagged social multimedia data presents a challenge to understanding collective tag-use at large scale { one can ask if patterns from photo tags help understand tag-tag relations, and how it can be leveraged to improve visual search and recognition. We propose a new method to jointly analyze three distinct visual knowledge resources: Flickr, ImageNet/WordNet, and ConceptNet. This allows us to quantify the visual relevance of both tags learn their relationships. We propose a novel network estimation algorithm, Inverse Concept Rank, to infer incomplete tag relationships. We then design an algorithm for image annotation that takes into account both image and tag features. We analyze over 5 million photos with over 20,000 visual tags. The statistics from this collection leads to good results for image tagging, relationship estimation, and generalizing to unseen tags. This is a first step in analyzing picture tags and everyday semantic knowledge. Potential other applications include generating natural language descriptions of pictures, as well as validating and supplementing knowledge databases.

    Original languageEnglish
    Title of host publicationMM 2013 - Proceedings of the 2013 ACM Multimedia Conference
    Pages967-976
    Number of pages10
    DOIs
    Publication statusPublished - 2013
    Event21st ACM International Conference on Multimedia, MM 2013 - Barcelona, Spain
    Duration: 21 Oct 201325 Oct 2013

    Publication series

    NameMM 2013 - Proceedings of the 2013 ACM Multimedia Conference

    Conference

    Conference21st ACM International Conference on Multimedia, MM 2013
    Country/TerritorySpain
    CityBarcelona
    Period21/10/1325/10/13

    Fingerprint

    Dive into the research topics of 'Picture tags and world knowledge: Learning tag relations from visual semantic sources'. Together they form a unique fingerprint.

    Cite this