Diverse retrieval via greedy optimization of expected 1-call@k in a latent subtopic relevance model

Scott Sanner*, Shengbo Guo, Thore Graepel, Sadegh Kharazmi, Sarvnaz Karimi

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    15 Citations (Scopus)

    Abstract

    It has been previously observed that optimization of the 1-call@k relevance objective (i.e., a set-based objective that is 1 if at least one document is relevant, otherwise 0) empirically correlates with diverse retrieval. In this paper, we proceed one step further and show theoretically that greedily optimizing expected 1-call@k w.r.t. a latent subtopic model of binary relevance leads to a diverse retrieval algorithm sharing many features of existing diversification approaches. This new result is complementary to a variety of diverse retrieval algorithms derived from alternate rank-based relevance criteria such as average precision and reciprocal rank. As such, the derivation presented here for expected 1-call@k provides a novel theoretical perspective on the emergence of diversity via a latent subtopic model of relevance - an idea underlying both ambiguous and faceted subtopic retrieval that have been used to motivate diverse retrieval.

    Original languageEnglish
    Title of host publicationCIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management
    Pages1977-1980
    Number of pages4
    DOIs
    Publication statusPublished - 2011
    Event20th ACM Conference on Information and Knowledge Management, CIKM'11 - Glasgow, United Kingdom
    Duration: 24 Oct 201128 Oct 2011

    Publication series

    NameInternational Conference on Information and Knowledge Management, Proceedings

    Conference

    Conference20th ACM Conference on Information and Knowledge Management, CIKM'11
    Country/TerritoryUnited Kingdom
    CityGlasgow
    Period24/10/1128/10/11

    Fingerprint

    Dive into the research topics of 'Diverse retrieval via greedy optimization of expected 1-call@k in a latent subtopic relevance model'. Together they form a unique fingerprint.

    Cite this