On term selection techniques for patent prior art search

Mona Golestan Far, Scott Sanner, Mohamed Reda Bouadjenek, Gabriela Ferraro, David Hawking

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    21 Citations (Scopus)

    Abstract

    In this paper, we investigate the inuence of term selection on retrieval performance on the CLEF-IP prior art test collection, using the Description section of the patent query with Language Model (LM) and BM25 scoring functions. We find that an oracular relevance feedback system that extracts terms from the judged relevant documents far outperforms the baseline and performs twice as well on MAP as the best competitor in CLEF-IP 2010. We find a very clear term selection value threshold for use when choosing terms. We also noticed that most of the useful feedback terms are actually present in the original query and hypothesized that the baseline system could be substantially improved by removing negative query terms. We tried four simple automated approaches to identify negative terms for query reduction but we were unable to notably improve on the baseline performance with any of them. However, we show that a simple, minimal interactive relevance feedback approach where terms are selected from only the first retrieved relevant document outperforms the best result from CLEF-IP 2010 suggesting the promise of interactive methods for term selection in patent prior art search.

    Original languageEnglish
    Title of host publicationSIGIR 2015 - Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
    PublisherAssociation for Computing Machinery, Inc
    Pages803-806
    Number of pages4
    ISBN (Electronic)9781450336215
    DOIs
    Publication statusPublished - 9 Aug 2015
    Event38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2015 - Santiago, Chile
    Duration: 9 Aug 201513 Aug 2015

    Publication series

    NameSIGIR 2015 - Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

    Conference

    Conference38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2015
    Country/TerritoryChile
    CitySantiago
    Period9/08/1513/08/15

    Fingerprint

    Dive into the research topics of 'On term selection techniques for patent prior art search'. Together they form a unique fingerprint.

    Cite this