Extracting significant phrases from text

Yuan J. Lui*, Richard Brent, Ani Calinescu

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Citations (Scopus)

    Abstract

    Prospective readers can quickly determine whether a document is relevant to their information need if the significant phrases (or keyphrases) in this document are provided. Although keyphrases are useful, not many documents have keyphrases assigned to them, and manually assigning keyphrases to existing documents is costly. Therefore, there is a need for automatic keyphrase extraction. This paper introduces a new domain independent keyphrase extraction algorithm. The algorithm approaches the problem of keyphrase extraction as a classification task, and uses a combination of statistical and computational linguistics techniques, a new set of attributes, and a new learning method to distinguish keyphrases from non-keyphrases. The experiments indicate that this algorithm performs at least as well as other keyphrase extraction tools and that it significantly outperforms Microsoft Word 2000's AutoSummarizefeature.

    Original languageEnglish
    Title of host publicationProceedings - 21st International Conference on Advanced Information Networking and ApplicationsWorkshops/Symposia, AINAW'07
    Pages361-366
    Number of pages6
    DOIs
    Publication statusPublished - 2007
    Event21st International Conference on Advanced Information Networking and ApplicationsWorkshops/Symposia, AINAW'07 - Niagara Falls, ON, Canada
    Duration: 21 May 200723 May 2007

    Publication series

    NameProceedings - 21st International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINAW'07
    Volume2

    Conference

    Conference21st International Conference on Advanced Information Networking and ApplicationsWorkshops/Symposia, AINAW'07
    Country/TerritoryCanada
    CityNiagara Falls, ON
    Period21/05/0723/05/07

    Fingerprint

    Dive into the research topics of 'Extracting significant phrases from text'. Together they form a unique fingerprint.

    Cite this