Tensor term indexing: An application of HOSVD for document summarization

Sukanya Manna*, Zoltán Petres, Tom Gedeon

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    5 Citations (Scopus)

    Abstract

    In this paper, a new method for text summarization is proposed by using an extended version of the Tensor Term Importance (TTI) model. This method summarizes documents by extracting important sentences from a document. It improves the per document summarization efficiency by incorporating additional information of the whole document set referring to the same topic (or coherent documents). The basic idea of this approach is to represent the whole document set in a uniform form, in the term-sentence-document tensor, and to use higher-order singular value decomposition (HOSVD) to highlight the important terms in each document. Here, we present two different methods of summarization. In the first method, the sentences having the highly weighted terms are extracted as the important sentences representing the document. The important sentences identified by selecting those that contains more from the important terms. The second model uses a so-called super sentence and uses that to extract other sentences having high similarity with it. Unlike in Latent Semantic Analysis (LSA) where SVD is applied for compressing the sparse term-document matrix and defining latent semantic links between terms, in TTI SVD is used to reduce noise and to highlight the important term-document relations in the document. Our evaluation results show that our TTI based methods are more similar to human generated summaries than other automated summarizers which work on single documents at a time.

    Original languageEnglish
    Title of host publicationISCIII '09 - 4th International Symposium on Computational Intelligence and Intelligent Informatics, Proceedings
    Pages135-141
    Number of pages7
    DOIs
    Publication statusPublished - 2009
    EventISCIII '09 - 4th International Symposium on Computational Intelligence and Intelligent Informatics - Luxor, Egypt
    Duration: 21 Oct 200925 Oct 2009

    Publication series

    NameISCIII '09 - 4th International Symposium on Computational Intelligence and Intelligent Informatics, Proceedings

    Conference

    ConferenceISCIII '09 - 4th International Symposium on Computational Intelligence and Intelligent Informatics
    Country/TerritoryEgypt
    CityLuxor
    Period21/10/0925/10/09

    Fingerprint

    Dive into the research topics of 'Tensor term indexing: An application of HOSVD for document summarization'. Together they form a unique fingerprint.

    Cite this