A Token-Wise CNN-Based Method for Sentence Compression

Weiwei Hou*, Hanna Suominen, Piotr Koniusz, Sabrina Caldwell, Tom Gedeon

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    2 Citations (Scopus)

    Abstract

    Sentence compression is a Natural Language Processing (NLP) task aimed at shortening original sentences and preserving their key information. Its applications can benefit many fields e.g., one can build tools for language education. However, current methods are largely based on Recurrent Neural Network (RNN) models which suffer from poor processing speed. To address this issue, in this paper, we propose a token-wiseConvolutional Neural Network, a CNN-based model along with pre-trained Bidirectional Encoder Representations from Transformers (BERT) features for deletion-based sentence compression. We also compare our model with RNN-based models and fine-tuned BERT. Although one of the RNN-based models outperforms marginally other models given the same input, our CNN-based model was ten times faster than the RNN-based approach.

    Original languageEnglish
    Title of host publicationNeural Information Processing - 27th International Conference, ICONIP 2020, Proceedings
    EditorsHaiqin Yang, Kitsuchart Pasupa, Andrew Chi-Sing Leung, James T. Kwok, Jonathan H. Chan, Irwin King
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages668-679
    Number of pages12
    ISBN (Print)9783030638290
    DOIs
    Publication statusPublished - 2020
    Event27th International Conference on Neural Information Processing, ICONIP 2020 - Bangkok, Thailand
    Duration: 18 Nov 202022 Nov 2020

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume12532 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference27th International Conference on Neural Information Processing, ICONIP 2020
    Country/TerritoryThailand
    CityBangkok
    Period18/11/2022/11/20

    Fingerprint

    Dive into the research topics of 'A Token-Wise CNN-Based Method for Sentence Compression'. Together they form a unique fingerprint.

    Cite this