Skip to main navigation Skip to search Skip to main content

CILex: An Investigation of Context Information for Lexical Substitution Methods

Sandaru Seneviratne, Elena Daskalaki, Artem Lenskiy, Hanna Suominen

    Research output: Contribution to journalConference articlepeer-review

    10 Citations (Scopus)

    Abstract

    Lexical substitution, which aims to generate substitutes for a target word given a context, is an important natural language processing task useful in many applications. Due to the paucity of annotated data, existing methods for lexical substitution tend to rely on manually curated lexical resources and contextual word embedding models. Methods based on lexical resources are likely to miss relevant substitutes whereas relying only on contextual word embedding models fails to provide adequate information on the impact of a substitute in the entire context and the overall meaning of the input. We proposed CILex, which uses contextual sentence embeddings along with methods that capture additional Context Information complimenting contextual word embeddings for Lexical substitution. This ensured the semantic consistency of a substitute with the target word while maintaining the overall meaning of the sentence. Our experimental comparisons with previously proposed methods indicated that our solution is now the state-of-the-art on both the widely used LS07 and CoInCo datasets with P@1 scores of 55.96% and 57.25% for lexical substitution. The implementation of the proposed approach is available at https://github.com/sandaruSen/CILex under the MIT license.

    Original languageEnglish
    Pages (from-to)4124-4135
    Number of pages12
    JournalProceedings - International Conference on Computational Linguistics, COLING
    Volume29
    Issue number1
    Publication statusPublished - 2022
    Event29th International Conference on Computational Linguistics, COLING 2022 - Gyeongju, Korea, Republic of
    Duration: 12 Oct 202217 Oct 2022

    Fingerprint

    Dive into the research topics of 'CILex: An Investigation of Context Information for Lexical Substitution Methods'. Together they form a unique fingerprint.

    Cite this