Comparison of Word and Character Level Information for Medical Term Identification Using Convolutional Neural Networks and Transformers

Sandaru Seneviratne, Artem Lenskiy, Christopher Nolan, Eleni Daskalaki, Hanna Suominen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Complexity and domain-specificity make medical text hard to understand for patients and their next of kin. To simplify such text, this paper explored how word and character level information can be leveraged to identify medical terms when training data is limited. We created a dataset of medical and general terms using the Human Disease Ontology from BioPortal and Wikipedia pages. Our results from 10-fold cross validation indicated that convolutional neural networks (CNNs) and transformers perform competitively. The best F score of 93.9% was achieved by a CNN trained on both word and character level embeddings. Statistical significance tests demonstrated that general word embeddings provide rich word representations for medical term identification. Consequently, focusing on words is favorable for medical term identification if using deep learning architectures.

    Original languageEnglish
    Title of host publicationNurses and Midwives in the Digital Age - Selected Papers, Posters and Panels from the 15th International Congress in Nursing Informatics
    EditorsMichelle Honey, Charlene Ronquillo, Ting-Ting Lee, Lucy Westbrooke
    PublisherIOS Press BV
    Pages249-253
    Number of pages5
    ISBN (Electronic)9781643682204
    DOIs
    Publication statusPublished - 15 Dec 2021
    Event15th International Congress in Nursing Informatics: Nurses and Midwives in the Digital Age, NI 2021 - Virtual, Online
    Duration: 23 Aug 20212 Sept 2021

    Publication series

    NameStudies in Health Technology and Informatics
    Volume284
    ISSN (Print)0926-9630
    ISSN (Electronic)1879-8365

    Conference

    Conference15th International Congress in Nursing Informatics: Nurses and Midwives in the Digital Age, NI 2021
    CityVirtual, Online
    Period23/08/212/09/21

    Fingerprint

    Dive into the research topics of 'Comparison of Word and Character Level Information for Medical Term Identification Using Convolutional Neural Networks and Transformers'. Together they form a unique fingerprint.

    Cite this