Improving hierarchical document signature performance by classifier combination

Jieyi Liao*, B. Sumudu U. Mendis, Sukanya Manna

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    We present a classifier-combination experimental framework for part-of-speech (POS) tagging in which four different POS taggers are combined in order to get a better result for sentence similarity using Hierarchical Document Signature (HDS). It is important to abstract information available to form humanly accessible structures. The way people think and talk is hierarchical with limited information presented in any one sentence, and that information is always linked together to further information. As such, HDS is a significant way to represent sentences when finding their similarity. POS tagging plays an important role in HDS. But POS taggers available are not perfect in tagging words in a sentence and tend to tag words improperly if they are either not properly cased or do not match the corpus dataset by which these taggers are trained. Thus, different weighted voting strategies are used to overcome some of these drawbacks of these existing taggers. Comparisons between individual taggers and combined taggers under different voting strategies are made. Their results show that the combined taggers provide better results than the individual ones.

    Original languageEnglish
    Title of host publicationNeural Information Processing
    Subtitle of host publicationTheory and Algorithms - 17th International Conference, ICONIP 2010, Proceedings
    Pages695-702
    Number of pages8
    EditionPART 1
    DOIs
    Publication statusPublished - 2010
    Event17th International Conference on Neural Information Processing, ICONIP 2010 - Sydney, NSW, Australia
    Duration: 22 Nov 201025 Nov 2010

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    NumberPART 1
    Volume6443 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference17th International Conference on Neural Information Processing, ICONIP 2010
    Country/TerritoryAustralia
    CitySydney, NSW
    Period22/11/1025/11/10

    Fingerprint

    Dive into the research topics of 'Improving hierarchical document signature performance by classifier combination'. Together they form a unique fingerprint.

    Cite this