Pairwise FastText Classifier for Entity Disambiguation

Cheng Yu, Bing Chu, Rohit Ram, James Aichinger, Lizhen Qu, Hanna Suominen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    For the Australasian Language Technology Association (ALTA) 2016 Shared Task, we devised Pairwise FastText Classifier (PFC), an efficient embedding-based text classifier, and used it for entity disambiguation. Compared with a few baseline algorithms, PFC achieved a higher F1 score at 0.72 (under the team name BCJR). To generalise the model, we also created a method to bootstrap the training set deterministically without human labelling and at no financial cost. By releasing PFC and the dataset augmentation software to the public1, we hope to invite more collaboration.
    Original languageEnglish
    Title of host publicationProceedings of Australasian Language Technology Association Workshop 2016 Workshop
    EditorsTrevor Cohn
    Place of PublicationPennsylvania, USA
    PublisherAssociation for Computational Linguistics
    Pages175−179pp
    EditionPeer Reviewed
    ISBN (Print)9781510833166
    Publication statusPublished - 2016
    EventAustralasian Language Technology Association Workshop (ALTA 2016) - Caulfield, Australia
    Duration: 1 Jan 2016 → …
    http://alta2016.alta.asn.au/U16/U16-1.pdf

    Conference

    ConferenceAustralasian Language Technology Association Workshop (ALTA 2016)
    Period1/01/16 → …
    OtherDecember 5–7 2016
    Internet address

    Fingerprint

    Dive into the research topics of 'Pairwise FastText Classifier for Entity Disambiguation'. Together they form a unique fingerprint.

    Cite this