Corpus Based Classification of Text in Australian Contracts

Michael Curtotti, Eric McCreath

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Written contracts are a fundamental framework for commercial and cooperative transactions and relationships. Limited research has been published on the application of machine learning and natural language processing (NLP) to contracts. In this paper we report the classification of components of contract texts using machine learning and hand-coded methods. Authors studying a range of domains have found that combining machine learning and rule based approaches increases accuracy of machine learning. We find similar results which suggest the utility of considering leveraging hand coded classification rules for machine learning. We attained an average accuracy of 83.48% on a multiclass labelling task on 20 contracts combining machine learning and rule based approaches, increasing performance over machine learning alone.
    Original languageEnglish
    Title of host publicationProceedings of the Australasian Language Technology Association Workshop (ALTA 2010)
    EditorsNitin Indurkhya, Simon Zwarts
    Place of PublicationMelbourne Australia
    PublisherUniversity of Melbourne
    Pages18-26
    EditionPeer Reviewed
    Publication statusPublished - 2010
    EventAustralasian Language Technology Association Workshop (ALTA 2010) - Melbourne Australia, Australia
    Duration: 1 Jan 2010 → …

    Conference

    ConferenceAustralasian Language Technology Association Workshop (ALTA 2010)
    Country/TerritoryAustralia
    Period1/01/10 → …
    OtherDecember 9-10 2010

    Fingerprint

    Dive into the research topics of 'Corpus Based Classification of Text in Australian Contracts'. Together they form a unique fingerprint.

    Cite this