A probabilistic geocoding system utilising a parcel based address file

Peter Christen*, Alan Willmore, Tim Churches

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    12 Citations (Scopus)

    Abstract

    It is estimated that between 80% and 90% of governmental data collections contain address information, Geocoding - the process of assigning geographic coordinates to addresses - is becoming increasingly important in application areas that involve the analysis and mining of such data. In many cases, address records are captured and/or stored in a free-form or inconsistent manner. This fact complicates the task of accurately matching such addresses to spatially-annotated reference data. In this paper we describe a geocoding system that is based on a comprehensive high-quality geocoded national address database. It uses a learning address parser based on hidden Markov models to segment free-form addresses into components, and a rule-based matching engine to determine the best matches to the reference database.

    Original languageEnglish
    Title of host publicationData Mining
    Subtitle of host publicationTheory, Methodology, Techniques, and Applications
    PublisherSpringer Verlag
    Pages130-145
    Number of pages16
    ISBN (Print)3540325476, 9783540325475
    DOIs
    Publication statusPublished - 2006

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume3755 LNAI
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Fingerprint

    Dive into the research topics of 'A probabilistic geocoding system utilising a parcel based address file'. Together they form a unique fingerprint.

    Cite this