Privacy-preserving data linkage and geocoding: Current approaches and research directions

Peter Christen*

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    21 Citations (Scopus)

    Abstract

    Data linkage is the task of matching and aggregating records that relate to the same entity from one or more data sets. A related technique is geocoding, the matching of addresses to their geographic locations. As data linkage is often based on personal information (like names and addresses), privacy and confidentiality are of paramount importance. In this paper we present an overview of current approaches to privacy-preserving data linkage, and discuss their limitations. Using real-world scenarios we illustrate the significance of developing improved techniques for automated, large scale and distributed privacy-preserving linking and geocoding. We then discuss four core research areas that need to be addressed in order to make linking and geocoding of large confidential data collections feasible.

    Original languageEnglish
    Title of host publicationProceedings - ICDM Workshops 2006 - 6th IEEE International Conference on Data Mining - Workshops
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages497-501
    Number of pages5
    ISBN (Print)0769527027, 9780769527024
    DOIs
    Publication statusPublished - 2006

    Publication series

    NameProceedings - IEEE International Conference on Data Mining, ICDM
    ISSN (Print)1550-4786

    Fingerprint

    Dive into the research topics of 'Privacy-preserving data linkage and geocoding: Current approaches and research directions'. Together they form a unique fingerprint.

    Cite this