Privacy-Preserving Temporal Record Linkage

Thilina Ranbaduge, Peter Christen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    8 Citations (Scopus)

    Abstract

    Record linkage (RL) is the process of identifying matching records from different databases that refer to the same entity. It is common that the attribute values of records that belong to the same entity do evolve over time, for example people can change their surname or address. Therefore, to identify the records that refer to the same entity over time, RL should make use of temporal information such as the time-stamp of when a record was created and/or update last. However, if RL needs to be conducted on information about people, due to privacy and confidentiality concerns organizations are often not willing or allowed to share sensitive data in their databases, such as personal medical records, or location and financial details, with other organizations. This paper is the first to propose a privacy-preserving temporal record linkage (PPTRL) protocol that can link records across different databases while ensuring the privacy of the sensitive data in these databases. We propose a novel protocol based on Bloom filter encoding which incorporates the temporal information available in records during the linkage process. Our approach uses homomorphic encryption to securely calculate the probabilities of entities changing attribute values in their records over a period of time. Based on these probabilities we generate a set of masking Bloom filters to adjust the similarities between record pairs. We provide a theoretical analysis of the complexity and privacy of our technique and conduct an empirical study on large real databases containing several millions of records. The experimental results show that our approach can achieve better linkage quality compared to non-temporal PPRL while providing privacy to individuals in the databases that are being linked.

    Original languageEnglish
    Title of host publication2018 IEEE International Conference on Data Mining, ICDM 2018
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages377-386
    Number of pages10
    ISBN (Electronic)9781538691588
    DOIs
    Publication statusPublished - 27 Dec 2018
    Event18th IEEE International Conference on Data Mining, ICDM 2018 - Singapore, Singapore
    Duration: 17 Nov 201820 Nov 2018

    Publication series

    NameProceedings - IEEE International Conference on Data Mining, ICDM
    Volume2018-November
    ISSN (Print)1550-4786

    Conference

    Conference18th IEEE International Conference on Data Mining, ICDM 2018
    Country/TerritorySingapore
    CitySingapore
    Period17/11/1820/11/18

    Fingerprint

    Dive into the research topics of 'Privacy-Preserving Temporal Record Linkage'. Together they form a unique fingerprint.

    Cite this