Efficient pattern mining based cryptanalysis for privacy-preserving record linkage

Anushka Vidanage, Thilina Ranbaduge, Peter Christen, Rainer Schnell

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    27 Citations (Scopus)

    Abstract

    Privacy-preserving record linkage (PPRL) is the process of identifying records that correspond to the same entities across several databases without revealing any sensitive information about these entities. One popular PPRL technique is Bloom filter (BF) encoding, with first applications of BF based PPRL now being employed in real-world linkage applications. Here we present a cryptanalysis attack that can re-identify attribute values encoded in BFs. Our method applies maximal frequent itemset mining on a BF database to first identify sets of frequently co-occurring bit positions that correspond to encoded frequent q-grams (character substrings extracted from plain-text values). Using a language model, we then identify additional q-grams by applying pattern mining on subsets of BFs that encode a previously identified frequent q-gram. Experiments on a real database show that our attack can successfully re-identify sensitive values even when each BF in a database is unique.

    Original languageEnglish
    Title of host publicationProceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019
    PublisherIEEE Computer Society
    Pages1698-1701
    Number of pages4
    ISBN (Electronic)9781538674741
    DOIs
    Publication statusPublished - Apr 2019
    Event35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, China
    Duration: 8 Apr 201911 Apr 2019

    Publication series

    NameProceedings - International Conference on Data Engineering
    Volume2019-April
    ISSN (Print)1084-4627

    Conference

    Conference35th IEEE International Conference on Data Engineering, ICDE 2019
    Country/TerritoryChina
    CityMacau
    Period8/04/1911/04/19

    Fingerprint

    Dive into the research topics of 'Efficient pattern mining based cryptanalysis for privacy-preserving record linkage'. Together they form a unique fingerprint.

    Cite this