A graph matching method for historical census household linkage

Zhichun Fu, Peter Christen, Jun Zhou

    Research output: Contribution to journalConference articlepeer-review

    19 Citations (Scopus)


    Linking historical census data across time is a challenging task due to various reasons, including data quality, limited individual information, and changes to households over time. Although most census data linking methods link records that correspond to individual household members, recent advances show that linking households as a whole provide more accurate results and less multiple household links. In this paper, we introduce a graph-based method to link households, which takes the structural relationship between household members into consideration. Based on individual record linking results, our method builds a graph for each household, so that the matches are determined by both attribute-level and record-relationship similarity. Our experimental results on both synthetic and real historical census data have validated the effectiveness of this method. The proposed method achieves an F-measure of 0.937 on data extracted from real UK census datasets, outperforming all alternative methods being compared.

    Original languageEnglish
    Pages (from-to)485-496
    Number of pages12
    JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume8443 LNAI
    Issue numberPART 1
    Publication statusPublished - 2014
    Event18th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2014 - Tainan, Taiwan, Province of China
    Duration: 13 May 201416 May 2014


    Dive into the research topics of 'A graph matching method for historical census household linkage'. Together they form a unique fingerprint.

    Cite this