Regression classification for Improved Temporal Record Linkage

Qing Wang, Dinusha Vatsalan, Peter Christen, Yichen Hu

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Abstract

    Temporal record linkage is the process of identifying groups of records that represent the same real-world entities in data collected over long periods of time, such as census or voter registration databases. These datasets often contain temporal information for each record, such as the time when the record was created or the time when it was modified. Unlike traditional record linkage, which treats differences between records of the same entity as errors or variations, temporal record linkage aims to capture records of entities whose details change over time. This paper proposes a temporal record linkage approach that learns the probabilities that attribute values of records change within different periods of time, extending the decay model of an existing temporal approach. The proposed method uses a regression-based machine learning model to predict decay over sets of attributes, where the attribute values in each set could affect the decay of the others. Our experimental results show that the proposed approach generally achieves better recall than baseline approaches on real-world datasets.
    Original language: English
    Title of host publication: Conferences in Research and Practice in Information Technology
    Editors: V. Estivill-Castro & S. Simoff
    Place of Publication: Sydney, Australia
    Publisher: Australian Computer Society Inc.
    Pages: 1-10
    Edition: Peer Reviewed
    Publication status: Published - 2016
    Event: Australasian Data Mining Conference (AusDM 2016) - Canberra, Australia
    Duration: 1 Jan 2016 → …
    http://www.academia.edu/30531349/Factors_influencing_Australian_teachers_intent_to_leave_the_teaching_profession

    Conference

    Conference: Australasian Data Mining Conference (AusDM 2016)
    Period: 1/01/16 → …
    Other: December 6-8 2016
