A logical formalisation of the Fellegi-Holt method of data cleaning

Agnes Boskovitz*, Rajeev Goré, Markus Hegland

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

    8 Citations (Scopus)

    Abstract

    The Fellegi-Holt method automatically "corrects" data that fail some predefined requirements. Computer implementations of the method were used in many national statistics agencies but are less used now because they are slow. We recast the method in prepositional logic, and show that many of its results are well-known results in prepositional logic. In particular we show that the Fellegi-Holt method of "edit generation" is essentially the same as a technique for automating logical deduction called resolution. Since modern implementations of resolution are capable of handling large problems efficiently, they might lead to more efficient implementations of the Fellegi-Holt method.

    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    EditorsMichael R. Berthold, Hans-Joachim Lenz, Elizabeth Bradley, Rudolf Kruse, Christian Borgelt
    PublisherSpringer Verlag
    Pages554-565
    Number of pages12
    ISBN (Print)3540408134, 9783540408130
    DOIs
    Publication statusPublished - 2003

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume2810
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Fingerprint

    Dive into the research topics of 'A logical formalisation of the Fellegi-Holt method of data cleaning'. Together they form a unique fingerprint.

    Cite this