TY - GEN
T1 - Privacy-preserving data linkage and geocoding
T2 - Current approaches and research directions
AU - Christen, Peter
PY - 2006
Y1 - 2006
AB - Data linkage is the task of matching and aggregating records that relate to the same entity from one or more data sets. A related technique is geocoding, the matching of addresses to their geographic locations. As data linkage is often based on personal information (such as names and addresses), privacy and confidentiality are of paramount importance. In this paper we present an overview of current approaches to privacy-preserving data linkage and discuss their limitations. Using real-world scenarios, we illustrate the significance of developing improved techniques for automated, large-scale, and distributed privacy-preserving linking and geocoding. We then discuss four core research areas that need to be addressed in order to make linking and geocoding of large confidential data collections feasible.
UR - http://www.scopus.com/inward/record.url?scp=67650258952&partnerID=8YFLogxK
U2 - 10.1109/icdmw.2006.135
DO - 10.1109/icdmw.2006.135
M3 - Conference contribution
SN - 0769527027
SN - 9780769527024
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 497
EP - 501
BT - Proceedings - ICDM Workshops 2006 - 6th IEEE International Conference on Data Mining - Workshops
PB - Institute of Electrical and Electronics Engineers Inc.
ER -