TY - GEN
T1 - Identifying candidate datasets for data interlinking
AU - Leme, Luiz André P.Paes
AU - Lopes, Giseli Rabello
AU - Nunes, Bernardo Pereira
AU - Casanova, Marco Antonio
AU - Dietze, Stefan
PY - 2013
Y1 - 2013
N2 - One of the design principles that can stimulate the growth and increase the usefulness of the Web of data is URIs linkage. However, the related URIs are typically in different datasets managed by different publishers. Hence, the designer of a new dataset must be aware of the existing datasets and inspect their content to define sameAs links. This paper proposes a technique based on probabilistic classifiers that, given a datasets S to be published and a set T of known published datasets, ranks each T i â̂̂ T according to the probability that links between S and T i can be found by inspecting the most relevant datasets. Results from our technique show that the search space can be reduced up to 85%, thereby greatly decreasing the computational effort.
AB - One of the design principles that can stimulate the growth and increase the usefulness of the Web of data is URIs linkage. However, the related URIs are typically in different datasets managed by different publishers. Hence, the designer of a new dataset must be aware of the existing datasets and inspect their content to define sameAs links. This paper proposes a technique based on probabilistic classifiers that, given a datasets S to be published and a set T of known published datasets, ranks each T i â̂̂ T according to the probability that links between S and T i can be found by inspecting the most relevant datasets. Results from our technique show that the search space can be reduced up to 85%, thereby greatly decreasing the computational effort.
KW - Bayesian classifier
KW - Linked Data
KW - data interlinking
KW - datasets recommendation
UR - http://www.scopus.com/inward/record.url?scp=84880871939&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-39200-9_29
DO - 10.1007/978-3-642-39200-9_29
M3 - Conference contribution
SN - 9783642391996
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 354
EP - 366
BT - Web Engineering - 13th International Conference, ICWE 2013, Proceedings
T2 - 13th International Conference on Web Engineering, ICWE 2013
Y2 - 8 July 2013 through 12 July 2013
ER -