Exploiting the wisdom of the crowds for characterizing and connecting heterogeneous resources

Ricardo Kawase, Patrick Siehndel, Bernardo Pereira Nunes, Eelco Herder, Wolfgang Nejdl

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Heterogeneous content is an inherent problem for cross-system search, recommendation and personalization. In this paper we investigate differences in topic coverage and the impact of topics in different kinds of Web services. We use entity extraction and categorization to create fingerprints that allow for meaningful comparison. As a basis taxonomy, we use the 23 main categories of Wikipedia Category Graph, which has been assembled over the years by the wisdom of the crowds. Following a proof of concept of our approach, we analyze differences in topic coverage and topic impact. The results show many differences between Web services like Twitter, Flickr and Delicious, which reflect users' behavior and the usage of each system. The paper concludes with a user study that demonstrates the benefits of fingerprints over traditional textual methods for recommendations of heterogeneous resources.

Original languageEnglish
Title of host publicationHT 2014 - Proceedings of the 25th ACM Conference on Hypertext and Social Media
PublisherAssociation for Computing Machinery
Pages56-65
Number of pages10
ISBN (Print)9781450329545
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event25th ACM Conference on Hypertext and Social Media, HT 2014 - Santiago, Chile
Duration: 1 Sept 20144 Sept 2014

Publication series

NameHT 2014 - Proceedings of the 25th ACM Conference on Hypertext and Social Media

Conference

Conference25th ACM Conference on Hypertext and Social Media, HT 2014
Country/TerritoryChile
CitySantiago
Period1/09/144/09/14

Fingerprint

Dive into the research topics of 'Exploiting the wisdom of the crowds for characterizing and connecting heterogeneous resources'. Together they form a unique fingerprint.

Cite this