TagTheWeb: Using wikipedia categories to automatically categorize resources on the web

Jerry Fernandes Medeiros*, Bernardo Pereira Nunes, Sean Wolfgand Matsui Siqueira, Luiz André Portes Paes Leme

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Citations (Scopus)


Identifying topics associated with a set of documents is a common task for many applications and can be used to improve various tasks involving documents on the Web, such as search, retrieval, recommendation, and clustering. To address this problem, this paper introduces a tool, called TagTheWeb, as a proposition of a generic classification method, that relies on the knowledge expressed by the taxonomic structure of Wikipedia, based on the generation of a fingerprint through the semantic relation between nodes of the Wikipedia Category Graph. TagTheWeb can be used as a WEB interface or as an API to classify any text based resource.

Original languageEnglish
Title of host publicationThe Semantic Web
Subtitle of host publicationESWC 2018 Satellite Events - ESWC 2018, Revised Selected Papers
EditorsAldo Gangemi, Anna Lisa Gentile, Heiko Paulheim, Maria Maleshkova, Sebastian Rudolph, Jeff Z. Pan, Mehwish Alam, Andrea Giovanni Nuzzolese
PublisherSpringer Verlag
Number of pages5
ISBN (Print)9783319981918
Publication statusPublished - 2018
Externally publishedYes
Event15th Extended Semantic Web Conference, ESWC 2018 - Heraklion, Greece
Duration: 3 Jun 20187 Jun 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11155 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference15th Extended Semantic Web Conference, ESWC 2018


Dive into the research topics of 'TagTheWeb: Using wikipedia categories to automatically categorize resources on the web'. Together they form a unique fingerprint.

Cite this