Summaries on the fly: Query-based extraction of structured knowledge from web documents

Besnik Fetahu, Bernardo Pereira Nunes, Stefan Dietze

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A large part of Web resources consists of unstructured textual content. Processing and retrieving relevant content for a particular information need is challenging for both machines and humans. While information retrieval techniques provide methods for detecting suitable resources for a particular query, information extraction techniques enable the extraction of structured data and text summarization allows the detection of important sentences. However, these techniques usually do not consider particular user interests and information needs. In this paper, we present a novel method to automatically generate structured summaries from user queries that uses POS patterns to identify relevant statements and entities in a certain context. Finally, we evaluate our work using the publicly available New York Times corpus, which shows the applicability of our method and the advantages over previous works.

Original languageEnglish
Title of host publicationWeb Engineering - 13th International Conference, ICWE 2013, Proceedings
Pages249-264
Number of pages16
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event13th International Conference on Web Engineering, ICWE 2013 - Aalborg, Denmark
Duration: 8 Jul 201312 Jul 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7977 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th International Conference on Web Engineering, ICWE 2013
Country/TerritoryDenmark
CityAalborg
Period8/07/1312/07/13

Fingerprint

Dive into the research topics of 'Summaries on the fly: Query-based extraction of structured knowledge from web documents'. Together they form a unique fingerprint.

Cite this