Using anchor text for homepage and topic distillation search tasks

Mingfang Wu*, David Hawking, Andrew Turpin, Falk Scholer

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

13 Citations (Scopus)

Abstract

Past work suggests that anchor text is a good source of evidence that can be used to improve web searching. Two approaches for making use of this evidence include fusing search results from an anchor text representation and the original text representation based on a document's relevance score or rank position, and combining term frequency from both representations during the retrieval process. Although these approaches have each been tested and compared against baselines, different evaluations have used different baselines; no consistent work enables rigorous cross-comparison between these methods. The purpose of this work is threefold. First, we survey existing fusion methods of using anchor text in search. Second, we compare these methods with common testbeds and web search tasks, with the aim of identifying the most effective fusion method. Third, we try to correlate search performance with the characteristics of a test collection. Our experimental results show that the best performing method in each category can significantly improve search results over a common baseline. However, there is no single technique that consistently outperforms competing approaches across different collections and search tasks.

Original languageEnglish
Pages (from-to)1235-1255
Number of pages21
JournalJournal of the American Society for Information Science and Technology
Volume63
Issue number6
DOIs
Publication statusPublished - Jun 2012

Fingerprint

Dive into the research topics of 'Using anchor text for homepage and topic distillation search tasks'. Together they form a unique fingerprint.

Cite this