TY - GEN
T1 - Quality-oriented search for depression portals
AU - Tang, Thanh
AU - Hawking, David
AU - Sankaranarayana, Ramesh
AU - Griffiths, Kathleen M.
AU - Craswell, Nick
PY - 2009
Y1 - 2009
N2 - The problem of low-quality information on the Web is nowhere more important than in the domain of health, where unsound information and misleading advice can have serious consequences. The quality of health web sites can be rated by subject experts against evidence-based guidelines. We previously developed an automated quality rating technique (AQA) for depression websites and showed that it correlated 0.85 with such expert ratings. In this paper, we use AQA to filter or rerank Google results returned in response to queries relating to depression. We compare this to an unrestricted quality-oriented (AQA based) focused crawl starting from an Open Directory category and a conventional crawl with manually constructed seedlist and inclusion rules. The results show that postprocessed Google outperforms other forms of search engine restricted to the domain of depressive illness on both relevance and quality.
AB - The problem of low-quality information on the Web is nowhere more important than in the domain of health, where unsound information and misleading advice can have serious consequences. The quality of health web sites can be rated by subject experts against evidence-based guidelines. We previously developed an automated quality rating technique (AQA) for depression websites and showed that it correlated 0.85 with such expert ratings. In this paper, we use AQA to filter or rerank Google results returned in response to queries relating to depression. We compare this to an unrestricted quality-oriented (AQA based) focused crawl starting from an Open Directory category and a conventional crawl with manually constructed seedlist and inclusion rules. The results show that postprocessed Google outperforms other forms of search engine restricted to the domain of depressive illness on both relevance and quality.
KW - Health portal search
KW - Quality filtering of search results
UR - http://www.scopus.com/inward/record.url?scp=67650703929&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-00958-7_60
DO - 10.1007/978-3-642-00958-7_60
M3 - Conference contribution
SN - 3642009573
SN - 9783642009570
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 637
EP - 644
BT - Advances in Information Retrieval - 31th European Conference on IR Research, ECIR 2009, Proceedings
T2 - 31th European Conference on Information Retrieval, ECIR 2009
Y2 - 6 April 2009 through 9 April 2009
ER -