TY - GEN
T1 - Users versus models
T2 - 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013
AU - Moffat, Alistair
AU - Thomas, Paul
AU - Scholer, Falk
PY - 2013
Y1 - 2013
N2 - Retrieval system effectiveness can be measured in two quite different ways: by monitoring the behavior of users and gathering data about the ease and accuracy with which they accomplish certain specified information-seeking tasks; or by using numeric effectiveness metrics to score system runs in reference to a set of relevance judgments. In the second approach, the effectiveness metric is chosen in the belief that user task performance, if it were to be measured by the first approach, should be linked to the score provided by the metric. This work explores that link, by analyzing the assumptions and implications of a number of effectiveness metrics, and exploring how these relate to observable user behaviors. Data recorded as part of a user study included user self-assessment of search task difficulty; gaze position; and click activity. Our results show that user behavior is influenced by a blend of many factors, including the extent to which relevant documents are encountered, the stage of the search process, and task difficulty. These insights can be used to guide development of batch effectiveness metrics.
KW - Evaluation
KW - Retrieval experiment
KW - System measurement
UR - http://www.scopus.com/inward/record.url?scp=84889574389&partnerID=8YFLogxK
U2 - 10.1145/2505515.2507665
DO - 10.1145/2505515.2507665
M3 - Conference contribution
SN - 9781450322638
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 659
EP - 668
BT - CIKM 2013 - Proceedings of the 22nd ACM International Conference on Information and Knowledge Management
Y2 - 27 October 2013 through 1 November 2013
ER -