TY - GEN
T1 - Merging algorithms for enterprise search
AU - Li, Peng Fei
AU - Thomas, Paul
AU - Hawking, David
PY - 2013
Y1 - 2013
N2 - Effective enterprise search must draw on a number of sources-for example web pages, telephone directories, and databases. Doing this means we need a way to make a single sorted list from results of very different types. Many merging algorithms have been proposed but none have been applied to this, realistic, application. We report the results of an experiment which simulates heterogeneous enterprise retrieval, in a university setting, and uses multigrade expert judgements to compare merging algorithms. Merging algorithms considered include several variants of round-robin, several methods proposed by Rasolofo et al. in the Current News Metasearcher, and four novel variations including a learned multi-weight method. We find that the round-robin methods and one of the Rasolofo methods perform significantly worse than others. The GDS TS method of Rasolofo achieves the highest average NDCG@101 score but the differences between it and the other GDS methods, local reranking, and the multi-weight method were not significant. Copyright is held by the owner/author(s).
AB - Effective enterprise search must draw on a number of sources-for example web pages, telephone directories, and databases. Doing this means we need a way to make a single sorted list from results of very different types. Many merging algorithms have been proposed but none have been applied to this, realistic, application. We report the results of an experiment which simulates heterogeneous enterprise retrieval, in a university setting, and uses multigrade expert judgements to compare merging algorithms. Merging algorithms considered include several variants of round-robin, several methods proposed by Rasolofo et al. in the Current News Metasearcher, and four novel variations including a learned multi-weight method. We find that the round-robin methods and one of the Rasolofo methods perform significantly worse than others. The GDS TS method of Rasolofo achieves the highest average NDCG@101 score but the differences between it and the other GDS methods, local reranking, and the multi-weight method were not significant. Copyright is held by the owner/author(s).
KW - Federated search
KW - Information retrieval
KW - Results merging
UR - http://www.scopus.com/inward/record.url?scp=84892766686&partnerID=8YFLogxK
U2 - 10.1145/2537734.2537750
DO - 10.1145/2537734.2537750
M3 - Conference contribution
SN - 9781450325240
T3 - ACM International Conference Proceeding Series
SP - 42
EP - 49
BT - ADCS 2013 - Proceedings of the 18th Australasian Document Computing Symposium
T2 - 18th Australasian Document Computing Symposium, ADCS 2013
Y2 - 5 December 2013 through 6 December 2013
ER -