Ye Chen
Meta-evaluation of online and offline web search evaluation metrics
Chen, Ye; Zhou, Ke; Liu, Yiqun; Zhang, Min; Ma, Shaoping
Abstract
As in most information retrieval (IR) studies, evaluation plays an essential part in Web search research. Both offline and online evaluation metrics are adopted in measuring the performance of search engines. Offline metrics are usually based on relevance judgments of query-document pairs from assessors while online metrics exploit the user behavior data, such as clicks, collected from search engines to compare search algorithms. Although both types of IR evaluation metrics have achieved success, to what extent can they predict user satisfaction still remains under-investigated. To shed light on this research question, we meta-evaluate a series of existing online and offline metrics to study how well they infer actual search user satisfaction in different search scenarios. We find that both types of evaluation metrics significantly correlate with user satisfaction while they reflect satisfaction from different perspectives for different search tasks. Offline metrics better align with user satisfaction in homogeneous search (i.e. ten blue links) whereas online metrics outperform when vertical results are federated. Finally, we also propose to incorporate mouse hover information into existing online evaluation metrics, and empirically show that they better align with search user satisfaction than click-based online metrics.
Citation
Chen, Y., Zhou, K., Liu, Y., Zhang, M., & Ma, S. (2017). Meta-evaluation of online and offline web search evaluation metrics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '17 (15-24). https://doi.org/10.1145/3077136.3080804
Presentation Conference Type | Edited Proceedings |
---|---|
Conference Name | SIGIR '17: 40th International ACM SIGIR Conference on Research and Development in Information Retrieval |
Start Date | Aug 7, 2017 |
End Date | Aug 11, 2017 |
Acceptance Date | May 16, 2017 |
Publication Date | Aug 7, 2017 |
Deposit Date | Aug 22, 2017 |
Publicly Available Date | Aug 22, 2017 |
Peer Reviewed | Peer Reviewed |
Pages | 15-24 |
Book Title | Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '17 |
ISBN | 9781450350228 |
DOI | https://doi.org/10.1145/3077136.3080804 |
Public URL | https://nottingham-repository.worktribe.com/output/876955 |
Publisher URL | https://doi.org/10.1145/3077136.3080804 |
Contract Date | Aug 22, 2017 |
Files
sigir2017 (8).pdf
(1.1 Mb)
PDF
You might also like
Detecting collusive spamming activities in community question answering
(2017)
Presentation / Conference Contribution
Palimpsest: improving assisted curation of loco-specific literature
(2016)
Journal Article
A comparative analysis of interleaving methods for aggregated search
(2015)
Journal Article
Composite retrieval of heterogeneous web search
(2014)
Presentation / Conference Contribution
Predicting pre-click quality for native advertisements
(2016)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search