Exploiting User Disagreement for Web Search Evaluation: an Experimental Approach

被引:6
|
作者
Demeester, Thomas [1 ]
Aly, Robin [2 ]
Hiemstra, Djoerd [2 ]
Dong Nguyen [2 ]
Trieschnigg, Dolf [2 ]
Develder, Chris [1 ]
机构
[1] Univ Ghent, iMinds, Ghent, Belgium
[2] Univ Twente, Enschede, Netherlands
关键词
User disagreement; graded relevance; evaluation; RELEVANCE JUDGMENTS;
D O I
10.1145/2556195.2556268
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To express a more nuanced notion of relevance as compared to binary judgments, graded relevance levels can be used for the evaluation of search results. Especially in Web search, users strongly prefer top results over less relevant results, and yet they often disagree on which are the top results for a given information need. Whereas previous works have generally considered disagreement as a negative effect, this paper proposes a method to exploit this user disagreement by integrating it into the evaluation procedure. First, we present experiments that investigate the user disagreement. We argue that, with a high disagreement, lower relevance levels might need to be promoted more than in the case where there is global consensus on the top results. This is formalized by introducing the User Disagreement Model, resulting in a weighting of the relevance levels with a probabilistic interpretation. A validity analysis is given, and we explain how to integrate the model with well-established evaluation metrics. Finally, we discuss a specific application of the model, in the estimation of suitable weights for the combined relevance of Web search snippets and pages.
引用
收藏
页码:33 / 42
页数:10
相关论文
共 50 条
  • [1] User Intent and Assessor Disagreement in Web Search Evaluation
    Kazai, Gabriella
    Yilmaz, Emine
    Craswell, Nick
    Tahaghoghi, S. M. M.
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 699 - 708
  • [2] Exploiting contextual independencies in web search and user profiling
    Butz, CJ
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 1051 - 1056
  • [3] Inferring User Intent in Web Search by Exploiting Social Annotations
    Conde, Jose M.
    Vallet, David
    Castells, Pablo
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 827 - 828
  • [4] Modeling Web-search scenarios exploiting user and source profiles
    Palopoli, L
    Terracina, G
    Ursino, D
    AI COMMUNICATIONS, 2001, 14 (04) : 215 - 230
  • [5] User behavior modeling for Web search evaluation
    Zhang, Fan
    Liu, Yiqun
    Mao, Jiaxin
    Zhang, Min
    Ma, Shaoping
    AI OPEN, 2020, 1 : 40 - 56
  • [6] Personalised Web search and user satisfaction: a user-centred evaluation
    Salehi, Sarah
    Du, Jia Tina
    Ashman, Helen
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2020, 25 (04): : 1
  • [7] PHASES:: A user profile learning approach for web search
    Eckhardt, A.
    Horvath, T.
    Vojtas, P.
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 780 - +
  • [8] User evaluation of textual results clustering for web search
    Pu, Hsiao-Tieh
    ONLINE INFORMATION REVIEW, 2010, 34 (06) : 855 - 874
  • [9] User Evaluation Methods for Visual Web Search Interfaces
    Hoeber, Orland
    INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 139 - 145
  • [10] A Live-User Evaluation of Collaborative Web Search
    Smyth, Barry
    Balfe, Evelyn
    Boydell, Oisin
    Bradley, Keith
    Briggs, Peter
    Coyle, Maurice
    Freyne, Jill
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1419 - 1424