The Effect of Assessor Errors on IR System Evaluation

被引:0
|
作者
Carterette, Ben [1 ]
Soboroff, Ian [1 ]
机构
[1] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
关键词
assessor error; retrieval test collections;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent efforts in test collection building have focused on scaling back the number of necessary relevance judgments and then scaling up the number of search topics. Since the largest source of variation in a Cranfield-style experiment comes from the topics, this is a reasonable approach. However, as topic set sizes grow, and researchers look to crowdsourcing and Amazon's Mechanical Turk to collect relevance judgments, we are faced with issues of quality control. This paper examines the robustness of the TREC Million Query track methods when some assessors make significant and systematic errors. We find that while averages are robust, assessor errors can have a large effect on system rankings.
引用
收藏
页码:539 / 546
页数:8
相关论文
共 50 条
  • [41] The Narrative Evaluation Quality Instrument: Development of a Tool to Assess the Assessor
    Thompson-Stone, Robert
    Kelly, Michael
    Mooney, Christopher
    Braun, Melanie
    Rosati, Justin
    NEUROLOGY, 2020, 94 (15)
  • [42] THE EFFECT OF PUNISHMENT FOR ERRORS ON LEARNING - AN EVALUATION OF THE PARAMETRIC AND MOTIVATION HYPOTHESES
    KNIGHT, NB
    JOURNAL OF PSYCHOLOGICAL STUDIES, 1958, 10 (02): : 76 - 83
  • [43] Effect Evaluation of Optical Magnification Errors for Coded Aperture Spectrometer
    Ma Yuan
    Lu Qun-bo
    Liu Yang-yang
    Qian Lu-lu
    Pei Lin-lin
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2014, 34 (11) : 3157 - 3161
  • [44] Effect of Errors in Maximum Power on Reliability Evaluation for Photovoltaic Modules
    He, Fengqin
    Yang, Qi
    Han, Hongmin
    Hao, Yuanpang
    Huang, Xin
    Yang, Qian
    Wang, Congyu
    Yang, Hong
    2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 952 - 956
  • [45] Critical evaluation of assessor difference correction approaches in sensory analysis
    Grossmann, Justus L.
    Westerhuis, Johan A.
    Naes, Tormod
    Smilde, Age K.
    FOOD QUALITY AND PREFERENCE, 2023, 106
  • [46] The Participation of Social Organizations and the Reform of the People's Assessor System
    Sun Jian
    Xu Yun
    学术界, 2014, (01) : 301 - 306
  • [47] EVALUATION OF ERRORS IN A CURRENT COMPARATOR SYSTEM USED FOR CURRENT TRANSFORMER TESTING
    TAKAHASHI, K
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1989, 38 (02) : 402 - 406
  • [48] Design of Enhanced Vehicle Safety System Based on The Evaluation of Driving Errors
    Attia, Hussain A.
    Ismail, Shereen
    Ali, Halah Y.
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 638 - 641
  • [49] EFFECT OF ERRORS OF CONTROL OF A SEARCH SYSTEM ON CHARACTERISTICS OF SIGNAL DETECTION
    KUKSENKO, HP
    REPIN, VG
    RADIO ENGINEERING AND ELECTRONIC PHYSICS-USSR, 1969, 14 (06): : 846 - &
  • [50] On the effect of plotting performance by the errors of pointing targets in the ARPA system
    Pedersen, E
    JOURNAL OF NAVIGATION, 1999, 52 (01): : 119 - 125