Effect of Errors on the Evaluation of Machine Learning Systems

Cited by: 1
Authors
Bracamonte, Vanessa [1]
Hidano, Seira [1]
Kiyomoto, Shinsaku [1]
Affiliations
[1] KDDI Res Inc, Saitama, Japan
Keywords
User Perception; Errors; Machine Learning Model Evaluation; User Study; AUTOMATION; TRUST
DOI
10.5220/0010839300003124
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Information such as accuracy and outcome explanations can be useful for evaluating machine learning systems, but it can also lead to over-trust. An evaluator may then not suspect that a machine learning system could contain errors, and may overlook problems in the explanations of that system. Research has shown that errors not only decrease trust but can also promote curiosity about the performance of the system. Presenting errors to evaluators may therefore be a way to induce suspicion during the evaluation of a machine learning system. In this paper, we evaluate this possibility by conducting three experiments in which we asked participants to evaluate text classification systems. We presented two types of errors: incorrect predictions and errors in the explanation. The results show that patterns of errors in the explanation negatively influenced willingness to recommend a system, and that fewer participants chose a system with higher accuracy when there was an error pattern than when the errors were random. Moreover, more participants cited evidence from the explanations as a reason for their evaluation of the systems, suggesting that they were able to detect error patterns.
Pages: 48-57
Number of pages: 10