Factors affecting inter-rater agreement in human classification of eye movements: a comparison of three datasets

Cited by: 0
Authors
Lee Friedman
Vladyslav Prokopenko
Shagen Djanian
Dmytro Katrychuk
Oleg V. Komogortsev
Affiliations
[1] Texas State University, Derrick M5, Department of Computer Science
[2] Aalborg University, Department of Computer Science
Source
Behavior Research Methods | 2023 / Volume 55, Issue 1
Keywords
Eye-movements; Manual classification; Sample-level agreement; Event-level agreement; Cohen’s Kappa; F1-score;
DOI
Not available
Abstract
Manual classification of eye-movements is used in research and as a basis for comparison with automatic algorithms in the development phase. However, human classification will not be useful if it is unreliable and unrepeatable. Therefore, it is important to know what factors might influence and enhance the accuracy and reliability of human classification of eye-movements. In this report we compare three datasets of human manual classification, two from earlier datasets and one, our own dataset, which we present here for the first time. For inter-rater reliability, we assess both the event-level F1-score and sample-level Cohen’s κ, across groups of raters. The report points to several possible influences on human classification reliability: eye-tracker quality, use of head restraint, characteristics of the recorded subjects, the availability of detailed scoring rules, and the characteristics and training of the raters.
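As a rough illustration of the sample-level measure named in the abstract, the following is a minimal sketch of Cohen's κ for two raters who have each labelled the same sequence of gaze samples. The function name `cohen_kappa`, the label codes, and the example sequences are hypothetical and are not taken from the paper; the event-level F1-score is not sketched here because it depends on an event-matching procedure that this record does not describe.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Sample-level Cohen's kappa for two equal-length label sequences."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(rater_a)
    # Observed agreement: fraction of samples given the same label by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement, computed from each rater's marginal label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one identical label for every sample
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical per-sample labels (assumed codes: F = fixation, S = saccade,
# P = post-saccadic oscillation); real recordings have thousands of samples.
rater_1 = list("FFFFSSPPFF")
rater_2 = list("FFFSSSPFFF")
kappa = cohen_kappa(rater_1, rater_2)
print(f"kappa = {kappa:.3f}")
```

In this toy case the raters agree on 8 of 10 samples, but because both label most samples as fixations, a sizeable share of that agreement is expected by chance, and κ comes out well below the raw agreement rate.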
Pages: 417-427
Page count: 10