An IRT forecasting model: linking proper scoring rules to item response theory

被引:0
|
作者
Bo, Yuanchao Emily [1 ]
Budescu, David V. [2 ]
Lewis, Charles [2 ]
Tetlock, Philip E. [3 ]
Mellers, Barbara [3 ]
机构
[1] NWEA, Portland, OR 97209 USA
[2] Fordham Univ, Bronx, NY 10458 USA
[3] Univ Penn, Philadelphia, PA 19104 USA
来源
JUDGMENT AND DECISION MAKING | 2017年 / 12卷 / 02期
关键词
IRT; Forecasting; Brier scores; Proper Scoring Rules; Good Judgment Project; Gibbs sampling; DOMINANCE ANALYSIS; PREDICTORS; ACCURACY;
D O I
暂无
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
This article proposes an Item Response Theoretical (IRT) forecasting model that incorporates proper scoring rules and provides evaluations of forecasters' expertise in relation to the features of the specific questions they answer. We illustrate the model using geopolitical forecasts obtained by the Good Judgment Project (GJP) (see Mellers, Ungar, Baron, Ramos, Gurcay, Fincher, Scott, Moore, Atanasov, Swift, Murray, Stone & Tetlock, 2014). The expertise estimates from the IRT model, which take into account variation in the difficulty and discrimination power of the events, capture the underlying construct being measured and are highly correlated with the forecasters' Brier scores. Furthermore, our expertise estimates based on the first three years of the GJP data are better predictors of both the forecasters' fourth year Brier scores and their activity level than the overall Brier scores obtained and Merkle's (2016) predictions, based on the same period. Lastly, we discuss the benefits of using event-characteristic information in forecasting.
引用
收藏
页码:90 / 103
页数:14
相关论文
共 50 条
  • [21] Implementation Strategies for Learning: An Item Response Theory (IRT)-Based Analysis
    Correa Brant, Sandra Regina
    Pilati, Ronaldo
    Borges-Andrade, Jairo Eduardo
    PSICOLOGIA-REFLEXAO E CRITICA, 2015, 28 (01): : 1 - 10
  • [22] IRT-Lab: Software for research and pedagogy in item response theory
    Penfield, RD
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2003, 27 (04) : 301 - 302
  • [23] MEASURING AND MONITORING TRENDS IN MORBIDITY USING ITEM RESPONSE THEORY (IRT)
    Fries, J. F.
    GERONTOLOGIST, 2010, 50 : 88 - 88
  • [24] Bayesian Model Selection Based on Proper Scoring Rules - Comment
    Grazian, C.
    Masiani, I.
    Robert, C. P.
    BAYESIAN ANALYSIS, 2015, 10 (02): : 511 - 515
  • [25] Using SAS PROC IRT for Multidimensional Item Response Theory Analysis
    Cole, Ki
    Paek, Insu
    MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2022, 20 (01) : 49 - 55
  • [26] Item Response Theory With Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model
    Tay, Louis
    Huang, Qiming
    Vermunt, Jeroen K.
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2016, 76 (01) : 22 - 42
  • [27] IPLINK: Multidimensional and unidimensional item parameter linking in item response theory
    Lee, K
    Oshima, TC
    APPLIED PSYCHOLOGICAL MEASUREMENT, 1996, 20 (03) : 230 - 230
  • [28] Allostatic load scoring using item response theory
    Liu, Shelley H.
    Juster, Robert-Paul
    Dams-O'Connor, Kristen
    Spicer, Julie
    COMPREHENSIVE PSYCHONEUROENDOCRINOLOGY, 2021, 5
  • [29] On Item Response Theory models in allostatic load scoring
    Coutinho-Lourenco, Fausto
    Pompeia, Sabine
    Cogo-Moreira, Hugo
    COMPREHENSIVE PSYCHONEUROENDOCRINOLOGY, 2021, 6
  • [30] Item Response Theory Scoring and the Detection of Curvilinear Relationships
    Carter, Nathan T.
    Dalal, Dev K.
    Guan, Li
    LoPilato, Alexander C.
    Withrow, Scott A.
    PSYCHOLOGICAL METHODS, 2017, 22 (01) : 191 - 203