Global Interpretable Calibration Index, a New Metric to Estimate Machine Learning Models' Calibration

Cited by: 1
Authors
Cabitza, Federico [1, 2]
Campagner, Andrea [1]
Famiglini, Lorenzo [1]
Affiliations
[1] Univ Milano Bicocca, Dipartimento Informat Sistemist & Comunicaz, Viale Sarca 336, I-20126 Milan, Italy
[2] IRCCS Ist Ortoped Galeazzi, Milan, Italy
Keywords
Calibration; Re-calibration; Interpretability; Medical machine learning; Locally weighted regression; Prediction
DOI
10.1007/978-3-031-14463-9_6
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The concept of calibration is key in the development and validation of Machine Learning models, especially in sensitive contexts such as the medical one. However, existing calibration metrics can be difficult to interpret and are affected by theoretical limitations. In this paper, we present a new metric, called GICI (Global Interpretable Calibration Index), which is local and defined only in terms of simple geometrical primitives. This makes it both simpler to interpret and more general than other commonly used metrics, as it can also be used in recalibration procedures. Compared to traditional metrics, the GICI also allows for a more comprehensive evaluation, as it provides three levels of information: a bin-level local estimate, a global estimate, and an estimate of the extent to which confidence scores are over- or under-confident with respect to the actual error rate. We also report the results of experiments aimed at testing the above claims and at giving insights into the practical utility of this metric, including for improving discriminative accuracy.
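The abstract does not give the GICI formula, so the following is only a minimal sketch of the kind of three-level summary it describes (per-bin estimate, global estimate, direction of miscalibration), using the standard equal-width binned calibration error instead of GICI. The function name `binned_calibration`, its parameters, and the choice of 10 bins are assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def binned_calibration(conf, correct, n_bins=10):
    """ECE-style binned calibration summary. This is NOT the GICI metric.

    conf:    confidence scores in [0, 1]
    correct: 1 if the corresponding prediction was right, else 0
    """
    conf = np.asarray(conf, dtype=float)
    correct = np.asarray(correct, dtype=float)

    # Equal-width bins on [0, 1]; only the interior edges are needed to assign bins.
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    idx = np.clip(np.digitize(conf, edges[1:-1]), 0, n_bins - 1)

    bins = []
    for b in range(n_bins):
        mask = idx == b
        if not mask.any():
            continue  # skip empty bins
        # Bin-level estimate: mean confidence minus observed accuracy.
        gap = conf[mask].mean() - correct[mask].mean()
        bins.append({"bin": b, "weight": mask.mean(), "gap": gap})

    # Global estimate: weighted mean absolute gap (expected calibration error).
    global_abs = sum(r["weight"] * abs(r["gap"]) for r in bins)
    # Direction: weighted mean signed gap (> 0 over-confident, < 0 under-confident).
    signed = sum(r["weight"] * r["gap"] for r in bins)
    return bins, global_abs, signed
```

Under these assumptions, a positive signed value suggests that confidence scores exceed the observed accuracy on average, while the per-bin gaps show where along the confidence range the mismatch is concentrated.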
Pages: 82-99
Page count: 18