Calibrating Random Forests

被引:35
|
作者
Bostrom, Henrik [1 ]
机构
[1] Univ Skovde, Informat Res Ctr, S-54128 Skovde, Sweden
关键词
D O I
10.1109/ICMLA.2008.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When using the output of classifiers to calculate the expected utility of different alternatives in decision situations, the correctness of predicted class probabilities may be of crucial importance. However even very accurate classifiers may, output class probabilities of rather poor quality. One way of overcoming this problem is by means of calibration, i.e., mapping the original class probabilities to more accurate ones. Previous studies have however indicated that random forests are difficult to calibrate by standard calibration methods. In this work, a novel calibration method is introduced, which is based on a recent finding that probabilities predicted by forests of classification trees have a lower squared error compared to those predicted by forests of probability estimation trees (PETs). The novel calibration method is compared to the two standard methods, Platt scaling and isotonic regression, on 34 datasets from the UCI repository. The experiment shows that random forests of PETs calibrated by the novel method significantly outperform uncalibrated random forests of both PETs and classification trees, as well as random forests calibrated with the two standard methods, with respect to the squared error of predicted class probabilities.
引用
收藏
页码:121 / 126
页数:6
相关论文
共 50 条
  • [1] Calibrating random forests for probability estimation
    Dankowski, Theresa
    Ziegler, Andreas
    [J]. STATISTICS IN MEDICINE, 2016, 35 (22) : 3949 - 3960
  • [2] Posture-invariant myoelectric control with self-calibrating random forests
    Jiang, Xinyu
    Ma, Chenfei
    Nazarpour, Kianoush
    [J]. Frontiers in Neurorobotics, 2024, 18
  • [3] Imprecise Extensions of Random Forests and Random Survival Forests
    Utkin, Lev, V
    Kovalev, Maxim S.
    Meldo, Anna A.
    Coolen, Frank P. A.
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITIES: THEORIES AND APPLICATIONS (ISIPTA 2019), 2019, 103 : 404 - 413
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Random Forests
    Leo Breiman
    [J]. Machine Learning, 2001, 45 : 5 - 32
  • [6] Random forests
    Pavlov, YL
    [J]. PROBABILISTIC METHODS IN DISCRETE MATHEMATICS, 1997, : 11 - 18
  • [7] Calibrating Dependence Between Random Elements
    Kagan, Abram M.
    Szekely, Gabor J.
    [J]. JOURNAL OF THEORETICAL PROBABILITY, 2021, 34 (02) : 784 - 790
  • [8] Calibrating Dependence Between Random Elements
    Abram M. Kagan
    Gábor J. Székely
    [J]. Journal of Theoretical Probability, 2021, 34 : 784 - 790
  • [9] Calibrating the liana crown occupancy index in Amazonian forests
    van der Heijden, Geertje M. F.
    Feldpausch, Ted R.
    Herrero, Ana de la Fuente
    van der Velden, Naomi K.
    Phillips, Oliver L.
    [J]. FOREST ECOLOGY AND MANAGEMENT, 2010, 260 (04) : 549 - 555
  • [10] Random Prism: An Alternative to Random Forests
    Stahl, Frederic
    Bramer, Max
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVIII: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XIX, 2011, : 5 - 18