Censoring Unbiased Regression Trees and Ensembles

被引:32
|
作者
Steingrimsson, Jon Arni [1 ]
Diao, Liqun [2 ]
Strawderman, Robert L. [3 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON, Canada
[3] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
关键词
Classification and regression trees (CART); Doubly robust; Loss estimation; Random forests; Risk prediction; RANDOM FORESTS; SURVIVAL;
D O I
10.1080/01621459.2017.1407775
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article proposes a novel paradigm for building regression trees and ensemble learning in survival analysis. Generalizations of the classification and regression trees (CART) and random forests (RF) algorithms for general loss functions, and in the latter case more general bootstrap procedures, are both introduced. These results, in combination with an extension of the theory of censoring unbiased transformations (CUTs) applicable to loss functions, underpin the development of two new classes of algorithms for constructing survival trees and survival forests: censoring unbiased regression trees and censoring unbiased regression ensembles. For a certain doubly robust CUT of squared error loss, we further show how these new algorithms can be implemented using existing software (e.g., CART, RF). Comparisons of these methods to existing ensemble procedures for predicting survival probabilities are provided in both simulated settings and through applications to four datasets. It is shown that these new methods either improve upon, or remain competitive with, existing implementations of random survival forests, conditional inference forests, and recursively imputed survival trees.
引用
收藏
页码:370 / 383
页数:14
相关论文
共 50 条
  • [31] Quantile regression with censoring and sample selection
    Chen, Songnian
    Wang, Qian
    [J]. JOURNAL OF ECONOMETRICS, 2023, 234 (01) : 205 - 226
  • [32] Classification trees with unbiased multiway splits
    Kim, H
    Loh, WY
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) : 589 - 604
  • [33] Managing diversity in regression ensembles
    Brown, G
    Wyatt, JL
    Tino, P
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 1621 - 1650
  • [34] Boosting diversity in regression ensembles
    Bourel, Mathias
    Cugliari, Jairo
    Goude, Yannig
    Poggi, Jean-Michel
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (01)
  • [35] Random Oracles for Regression Ensembles
    Pardo, Carlos
    Rodriguez, Juan J.
    Diez-Pastor, Jose F.
    Garcia-Osorio, Cesar
    [J]. ENSEMBLES IN MACHINE LEARNING APPLICATIONS, 2011, 373 : 181 - 199
  • [36] Multinomial Logistic Regression Ensembles
    Lee, Kyewon
    Ahn, Hongshik
    Moon, Hojin
    Kodell, Ralph L.
    Chen, James J.
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2013, 23 (03) : 681 - 694
  • [37] UNBIASED TESTING IN EXPONENTIAL FAMILY REGRESSION
    COHEN, A
    KEMPERMAN, JHB
    SACKROWITZ, HB
    [J]. ANNALS OF STATISTICS, 1994, 22 (04): : 1931 - 1946
  • [38] Unbiased estimators for random design regression
    Derezinski, Michal
    Warmuth, Manfred K.
    Hsu, Daniel
    [J]. Journal of Machine Learning Research, 2022, 23
  • [39] Genetically evolved trees representing ensembles
    Johansson, Ulf
    Lofstrom, Tuve
    Konig, Rikard
    Niklasson, Lars
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 613 - 622
  • [40] Ensembles of Decision Trees for Imbalanced Data
    Rodriguez, Juan J.
    Diez-Pastor, Jose F.
    Garcia-Osorio, Cesar
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2011, 6713 : 76 - 85