Censoring Unbiased Regression Trees and Ensembles

被引:32
|
作者
Steingrimsson, Jon Arni [1 ]
Diao, Liqun [2 ]
Strawderman, Robert L. [3 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON, Canada
[3] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
关键词
Classification and regression trees (CART); Doubly robust; Loss estimation; Random forests; Risk prediction; RANDOM FORESTS; SURVIVAL;
D O I
10.1080/01621459.2017.1407775
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article proposes a novel paradigm for building regression trees and ensemble learning in survival analysis. Generalizations of the classification and regression trees (CART) and random forests (RF) algorithms for general loss functions, and in the latter case more general bootstrap procedures, are both introduced. These results, in combination with an extension of the theory of censoring unbiased transformations (CUTs) applicable to loss functions, underpin the development of two new classes of algorithms for constructing survival trees and survival forests: censoring unbiased regression trees and censoring unbiased regression ensembles. For a certain doubly robust CUT of squared error loss, we further show how these new algorithms can be implemented using existing software (e.g., CART, RF). Comparisons of these methods to existing ensemble procedures for predicting survival probabilities are provided in both simulated settings and through applications to four datasets. It is shown that these new methods either improve upon, or remain competitive with, existing implementations of random survival forests, conditional inference forests, and recursively imputed survival trees.
引用
收藏
页码:370 / 383
页数:14
相关论文
共 50 条
  • [1] Regression trees for interval-censored failure time data based on censoring unbiased transformations and pseudo-observations
    Yang, Ce
    Li, Xianwei
    Diao, Liqun
    Cook, Richard J.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024,
  • [2] Unbiased regression trees for longitudinal and clustered data
    Fu, Wei
    Simonoff, Jeffrey S.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 88 : 53 - 74
  • [3] Regression trees and ensembles for cumulative incidence functions
    Cho, Youngjoo
    Molinaro, Annette M.
    Hu, Chen
    Strawderman, Robert L.
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2022, 18 (02): : 397 - 419
  • [4] Fast Traversal of Large Ensembles of Regression Trees
    Lucchese, Claudio
    Nardini, Franco Maria
    Perego, Raffaele
    Tonellotto, Nicola
    Orlando, Salvatore
    Venturini, Rossano
    [J]. ERCIM NEWS, 2016, (107): : 28 - 29
  • [5] Regression trees with unbiased variable selection and interaction detection
    Loh, WY
    [J]. STATISTICA SINICA, 2002, 12 (02) : 361 - 386
  • [6] Tourism Demand Forecasting using Ensembles of Regression Trees
    Cankurt, Selcuk
    [J]. 2016 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2016, : 702 - 708
  • [7] Ensembles of Extremely Randomized Trees for Multi-target Regression
    Kocev, Dragi
    Ceci, Michelangelo
    [J]. DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 86 - 100
  • [8] MERCS: Multi-Directional Ensembles of Regression and Classification Trees
    Van Wolputte, Elia
    Korneva, Evgeniya
    Blockeel, Hendrik
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4276 - 4283
  • [9] Unbiased sampling of network ensembles
    Squartini, Tiziano
    Mastrandrea, Rossana
    Garlaschelli, Diego
    [J]. NEW JOURNAL OF PHYSICS, 2015, 17
  • [10] QuickScorer: A Fast Algorithm to Rank Documents with Additive Ensembles of Regression Trees
    Lucchese, Claudio
    Nardini, Franco Maria
    Orlando, Salvatore
    Perego, Raffaele
    Tonellotto, Nicola
    Venturini, Rossano
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 73 - 82