Association between split selection instability and predictive error in survival trees

被引:0
|
作者
Radespiel-Troeger, M.
Gefeller, O.
Rabenstein, T.
Hothorn, T.
机构
[1] Univ Erlangen Nurnberg, Dept Med Informat Biometry & Epidemiol, D-91054 Erlangen, Germany
[2] Univ Erlangen Nurnberg, Dept Med 1, Erlangen, Germany
关键词
recursive partitioning; trees; censored data; survival; Brier score;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives: To evaluate split selection instability in six survival tree algorithms and its relationship with predictive error by means of a bootstrap study. Methods: We study the following algorithms: logrank statistic with multivariate p-value adjustment without pruning (LR), Kaplan-Meier distance of survival curves (KM), martingale residuals (MR), Poisson regression for censored data (PR), within-node impurity (WI), and exponential log-likelihood loss (XL). With the exception of LR, initial trees are pruned by using split-complexity, and final trees are selected by means of cross-validation. We employ a real dataset from a clinical study of patients with gallbladder stones. The predictive error is evaluated using the integrated Brier score for censored data. The relationship between split selection instability and predictive error, is evaluated by means of box-percentile plots, covariate and cutpoint selection entropy, and cutpoint selection coefficients of variation, respectively, in the root node. Results: We found a positive association between covariate selection instability and predictive error in the root node. LR yields the lowest predictive error, while KM and MR yield the highest predictive error. Conclusions: The predictive error of survival trees is related to split selection instability. Based on the low predictive error of LR, we recommend the use of this algorithm for the construction of survival trees. Unpruned survival trees with multivariate p-value adjustment can perform equally well compared to pruned trees. The analysis of split selection instability can be used to communicate the results of tree-based analyses to clinicians and to support the application of survival trees.
引用
收藏
页码:548 / 556
页数:9
相关论文
共 50 条
  • [1] SURVIVAL TREES BY GOODNESS OF SPLIT
    LEBLANC, M
    CROWLEY, J
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) : 457 - 467
  • [2] Split selection methods for classification trees
    Loh, WY
    Shih, YS
    [J]. STATISTICA SINICA, 1997, 7 (04) : 815 - 840
  • [3] A note on split selection bias in classification trees
    Shih, YS
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (03) : 457 - 466
  • [4] Comparing split criteria for constructing survival trees
    Radespiel-Tröger, M
    Rabenstein, T
    Höpfner, L
    Schneider, HT
    [J]. EXPLORATORY DATA ANALYSIS IN EMPIRICAL RESEARCH, PROCEEDINGS, 2003, : 357 - 365
  • [5] The relationship between the error catastrophe, survival of the flattest, and natural selection
    Tejero, Hector
    Marin, Arturo
    Montero, Francisco
    [J]. BMC EVOLUTIONARY BIOLOGY, 2011, 11
  • [6] The relationship between the error catastrophe, survival of the flattest, and natural selection
    Héctor Tejero
    Arturo Marín
    Francisco Montero
    [J]. BMC Evolutionary Biology, 11
  • [7] Split criterions for variable selection using decision trees
    Abellan, Joaquin
    Masegosa, Andres R.
    [J]. SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, PROCEEDINGS, 2007, 4724 : 489 - +
  • [8] Unbiased split selection for classification trees based on the Gini Index
    Strobl, Carohn
    Boulesteix, Anne-Laure
    Augustin, Thomas
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 483 - 501
  • [9] Trees for correlated survival data by goodness of split, with applications to tooth prognosis
    Fan, Juanjuan
    Su, Xiao-Gang
    Levine, Richard A.
    Nunn, Martha E.
    LeBlanc, Michael
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (475) : 959 - 967
  • [10] Survival analysis with semi-supervised predictive clustering trees
    Roy, Bijit
    Stepis, Tomaz
    Pooled Resource Open-Access Als Clinical Trials Consortium, The
    Vens, Celine
    Dzeroski, Saso
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 141