Survival forest with partial least squares for high dimensional censored data

被引:4
|
作者
Zhou, Lifeng [1 ]
Wang, Hong [2 ]
Xu, Qingsong [2 ]
机构
[1] Changsha Univ, Sch Econ & Management, Changsha, Hunan, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha, Hunan, Peoples R China
关键词
Survival ensemble; Partial least squares; Random survival forest; Censored data; Buckley-James transformation; VARIABLE SELECTION; ROTATION FOREST; COX REGRESSION; CLASSIFICATION; MODELS; CLASSIFIERS; PREDICTION; TESTS; TIME;
D O I
10.1016/j.chemolab.2018.05.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Random forest and partial least squares have proved wide applicability in numerous contexts. However, the combination of these versatile tools has seldom been studied. Inspired by a relatively new decision tree ensemble called rotation forest, we introduce a new survival ensemble algorithm using partial least squares regression and the Buckley-James estimator within the framework of random forest. First, the approach taken to cope with the high dimensionality is to reduce the dimension by a random subspace method. Then, censored survival times are imputed by the Buckley-James estimator. After dimension reduction and time imputation, partial least squares regression is applied to extract the features. Similar to rotation forest, all extracted components are used as covariates in a bagged survival tree to predict the survival probabilities. Experimental results on a variety of simulation and real datasets demonstrate that the proposed approach is a strong competitor to other popular survival prediction models under high or ultra-high dimensional setting.
引用
收藏
页码:12 / 21
页数:10
相关论文
共 50 条
  • [11] Sparse partial least-squares regression for high-throughput survival data analysis
    Lee, Donghwan
    Lee, Youngjo
    Pawitan, Yudi
    Lee, Woojoo
    [J]. STATISTICS IN MEDICINE, 2013, 32 (30) : 5340 - 5352
  • [12] One component partial least squares, high dimensional regression, data splitting, and the multitude of models
    Olive, David J.
    Zhang, Lingling
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024,
  • [13] Multiset sparse partial least squares path modeling for high dimensional omics data analysis
    Csala, Attila
    Zwinderman, Aeilko H.
    Hof, Michel H.
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [14] Multiset sparse partial least squares path modeling for high dimensional omics data analysis
    Attila Csala
    Aeilko H. Zwinderman
    Michel H. Hof
    [J]. BMC Bioinformatics, 21
  • [15] Partial Least Squares for Heterogeneous Data
    Buhlmann, Peter
    [J]. MULTIPLE FACETS OF PARTIAL LEAST SQUARES AND RELATED METHODS, 2016, 173 : 3 - 15
  • [16] Partial least squares for dependent data
    Singer, Marco
    Krivobokova, Tatyana
    Munk, Axel
    De Groot, Bert
    [J]. BIOMETRIKA, 2016, 103 (02) : 351 - 362
  • [17] Least squares support vector regression for complex censored data
    Liu, Xinrui
    Dong, Xiaogang
    Zhang, Le
    Chen, Jia
    Wang, Chunjie
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 136
  • [18] Iterative least squares method for regression analysis with censored data
    Conde, RF
    Smeyers-Verbeke, J
    Massart, DL
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2004, 23 (02) : 146 - 156
  • [19] Least squares estimators of the regression function with twice censored data
    Kebabi, K.
    Laroussi, I.
    Messaci, F.
    [J]. STATISTICS & PROBABILITY LETTERS, 2011, 81 (11) : 1588 - 1593
  • [20] Random Forest Regression Based on Partial Least Squares
    Hao, Zhulin
    Du, Jianqiang
    Nie, Bin
    Yu, Fang
    Yu, Riyue
    Xiong, Wangping
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127