Survival forest with partial least squares for high dimensional censored data

被引:4
|
作者
Zhou, Lifeng [1 ]
Wang, Hong [2 ]
Xu, Qingsong [2 ]
机构
[1] Changsha Univ, Sch Econ & Management, Changsha, Hunan, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha, Hunan, Peoples R China
关键词
Survival ensemble; Partial least squares; Random survival forest; Censored data; Buckley-James transformation; VARIABLE SELECTION; ROTATION FOREST; COX REGRESSION; CLASSIFICATION; MODELS; CLASSIFIERS; PREDICTION; TESTS; TIME;
D O I
10.1016/j.chemolab.2018.05.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Random forest and partial least squares have proved wide applicability in numerous contexts. However, the combination of these versatile tools has seldom been studied. Inspired by a relatively new decision tree ensemble called rotation forest, we introduce a new survival ensemble algorithm using partial least squares regression and the Buckley-James estimator within the framework of random forest. First, the approach taken to cope with the high dimensionality is to reduce the dimension by a random subspace method. Then, censored survival times are imputed by the Buckley-James estimator. After dimension reduction and time imputation, partial least squares regression is applied to extract the features. Similar to rotation forest, all extracted components are used as covariates in a bagged survival tree to predict the survival probabilities. Experimental results on a variety of simulation and real datasets demonstrate that the proposed approach is a strong competitor to other popular survival prediction models under high or ultra-high dimensional setting.
引用
收藏
页码:12 / 21
页数:10
相关论文
共 50 条