A two-stage stacked-based heterogeneous ensemble learning for cancer survival prediction

被引:0
|
作者
Fangzhou Yan
Yi Feng
机构
[1] Sichuan University,College of Electrical Engineering
[2] Sichuan University,Business School
来源
关键词
Stacked generalization strategy; Cancer survival prediction; Feature selection; Heterogeneous ensemble learning; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext];
D O I
暂无
中图分类号
学科分类号
摘要
Cancer survival prediction is one of the three major tasks of cancer prognosis. To improve the accuracy of cancer survival prediction, in this paper, we propose a priori knowledge- and stability-based feature selection (PKSFS) method and develop a novel two-stage heterogeneous stacked ensemble learning model (BQAXR) to predict the survival status of cancer patients. Specifically, PKSFS first obtains the optimal feature subsets from the high-dimensional cancer datasets to guide the subsequent model construction. Then, BQAXR seeks to generate five high-quality heterogeneous learners, among which the shortcomings of the learners are overcome by using improved methods, and integrate them in two stages through the stacked generalization strategy based on optimal feature subsets. To verify the merits of PKSFS and BQAXR, this paper collected the real survival datasets of gastric cancer and skin cancer from the Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute, and conducted extensive numerical experiments from different perspectives based on these two datasets. The accuracy and AUC of the proposed method are 0.8209 and 0.8203 in the gastric cancer dataset, and 0.8336 and 0.8214 in the skin cancer dataset. The results show that PKSFS has marked advantages over popular feature selection methods in processing high-dimensional datasets. By taking full advantage of heterogeneous high-quality learners, BQAXR is not only superior to mainstream machine learning methods, but also outperforms improved machine learning methods, which indicates can effectively improve the accuracy of cancer survival prediction and provide a reference for doctors to make medical decisions.
引用
收藏
页码:4619 / 4639
页数:20
相关论文
共 50 条
  • [21] A two-stage sampling based ensemble learning method for hyperspectral image classification
    Peng, Yanbin
    Zheng, Zhijun
    [J]. Journal of Computational Information Systems, 2015, 11 (14): : 5135 - 5142
  • [22] A Two-Stage Learning Method for Response Prediction
    Chen, Kuan-Hsi
    Ting, Zih-Yun
    Shen, Jia-Ying
    Hu, Yuh-Jyh
    Liang, Tyne
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1336 - 1341
  • [23] Learning Algorithm in Two-Stage Selective Prediction
    Ye, Weicheng
    Chen, Dangxing
    Ramazanli, Ilqar
    [J]. 2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 512 - 521
  • [24] A Novel Ensemble Machine Learning Model for Oil Production Prediction with Two-Stage Data Preprocessing
    Fan, Zhe
    Liu, Xiusen
    Wang, Zuoqian
    Liu, Pengcheng
    Wang, Yanwei
    [J]. PROCESSES, 2024, 12 (03)
  • [25] A novel deep learning ensemble model based on two-stage feature selection and intelligent optimization for water quality prediction
    Liu, Wenli
    Liu, Tianxiang
    Liu, Zihan
    Luo, Hanbin
    Pei, Hanmin
    [J]. ENVIRONMENTAL RESEARCH, 2023, 224
  • [26] HETEROGENEOUS IMAGE CHANGE DETECTION BASED ON TWO-STAGE JOINT FEATURE LEARNING
    Han, Te
    Tang, Yuqi
    Chen, Yuzeng
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3215 - 3218
  • [27] A two-stage credit risk scoring method with stacked-generalisation ensemble learning in peer-to-peer lending
    Wang, Chongren
    Liu, Qigang
    Li, Shuping
    [J]. INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2022, 15 (02) : 158 - 166
  • [28] A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning
    Xu, Ying
    Yan, Cuijuan
    Peng, Shaoliang
    Nojima, Yusuke
    [J]. APPLIED INTELLIGENCE, 2020, 50 (11) : 3852 - 3867
  • [29] A Two-Stage Prediction Model in Breast Cancer Using Machine Learning Methods
    Wang, Jishuai
    Gu, De
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 : 43 - 44
  • [30] A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning
    Ying Xu
    Cuijuan Yang
    Shaoliang Peng
    Yusuke Nojima
    [J]. Applied Intelligence, 2020, 50 : 3852 - 3867