A two-stage stacked-based heterogeneous ensemble learning for cancer survival prediction

被引:0
|
作者
Fangzhou Yan
Yi Feng
机构
[1] Sichuan University,College of Electrical Engineering
[2] Sichuan University,Business School
来源
关键词
Stacked generalization strategy; Cancer survival prediction; Feature selection; Heterogeneous ensemble learning; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext];
D O I
暂无
中图分类号
学科分类号
摘要
Cancer survival prediction is one of the three major tasks of cancer prognosis. To improve the accuracy of cancer survival prediction, in this paper, we propose a priori knowledge- and stability-based feature selection (PKSFS) method and develop a novel two-stage heterogeneous stacked ensemble learning model (BQAXR) to predict the survival status of cancer patients. Specifically, PKSFS first obtains the optimal feature subsets from the high-dimensional cancer datasets to guide the subsequent model construction. Then, BQAXR seeks to generate five high-quality heterogeneous learners, among which the shortcomings of the learners are overcome by using improved methods, and integrate them in two stages through the stacked generalization strategy based on optimal feature subsets. To verify the merits of PKSFS and BQAXR, this paper collected the real survival datasets of gastric cancer and skin cancer from the Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute, and conducted extensive numerical experiments from different perspectives based on these two datasets. The accuracy and AUC of the proposed method are 0.8209 and 0.8203 in the gastric cancer dataset, and 0.8336 and 0.8214 in the skin cancer dataset. The results show that PKSFS has marked advantages over popular feature selection methods in processing high-dimensional datasets. By taking full advantage of heterogeneous high-quality learners, BQAXR is not only superior to mainstream machine learning methods, but also outperforms improved machine learning methods, which indicates can effectively improve the accuracy of cancer survival prediction and provide a reference for doctors to make medical decisions.
引用
收藏
页码:4619 / 4639
页数:20
相关论文
共 50 条
  • [1] A two-stage stacked-based heterogeneous ensemble learning for cancer survival prediction
    Yan, Fangzhou
    Feng, Yi
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 4619 - 4639
  • [2] Heterogeneous defect prediction with two-stage ensemble learning
    Zhiqiang Li
    Xiao-Yuan Jing
    Xiaoke Zhu
    Hongyu Zhang
    Baowen Xu
    Shi Ying
    [J]. Automated Software Engineering, 2019, 26 : 599 - 651
  • [3] Heterogeneous defect prediction with two-stage ensemble learning
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Zhang, Hongyu
    Xu, Baowen
    Ying, Shi
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2019, 26 (03) : 599 - 651
  • [4] SELF: a stacked-based ensemble learning framework for breast cancer classification
    Jakhar, Amit Kumar
    Gupta, Aman
    Singh, Mrityunjay
    [J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (03) : 1341 - 1356
  • [5] Software defect prediction using stacked denoising autoencoders and two-stage ensemble learning
    Tong, Haonan
    Liu, Bin
    Wang, Shihai
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 96 : 94 - 111
  • [6] Stacked-Based Ensemble Machine Learning Model for Positioning Footballer
    Buyrukoglu, Selim
    Savas, Serkan
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1371 - 1383
  • [7] Carbon trading price prediction based on a two-stage heterogeneous ensemble method
    Cui, Shaoze
    Wang, Dujuan
    Yin, Yunqiang
    Fan, Xin
    Dhamotharan, Lalitha
    Kumar, Ajay
    [J]. ANNALS OF OPERATIONS RESEARCH, 2022,
  • [8] Stacked-Based Ensemble Machine Learning Model for Positioning Footballer
    Selim Buyrukoğlu
    Serkan Savaş
    [J]. Arabian Journal for Science and Engineering, 2023, 48 : 1371 - 1383
  • [9] A tree ensemble-based two-stage model for advanced-stage colorectal cancer survival prediction
    Wang, Yuyan
    Wang, Dujuan
    Ye, Xin
    Wang, Yanzhang
    Yin, Yunqiang
    Jin, Yaochu
    [J]. INFORMATION SCIENCES, 2019, 474 : 106 - 124
  • [10] Two-stage stacking heterogeneous ensemble learning method for gasoline octane number loss prediction
    Cui, Shaoze
    Qiu, Huaxin
    Wang, Sutong
    Wang, Yanzhang
    [J]. APPLIED SOFT COMPUTING, 2021, 113