PRFE-driven gene selection with multi-classifier ensemble for cancer classification

Cited by: 0
Authors
Behuria, Smitirekha [1]
Swain, Sujata [1]
Bandyopadhyay, Anjan [1]
Al-Sadoon, Mohammad Khalid [2]
Mallik, Saurav [3, 4]
Affiliations
[1] Kalinga Inst Ind Technol, Sch Comp Engn, Bhubaneswar 751024, Odisha, India
[2] King Saud Univ, Coll Sci, Dept Zool, POB 2455, Riyadh 11451, Saudi Arabia
[3] Harvard TH Chan Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[4] Univ Arizona, Dept Pharmacol & Toxicol, Tucson, AZ 85721 USA
Keywords
Principal recursive feature eliminator (PRFE); Recursive feature elimination; Long short-term memory; LightGBM; CatBoost; Convolutional neural network; Gene expression analysis; BREAST-CANCER; EXPRESSION; ALGORITHM;
DOI
10.1016/j.eij.2025.100637
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cancer, marked by uncontrolled cell growth, is a leading cause of mortality worldwide. It remains a major concern because of its pervasive impact on individuals and societies, persistent challenges in treatment and prevention, and the continuing need for global collaboration and innovation to improve outcomes and reduce its burden, all of which call for advanced methods for accurate diagnosis. This study introduces an unsupervised feature selection technique, the Principal Recursive Feature Eliminator (PRFE), for gene selection and cancer classification. Seven classifiers (Support Vector Machine, Random Forest, CatBoost, Light Gradient Boosting Machine (LightGBM), Artificial Neural Network, Convolutional Neural Network (CNN), and Long Short-Term Memory) are then used to increase the model's robustness. The proposed approach is evaluated on nine benchmark gene expression datasets with a combination of two different algorithms. A series of experiments assesses the proposed method, focusing on the selected features and on identifying the optimal classifiers, with F1-score, accuracy, recall, and precision as the evaluation metrics. The proposed strategy performs better than expected: the results highlight its potential to improve cancer classification techniques, with a reported accuracy of 99.98%. The analysis concludes that, across many datasets, the CatBoost and CNN models outperform the other models. This research contributes to ongoing efforts to improve diagnostic precision and treatment outcomes in cancer research.
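The general workflow the abstract describes (select a subset of genes, train several classifiers on it, then report accuracy, precision, recall, and F1-score) can be illustrated with a minimal sketch. This is not the authors' PRFE method, whose details the abstract does not give; it swaps in standard, supervised recursive feature elimination (`RFE` from scikit-learn), whereas the abstract describes PRFE as unsupervised. The synthetic data, the choice of 20 retained features, and the two classifiers shown are all assumptions made for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: synthetic stand-in for a gene expression matrix (samples x genes).
X, y = make_classification(
    n_samples=120, n_features=200, n_informative=10, n_redundant=0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Standardize using statistics from the training split only.
scaler = StandardScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)

# Standard RFE (not PRFE): repeatedly fit a linear SVM and drop the
# lowest-weighted features, 10% of the original count per iteration,
# until 20 remain.
selector = RFE(SVC(kernel="linear"), n_features_to_select=20, step=0.1)
selector.fit(X_train_s, y_train)
X_train_sel = selector.transform(X_train_s)
X_test_sel = selector.transform(X_test_s)

# Two of the seven classifier families named in the abstract.
classifiers = {
    "SVM": SVC(kernel="linear"),
    "RandomForest": RandomForestClassifier(n_estimators=200, random_state=0),
}

results = {}
for name, clf in classifiers.items():
    clf.fit(X_train_sel, y_train)
    y_pred = clf.predict(X_test_sel)
    results[name] = {
        "accuracy": accuracy_score(y_test, y_pred),
        "precision": precision_score(y_test, y_pred),
        "recall": recall_score(y_test, y_pred),
        "f1": f1_score(y_test, y_pred),
    }

for name, metrics in results.items():
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```

Fitting the selector on the training split only, as above, keeps the held-out samples from influencing which features are retained, which matters when the number of genes far exceeds the number of samples.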
Pages: 15