PRFE-driven gene selection with multi-classifier ensemble for cancer classification

被引：0

作者：

Behuria, Smitirekha ^{[1
]}

Swain, Sujata ^{[1
]}

Bandyopadhyay, Anjan ^{[1
]}

Al-Sadoon, Mohammad Khalid ^{[2
]}

Mallik, Saurav ^{[3
,4
]}

机构：

[1] Kalinga Inst Ind Technol, Sch Comp Engn, Bhubaneswar 751024, Odisha, India

[2] King Saud Univ, Coll Sci, Dept Zool, POB 2455, Riyadh 11451, Saudi Arabia

[3] Harvard TH Chan Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA

[4] Univ Arizona, Dept Pharmacol & Toxicol, Tucson, MA 85721 USA

来源：

EGYPTIAN INFORMATICS JOURNAL | 2025年 / 30卷

关键词：

Principal recursive feature eliminator (PRFE); Recursive feature elimination; Long short-term memory; LightGBM; CatBoost; Convolutional neural network; Gene expression analysis; BREAST-CANCER; EXPRESSION; ALGORITHM;

D O I：

10.1016/j.eij.2025.100637

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this era, cancer remains a paramount concern due to its pervasive impact on individuals and societies, persistent challenges in treatment and prevention, and the ongoing need for global collaboration and innovation to improve outcomes and reduce its burden. Cancer marked by uncontrolled cell growth is a leading global cause of mortality, necessitating advanced methods for accurate diagnosis. This study introduces an innovative unsupervised feature selection technique Principal Recursive Feature Eliminator (PRFE) for selection of genes and cancer classification. Subsequently, seven different classifiers-Support Vector Machine, Random Forest, CatBoost, Light Gradient Boosting Method, Artificial Neural Network, Convolutional Neural Network, Long Short-Term Memory are used to increase the model's robustness. The proposed approach is evaluated on nine benchmark gene expression datasets with a combination of two different algorithms. A series of experiments are conducted to assess the proposed method, focusing on the selected features and identifying optimal classifiers. We have calculated F1-Score, accuracy, recall, and precision. The suggested strategy performs better than expected, as the results highlight its potential to improve cancer classification techniques with an accuracy of 99.98%. We conclude from this analysis that, across many datasets, the CatBoost and CNN model outperforms the other models. This research contributes to the ongoing efforts to improve diagnostic precision and treatment outcomes in cancer research.

引用

页数：15

共 50 条

[1] Network Traffic Classification Based on Multi-Classifier Selective Ensemble
Tao, Xiaoling
Wang, Yong
Wei, Yi
Long, Ye
RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2015, 8 (02) : 88 - 94
[2] Multi-classifier ensemble based on dynamic weights
Ren, Fuji
Li, Yanqiu
Hu, Min
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (16) : 21083 - 21107
[3] A Multi-Classifier Approach to Fingerprint Classification
Raffaele Cappelli
Dario Maio
Davide Maltoni
Pattern Analysis & Applications, 2002, 5 : 136 - 144
[4] Multi-classifier ensemble based on dynamic weights
Fuji Ren
Yanqiu Li
Min Hu
Multimedia Tools and Applications, 2018, 77 : 21083 - 21107
[5] A multi-classifier approach to fingerprint classification
Cappelli, R
Maio, D
Maltoni, D
PATTERN ANALYSIS AND APPLICATIONS, 2002, 5 (02) : 136 - 144
[6] Webshell detection based on multi-classifier ensemble model
Wenjuan-Lian
Qi-Fan
Dandan-Shi
Qili-An
Jia, Bin
Journal of Computers (Taiwan), 2020, 31 (01): : 242 - 252
[7] Extraction of Larch Plantation Based on Multi-Classifier Ensemble
Ma T.
Li C.
Tang F.
Lü J.
Linye Kexue/Scientia Silvae Sinicae, 2021, 57 (11): : 105 - 118
[8] Multi-classifier framework for lung tissue classification
Dash, Jatindra Kumar
Mukhopadhyay, Sudipta
Garg, Mandeep Kumar
Prabhakar, Nidhi
Khandelwal, Niranjan
2014 IEEE STUDENTS' TECHNOLOGY SYMPOSIUM (IEEE TECHSYM), 2014, : 264 - 269
[9] A multi-classifier system for pulmonary nodule classification
Antonelli, Michela
Cococcioni, Marco
Lazzerini, Beatrice
Marcelloni, Francesco
Stefanescu, Dan
PROCEEDINGS OF THE 21ST IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, 2008, : 587 - +
[10] SVM multi-classifier and web document classification
Liang, JZ
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1347 - 1351

← 1 2 3 4 5 →