A Novel Leukemia Gene Features Extraction and Selection Technique for Robust Type Prediction Using Machine Learning

被引：0

作者：

Ilyas, Mahwish ^{[1
]}

Aamir, Khalid Mahmood ^{[1
]}

Jaleel, Abdul ^{[2
]}

Deriche, Mohamed ^{[3
]}

机构：

[1] Univ Sargodha, Dept Comp Sci & Informat Technol, Sargodha 40162, Punjab, Pakistan

[2] Univ Engn & Technol, Dept Comp Sci, GRW, RCET, Lahore, Pakistan

[3] Ajman Univ, Coll Engn & Informat Technol, Artificial Intelligence Res Ctr AIRC, Ajman, U Arab Emirates

来源：

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2024年 / 49卷 / 12期

关键词：

Leukemia prediction; Gene features extraction; Linear discriminant analysis; Dimensionality reduction; LINEAR DISCRIMINANT-ANALYSIS; EXPRESSION DATA; CLASSIFICATION; ALGORITHM; HYBRID;

D O I：

10.1007/s13369-024-09254-5

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

The broad term 'leukemia' refers to different types of cancer related to blood cells. Detecting and identifying the specific type of leukemia continues to be a major challenge in the medical field. Diverse machine learning techniques can be vital in analyzing gene expression data from microarray experiments in cancer research related to leukemia. In particular, the Leukemia Gene Expression data from the Curated Microarray Database (CuMiDa) is used here. Microarrays can be challenging in determining expression patterns. In this work, we use Fisher's linear discriminant analysis, a popular technique for dimensionality reduction, together with a new feature selection approach to predict leukemia using microarray data. Our machine learning model is used to predict five types of leukemia including AML, PBSC CD34, Bone Marrow, and CD34 from the bone marrow. This is achieved by first rescaling the data features. We then use a feature selection technique to obtain the 25 most significant features from the dataset's 22,283 features, then further reduce the dimension to 5 features only, to reduce computational complexity. These features are then fed into a Fisher's linear discriminant module and a likelihood-based index for classification. The overall performance of our model was excellent. We examine the results using 2, 4, 5, 6, and 7 selected features. The best classification accuracies are 89.6%, 96.92%, and 96.15%, for 2, 5, and 7 selected features, respectively. Our results outperform the state-of-the-art by about 4%, with an excellent task completion time of less than 100 ms.

引用

页码：16845 / 16863

页数：19

共 50 条

[31] Prediction of groundwater quality using efficient machine learning technique
Singha, Sudhakar
Pasupuleti, Srinivas
Singha, Soumya S.
Singh, Rambabu
Kumar, Suresh
CHEMOSPHERE, 2021, 276
[32] PREDICTION OF LUNG CANCER USING MACHINE LEARNING TECHNIQUE: A SURVEY
Kumar, M. Siddardha
Rao, K. Venkata
2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
[33] Prediction of the CNC Tool Wear Using the Machine Learning Technique
Lee, Kangbae
Park, Sungho
Sung, Sangha
Park, Domyeong
2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 296 - 299
[34] Chatter prediction in boring process using machine learning technique
Saravanamurugan S.
Thiyagu S.
Sakthivel N.R.
Nair B.B.
Saravanamurugan, S. (s_saravana@cb.amrita.edu), 2017, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (12) : 405 - 422
[35] A study of job involvement prediction using machine learning technique
Choi, Youngkeun
Choi, Jae Won
INTERNATIONAL JOURNAL OF ORGANIZATIONAL ANALYSIS, 2021, 29 (03) : 788 - 800
[36] Lameness prediction in broiler chicken using a machine learning technique
Naas, Irenilza de Alencar
Lima, Nilsa Duarte da Silva
Goncalves, Rodrigo Franco
de Lima, Luiz Antonio
Ungaro, Henry
Abe, Jair Minoro
INFORMATION PROCESSING IN AGRICULTURE, 2021, 8 (03): : 409 - 418
[37] Prediction of Brain Tumor Progression using a Machine Learning Technique
Shen, Yufei
Banerjee, Debrup
Li, Jiang
Chandler, Adam
Shen, Yuzhong
McKenzie, Frederic D.
Wang, Jihong
MEDICAL IMAGING 2010: COMPUTER - AIDED DIAGNOSIS, 2010, 7624
[38] PREDICTION OF OBSTRUCTIVE SLEEP APNEA USING MACHINE LEARNING TECHNIQUE
Huang, W.
Lee, P.
Liu, Y.
Lai, F.
SLEEP, 2018, 41 : A186 - A186
[39] Rice Crop Disease Prediction Using Machine Learning Technique
Patel, Bharati
Sharaff, Aakanksha
INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2021, 12 (04)
[40] Prediction of Integrated Water Vapor Using a Machine Learning Technique
Bisht, Deepak S.
Rao, T. Narayana
Rao, N. Rama
Chandrakanth, S. V.
Sharma, Akshit
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

← 1 2 3 4 5 →