A Novel Leukemia Gene Features Extraction and Selection Technique for Robust Type Prediction Using Machine Learning

被引:0
|
作者
Ilyas, Mahwish [1 ]
Aamir, Khalid Mahmood [1 ]
Jaleel, Abdul [2 ]
Deriche, Mohamed [3 ]
机构
[1] Univ Sargodha, Dept Comp Sci & Informat Technol, Sargodha 40162, Punjab, Pakistan
[2] Univ Engn & Technol, Dept Comp Sci, GRW, RCET, Lahore, Pakistan
[3] Ajman Univ, Coll Engn & Informat Technol, Artificial Intelligence Res Ctr AIRC, Ajman, U Arab Emirates
关键词
Leukemia prediction; Gene features extraction; Linear discriminant analysis; Dimensionality reduction; LINEAR DISCRIMINANT-ANALYSIS; EXPRESSION DATA; CLASSIFICATION; ALGORITHM; HYBRID;
D O I
10.1007/s13369-024-09254-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The broad term 'leukemia' refers to different types of cancer related to blood cells. Detecting and identifying the specific type of leukemia continues to be a major challenge in the medical field. Diverse machine learning techniques can be vital in analyzing gene expression data from microarray experiments in cancer research related to leukemia. In particular, the Leukemia Gene Expression data from the Curated Microarray Database (CuMiDa) is used here. Microarrays can be challenging in determining expression patterns. In this work, we use Fisher's linear discriminant analysis, a popular technique for dimensionality reduction, together with a new feature selection approach to predict leukemia using microarray data. Our machine learning model is used to predict five types of leukemia including AML, PBSC CD34, Bone Marrow, and CD34 from the bone marrow. This is achieved by first rescaling the data features. We then use a feature selection technique to obtain the 25 most significant features from the dataset's 22,283 features, then further reduce the dimension to 5 features only, to reduce computational complexity. These features are then fed into a Fisher's linear discriminant module and a likelihood-based index for classification. The overall performance of our model was excellent. We examine the results using 2, 4, 5, 6, and 7 selected features. The best classification accuracies are 89.6%, 96.92%, and 96.15%, for 2, 5, and 7 selected features, respectively. Our results outperform the state-of-the-art by about 4%, with an excellent task completion time of less than 100 ms.
引用
收藏
页码:16845 / 16863
页数:19
相关论文
共 50 条
  • [31] Prediction of groundwater quality using efficient machine learning technique
    Singha, Sudhakar
    Pasupuleti, Srinivas
    Singha, Soumya S.
    Singh, Rambabu
    Kumar, Suresh
    CHEMOSPHERE, 2021, 276
  • [32] PREDICTION OF LUNG CANCER USING MACHINE LEARNING TECHNIQUE: A SURVEY
    Kumar, M. Siddardha
    Rao, K. Venkata
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [33] Prediction of the CNC Tool Wear Using the Machine Learning Technique
    Lee, Kangbae
    Park, Sungho
    Sung, Sangha
    Park, Domyeong
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 296 - 299
  • [34] Chatter prediction in boring process using machine learning technique
    Saravanamurugan S.
    Thiyagu S.
    Sakthivel N.R.
    Nair B.B.
    Saravanamurugan, S. (s_saravana@cb.amrita.edu), 2017, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (12) : 405 - 422
  • [35] A study of job involvement prediction using machine learning technique
    Choi, Youngkeun
    Choi, Jae Won
    INTERNATIONAL JOURNAL OF ORGANIZATIONAL ANALYSIS, 2021, 29 (03) : 788 - 800
  • [36] Lameness prediction in broiler chicken using a machine learning technique
    Naas, Irenilza de Alencar
    Lima, Nilsa Duarte da Silva
    Goncalves, Rodrigo Franco
    de Lima, Luiz Antonio
    Ungaro, Henry
    Abe, Jair Minoro
    INFORMATION PROCESSING IN AGRICULTURE, 2021, 8 (03): : 409 - 418
  • [37] Prediction of Brain Tumor Progression using a Machine Learning Technique
    Shen, Yufei
    Banerjee, Debrup
    Li, Jiang
    Chandler, Adam
    Shen, Yuzhong
    McKenzie, Frederic D.
    Wang, Jihong
    MEDICAL IMAGING 2010: COMPUTER - AIDED DIAGNOSIS, 2010, 7624
  • [38] PREDICTION OF OBSTRUCTIVE SLEEP APNEA USING MACHINE LEARNING TECHNIQUE
    Huang, W.
    Lee, P.
    Liu, Y.
    Lai, F.
    SLEEP, 2018, 41 : A186 - A186
  • [39] Rice Crop Disease Prediction Using Machine Learning Technique
    Patel, Bharati
    Sharaff, Aakanksha
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2021, 12 (04)
  • [40] Prediction of Integrated Water Vapor Using a Machine Learning Technique
    Bisht, Deepak S.
    Rao, T. Narayana
    Rao, N. Rama
    Chandrakanth, S. V.
    Sharma, Akshit
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19