Optimized gene selection and classification of cancer from microarray gene expression data using deep learning

被引:39
|
作者
Shah, Shamveel Hussain [1 ]
Iqbal, Muhammad Javed [1 ]
Ahmad, Iftikhar [2 ]
Khan, Suleman [3 ]
Rodrigues, Joel J. P. C. [4 ,5 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci, Taxila, Pakistan
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah, Saudi Arabia
[3] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[4] Fed Univ Piaui UFPI, Teresina, PI, Brazil
[5] Inst Telecomunicacoes, Covilha, Portugal
关键词
Microarray data; Deep learning; Laplacian score (LS); Convolutional neural network (CNN); MACHINE; HYBRID;
D O I
10.1007/s00521-020-05367-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer is the major leading reason of death around the world. However, the early identification and prediction of a cancer type is very critical for patient's health. Recently, microarray gene expression data was utilized for efficient and early diagnosis of cancer. Previous work shows that microarray data has two major issues which are high dimensionality and small sample size. Several researchers have analyzed and evaluated the cancer classification problem using different statistical and machine learning-based approaches but there are still some issues with these approaches that make cancer classification a nontrivial task. Such as, the inability of certain machine learning algorithms to use unstructured data has limited their utility in the cancer classification process. Convolutional neural networks are proven to very suitable to analyze variety of unstructured data. This ability allowed the deep learning algorithms to play a vibrant part in early detection of cancer through data classification. In this research, a hybrid deep learning model based on Laplacian Score-Convolutional Neural Network (LS-CNN) is employed for the classification of given cancer's data. The performance of the proposed system was evaluated on 10 different benchmark datasets using various performance measurement metrics such as accuracy and confusion matrix. The experimental results conclude that proposed LS-CNN model outperformed compared to traditional machine learning and recently used deep learning approaches.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Lung cancer classification based on enhanced deep learning using gene expression data
    Yuvaraj, V.
    Maheswari, D.
    [J]. Measurement: Sensors, 2023, 30
  • [22] Optimized kernel machines for cancer classification using gene expression data
    Xiong, H
    Chen, XW
    [J]. Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 268 - 274
  • [23] Gene selection and classification for cancer microarray data based on machine learning and similarity measures
    Qingzhong Liu
    Andrew H Sung
    Zhongxue Chen
    Jianzhong Liu
    Lei Chen
    Mengyu Qiao
    Zhaohui Wang
    Xudong Huang
    Youping Deng
    [J]. BMC Genomics, 12
  • [24] Gene selection and classification for cancer microarray data based on machine learning and similarity measures
    Liu, Qingzhong
    Sung, Andrew H.
    Chen, Zhongxue
    Liu, Jianzhong
    Chen, Lei
    Qiao, Mengyu
    Wang, Zhaohui
    Huang, Xudong
    Deng, Youping
    [J]. BMC GENOMICS, 2011, 12
  • [25] Gene selection in microarray data analysis for brain cancer classification
    Leung, Y. Y.
    Chang, C. Q.
    Hung, Y. S.
    Fung, P. C. W.
    [J]. 2006 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2006, : 99 - +
  • [26] A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification
    Almugren, Nada
    Alshamlan, Hala
    [J]. IEEE ACCESS, 2019, 7 : 78533 - 78548
  • [27] A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data
    Wang, Hong
    Jing, Xingjian
    Niu, Ben
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 126 : 8 - 19
  • [28] Feature selection methods on gene expression microarray data for cancer classification: A systematic review
    Alhenawi, Esra'a
    Al-Sayyed, Rizik
    Hudaib, Amjad
    Mirjalili, Seyedali
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
  • [29] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    [J]. AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [30] Analysis of Microarray Gene Expression Data Using Various Feature Selection and Classification Techniques
    Singh, W. Jai
    Kavitha, R. K.
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): : 105 - 108