A comparative study of machine learning and deep learning algorithms to classify cancer types based on microarray gene expression data

被引:55
|
作者
Tabares-Soto, Reinel [1 ]
Orozco-Arias, Simon [2 ,3 ]
Romero-Cano, Victor [4 ]
Segovia Bucheli, Vanesa [5 ]
Luis Rodriguez-Sotelo, Jose [1 ]
Felipe Jimenez-Varon, Cristian [6 ]
机构
[1] Univ Autonoma Manizales, Dept Elect & Automat, Manizales, Caldas, Colombia
[2] Univ Autonoma Manizales, Dept Comp Sci, Manizales, Caldas, Colombia
[3] Univ Caldas, Dept Syst & Informat, Manizales, Caldas, Colombia
[4] Univ Autonoma Occidente, Dept Automat & Elect, Cali, Valle Del Cauca, Colombia
[5] Dokuz Eylul Univ, Izmir Int Biomed & Genome Inst, Izmir, Turkey
[6] Univ Autonoma Manizales, Dept Phys & Math, Manizales, Caldas, Colombia
关键词
Machine Learning; Deep Learning; Cancer classification; Microarray gene expression; 11_tumor database; Bioinformatics; CLASSIFICATION;
D O I
10.7717/peerj-cs.270
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer classification is a topic of major interest in medicine since it allows accurate and efficient diagnosis and facilitates a successful outcome in medical treatments. Previous studies have classified human tumors using a large-scale RNA profiling and supervised Machine Learning (ML) algorithms to construct a molecular-based classification of carcinoma cells from breast, bladder, adenocarcinoma, colorectal, gastro esophagus, kidney, liver, lung, ovarian, pancreas, and prostate tumors. These datasets are collectively known as the 11_tumor database, although this database has been used in several works in the ML field, no comparative studies of different algorithms can be found in the literature. On the other hand, advances in both hardware and software technologies have fostered considerable improvements in the precision of solutions that use ML, such as Deep Learning (DL). In this study, we compare the most widely used algorithms in classical ML and DL to classify the tumors described in the 11_tumor database. We obtained tumor identification accuracies between 90.6% (Logistic Regression) and 94.43% (Convolutional Neural Networks) using k-fold cross-validation. Also, we show how a tuning process may or may not significantly improve algorithms' accuracies. Our results demonstrate an efficient and accurate classification method based on gene expression (microarray data) and ML/DL algorithms, which facilitates tumor type prediction in a multi-cancer-type scenario.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A comparative study of different machine learning methods on microarray gene expression data
    Mehdi Pirooznia
    Jack Y Yang
    Mary Qu Yang
    Youping Deng
    [J]. BMC Genomics, 9
  • [2] A comparative study of different machine learning methods on microarray gene expression data
    Pirooznia, Mehdi
    Yang, Jack Y.
    Yang, Mary Qu
    Deng, Youping
    [J]. BMC GENOMICS, 2008, 9 (Suppl 1)
  • [3] Gene reduction and machine learning algorithms for cancer classification based on microarray gene expression data: A comprehensive review
    Osama, Sarah
    Shaban, Hassan
    Ali, Abdelmgeid A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [4] Cancer Classification Based on Microarray Gene Expression Data Using Deep Learning
    Guillen, Pablo
    Ebalunode, Jerry
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1403 - 1405
  • [5] Deep learning techniques for cancer classification using microarray gene expression data
    Gupta, Surbhi
    Gupta, Manoj K.
    Shabaz, Mohammad
    Sharma, Ashutosh
    [J]. FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [6] Comparative study and analysis on skin cancer detection using machine learning and deep learning algorithms
    V. Auxilia Osvin Nancy
    P. Prabhavathy
    Meenakshi S. Arya
    B. Shamreen Ahamed
    [J]. Multimedia Tools and Applications, 2023, 82 : 45913 - 45957
  • [7] Comparative Performance of Deep Learning and Machine Learning Algorithms on Imbalanced Handwritten Data
    Amri, A'Inur A'Fifah
    Ismail, Amelia Ritahani
    Zarir, Abdullah Ahmad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (02) : 258 - 264
  • [8] Comparative study and analysis on skin cancer detection using machine learning and deep learning algorithms
    Nancy, V. Auxilia Osvin
    Prabhavathy, P.
    Arya, Meenakshi S.
    Ahamed, B. Shamreen
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (29) : 45913 - 45957
  • [9] A Comparative Study of Different Machine Learning Algorithms on Gene Expression Profile Classification
    Chen, Tao
    Hu, Shengli
    Cui, Man
    Cao, Yang
    Quan, Shuangyan
    Wei, Jun
    Yang, Xiao
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE AND INTERNET TECHNOLOGY, CII 2017, 2017, : 200 - 204
  • [10] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang Fangchao
    Tong Lingling
    Shi Chen
    Zuo Rui
    Wang Liwei
    Wang Yan
    [J]. 母胎医学杂志(英文)., 2024, 06 (03)