Architectures and accuracy of artificial neural network for disease classification from omics data

被引:57
|
作者
Yu, Hui [1 ]
Samuels, David C. [2 ]
Zhao, Ying-yong [3 ]
Guo, Yan [1 ]
机构
[1] Univ New Mexico, Dept Internal Med, Albuquerque, NM 87131 USA
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Sch Med, Vanderbilt Genet Inst, Nashville, TN 37232 USA
[3] Northwest Univ, Sch Life Sci, Key Lab Resource Biol & Biotechnol Western China, Xian 710069, Shaanxi, Peoples R China
关键词
Supervised classification; Cancer diagnosis; Artificial neural network; Deep learning; Omics; MULTILAYER; MICROARRAY;
D O I
10.1186/s12864-019-5546-z
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundDeep learning has made tremendous successes in numerous artificial intelligence applications and is unsurprisingly penetrating into various biomedical domains. High-throughput omics data in the form of molecular profile matrices, such as transcriptomes and metabolomes, have long existed as a valuable resource for facilitating diagnosis of patient statuses/stages. It is timely imperative to compare deep learning neural networks against classical machine learning methods in the setting of matrix-formed omics data in terms of classification accuracy and robustness.ResultsUsing 37 high throughput omics datasets, covering transcriptomes and metabolomes, we evaluated the classification power of deep learning compared to traditional machine learning methods. Representative deep learning methods, Multi-Layer Perceptrons (MLP) and Convolutional Neural Networks (CNN), were deployed and explored in seeking optimal architectures for the best classification performance. Together with five classical supervised classification methods (Linear Discriminant Analysis, Multinomial Logistic Regression, Naive Bayes, Random Forest, Support Vector Machine), MLP and CNN were comparatively tested on the 37 datasets to predict disease stages or to discriminate diseased samples from normal samples. MLPs achieved the highest overall accuracy among all methods tested. More thorough analyses revealed that single hidden layer MLPs with ample hidden units outperformed deeper MLPs. Furthermore, MLP was one of the most robust methods against imbalanced class composition and inaccurate class labels.ConclusionOur results concluded that shallow MLPs (of one or two hidden layers) with ample hidden neurons are sufficient to achieve superior and robust classification performance in exploiting numerical matrix-formed omics data for diagnosis purpose. Specific observations regarding optimal network width, class imbalance tolerance, and inaccurate labeling tolerance will inform future improvement of neural network applications on functional genomics data.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Architectures and accuracy of artificial neural network for disease classification from omics data
    Hui Yu
    David C. Samuels
    Ying-yong Zhao
    Yan Guo
    [J]. BMC Genomics, 20
  • [2] Omics data classification using constitutive artificial neural network optimized with single candidate optimizer
    Madhan, Subramaniam
    Kalaiselvan, Anbarasan
    [J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,
  • [3] ATM data classification with an artificial neural network
    Sundstrom, K
    Rueda, A
    McLeod, RD
    [J]. IEEE WESCANEX 97 COMMUNICATIONS, POWER AND COMPUTING CONFERENCE PROCEEDINGS, 1997, : 276 - 279
  • [4] Optimizing artificial neural network architectures for enhanced soil type classification
    Aydin, Yaren
    Bekdas, Gebrail
    Isikdag, Uemit
    Nigdeli, Sinan Melih
    Geem, Zong Woo
    [J]. GEOMECHANICS AND ENGINEERING, 2024, 37 (03) : 263 - 277
  • [5] PANNA: Properties from Artificial Neural Network Architectures
    Lot, Ruggero
    Pellegrini, Franco
    Shaidu, Yusuf
    Kucukbenli, Emine
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2020, 256
  • [6] Hybrid Artificial Neural Network for Data Classification Problem
    Kaur, Jaspreet
    Kalra, Ashima
    [J]. PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC 2K17), 2017, : 66 - 71
  • [7] Classification of Robotic Data using Artificial Neural Network
    Gopalapillai, Radhakrishnan
    Vidhya, J.
    Gupta, Deepa
    Sudarshan, T. S. B.
    [J]. 2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, : 333 - 337
  • [8] Evolvable binary artificial neural network for data classification
    Starzyk, J
    Pang, J
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 1353 - 1359
  • [9] Neural Network Architectures for Artificial Noses
    Wilamowski, Bogdan M.
    Vodyanoy, Vitaly J.
    [J]. 2008 CONFERENCE ON HUMAN SYSTEM INTERACTIONS, VOLS 1 AND 2, 2008, : 737 - +
  • [10] LEAF DISEASE CLASSIFICATION USING ARTIFICIAL NEURAL NETWORK
    Ishak, Syafiqah
    Rahiman, Mohd Hafiz Fazalul
    Kanafiah, Siti Nurul Aqmariah Mohd
    Saad, Hashim
    [J]. JURNAL TEKNOLOGI, 2015, 77 (17): : 109 - 114