CNN-MGP: Convolutional Neural Networks for Metagenomics Gene Prediction

被引:38
|
作者
Al-Ajlan, Amani [1 ]
El Allali, Achraf [1 ]
机构
[1] King Saud Univ, Dept Comp Sci, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
关键词
Gene prediction; Metagenomics; ORF; Convolutional neural network; Deep learning; DEEP; GENOME;
D O I
10.1007/s12539-018-0313-4
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate gene prediction in metagenomics fragments is a computationally challenging task due to the short-read length, incomplete, and fragmented nature of the data. Most gene-prediction programs are based on extracting a large number of features and then applying statistical approaches or supervised classification approaches to predict genes. In our study, we introduce a convolutional neural network for metagenomics gene prediction (CNN-MGP) program that predicts genes in metagenomics fragments directly from raw DNA sequences, without the need for manual feature extraction and feature selection stages. CNN-MGP is able to learn the characteristics of coding and non-coding regions and distinguish coding and non-coding open reading frames (ORFs). We train 10 CNN models on 10 mutually exclusive datasets based on pre-defined GC content ranges. We extract ORFs from each fragment; then, the ORFs are encoded numerically and inputted into an appropriate CNN model based on the fragment-GC content. The output from the CNN is the probability that an ORF will encode a gene. Finally, a greedy algorithm is used to select the final gene list. Overall, CNN-MGP is effective and achieves a 91% accuracy on testing dataset. CNN-MGP shows the ability of deep learning to predict genes in metagenomics fragments, and it achieves an accuracy higher than or comparable to state-of-the-art gene-prediction programs that use pre-defined features.
引用
收藏
页码:628 / 635
页数:8
相关论文
共 50 条
  • [1] CNN-MGP: Convolutional Neural Networks for Metagenomics Gene Prediction
    Amani Al-Ajlan
    Achraf El Allali
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2019, 11 : 628 - 635
  • [2] Phylogenetic convolutional neural networks in metagenomics
    Fioravanti, Diego
    Giarratano, Ylenia
    Maggio, Valerio
    Agostinelli, Claudio
    Chierici, Marco
    Jurman, Giuseppe
    Furlanello, Cesare
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [3] Phylogenetic convolutional neural networks in metagenomics
    Diego Fioravanti
    Ylenia Giarratano
    Valerio Maggio
    Claudio Agostinelli
    Marco Chierici
    Giuseppe Jurman
    Cesare Furlanello
    [J]. BMC Bioinformatics, 19
  • [4] AMPred-CNN: Ames mutagenicity prediction model based on convolutional neural networks
    Tran, Thi Tuyet Van
    Tayara, Hilal
    Chong, Kil To
    [J]. Computers in Biology and Medicine, 2024, 176
  • [5] C-CNN: Contourlet Convolutional Neural Networks
    Liu, Mengkun
    Jiao, Licheng
    Liu, Xu
    Li, Lingling
    Liu, Fang
    Yang, Shuyuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2636 - 2649
  • [6] CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization
    Wang, Zijie J.
    Turko, Robert
    Shaikh, Omar
    Park, Haekyu
    Das, Nilaksh
    Hohman, Fred
    Kahng, Minsuk
    Chau, Duen Horng
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1396 - 1406
  • [7] CNN 101: Interactive Visual Learning for Convolutional Neural Networks
    Wang, Zijie J.
    Turko, Robert
    Shaikh, Omar
    Park, Haekyu
    Das, Nilaksh
    Hohman, Fred
    Kahng, Minsuk
    Chau, Duen Horng
    [J]. CHI'20: EXTENDED ABSTRACTS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2020,
  • [8] A-CNN: Annularly Convolutional Neural Networks on Point Clouds
    Komarichev, Artem
    Zhong, Zichun
    Hua, Jing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7413 - 7422
  • [9] Review on Convolutional Neural Networks (CNN) in vegetation remote sensing
    Kattenborn, Teja
    Leitloff, Jens
    Schiefer, Felix
    Hinz, Stefan
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 173 : 24 - 49
  • [10] Classification of House Categories Using Convolutional Neural Networks (CNN)
    Viratkapan, Vichai
    Mruetusatorn, Saprangsit
    [J]. 2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022), 2022, : 509 - 514