Protein Family Classification from Scratch: A CNN Based Deep Learning Approach

Cited by: 18
Authors
Zhang, Da [1]
Kabuka, Mansur R. [2]
Affiliations
[1] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33145 USA
[2] Univ Miami, Coral Gables, FL 33146 USA
Funding
National Institutes of Health (NIH), USA;
Keywords
Proteins; Feature extraction; Amino acids; Hidden Markov models; Deep learning; Data mining; Machine learning algorithms; Protein family classification; convolutional neural network; feature engineering; PREDICTION;
DOI
10.1109/TCBB.2020.2966633
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Next-generation sequencing techniques make it possible to generate sequenced proteins and to identify their biological families and functions. However, uncharacterized proteins still make up a notable percentage of all proteins in the bioinformatics research field. Traditional family classification methods often focus on extracting N-gram features from sequences while ignoring motif information as well as the affinity information between motifs and adjacent amino acids. Previous clustering-based algorithms have typically defined protein features using domain knowledge and annotated protein families based on extensive data samples. In this paper, we apply CNN-based amino acid representation learning with a limited number of characterized proteins and evaluate how well it annotates protein families when amino acid location information is taken into account. Additionally, we evaluate our approach on all reviewed protein sequences and their families retrieved from the UniProt database. Finally, we verify our model on unreviewed protein records, which are typically ignored by other methods.
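The abstract's general idea of convolving over an amino acid sequence can be illustrated with a minimal sketch. This is not the authors' architecture: the one-hot encoding, the filter width, the random filter weights, and the function names `one_hot` and `conv1d_maxpool` are all assumptions made for illustration. It only shows how a 1D convolution responds to a short motif in a sequence.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(seq):
    """Encode a sequence as a (length, 20) matrix; unknown residues stay all-zero."""
    x = np.zeros((len(seq), len(AMINO_ACIDS)), dtype=np.float32)
    for pos, aa in enumerate(seq):
        idx = AA_INDEX.get(aa)
        if idx is not None:
            x[pos, idx] = 1.0
    return x


def conv1d_maxpool(x, filters):
    """Slide each filter along the sequence, apply ReLU, then take the global max.

    x: (length, 20) encoded sequence, with length >= filter width.
    filters: (n_filters, width, 20) weights (hypothetical, untrained here).
    Returns one feature per filter, shape (n_filters,).
    """
    n_filters, width, _ = filters.shape
    n_windows = x.shape[0] - width + 1
    # Stack every window of `width` consecutive residues: (n_windows, width, 20).
    windows = np.stack([x[i:i + width] for i in range(n_windows)])
    # Dot product of every window with every filter: (n_windows, n_filters).
    scores = np.einsum("wkc,fkc->wf", windows, filters)
    return np.maximum(scores, 0.0).max(axis=0)


# Illustrative use with random (untrained) filters of width 3.
rng = np.random.default_rng(0)
filters = rng.normal(size=(4, 3, 20)).astype(np.float32)
features = conv1d_maxpool(one_hot("MKTAYIAKQR"), filters)
print(features.shape)
```

In a trained model the filter weights would be learned from labelled sequences, and the pooled features would feed a classifier over protein families; both steps are omitted here.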
Pages: 1996-2007
Number of pages: 12