A prediction model of nonclassical secreted protein based on deep learning

被引:0
|
作者
Zhang, Fan [1 ,2 ]
Liu, Chaoyang [2 ]
Wang, Binjie [1 ]
He, Yiru [3 ]
Zhang, Xinhong [3 ]
机构
[1] Henan Univ, Huaihe Hosp, Radiol Dept, Kaifeng, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Sch Software, Kaifeng 475004, Peoples R China
关键词
bioinformatics; deep learning; nonclassical secreted protein; prediction; WEB SERVER; PLASMA; CLASSIFICATION;
D O I
10.1002/cem.3553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the current nonclassical proteins prediction methods involve manual feature selection, such as constructing features of samples based on the physicochemical properties of proteins and position-specific scoring matrix (PSSM). However, these tasks require researchers to perform some tedious search work to obtain the physicochemical properties of proteins. This paper proposes an end-to-end nonclassical secreted protein prediction model based on deep learning, named DeepNCSPP, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively. Among the experiments conducted on the independent test dataset, DeepNCSPP achieved excellent results with an accuracy of 88.24%, Matthews coefficient (MCC) of 77.01%, and F1-score of 87.50%. Independent test dataset testing and 10-fold cross-validation show that DeepNCSPP achieves competitive performance with state-of-the-art methods and can be used as a reliable nonclassical secreted protein prediction model. A web server has been constructed for the convenience of researchers. The web link is . The source code of DeepNCSPP has been hosted on GitHub and is available online (). This paper proposes an end-to-end nonclassical secreted protein prediction model DeepNCSPP based on deep learning, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Deep Learning-Based Model for Financial Distress Prediction
    Elhoseny, Mohamed
    Metawa, Noura
    Sztano, Gabor
    El-hasnony, Ibrahim M.
    ANNALS OF OPERATIONS RESEARCH, 2025, 345 (2-3) : 885 - 907
  • [32] QoS Prediction Model of Cloud Services Based on Deep Learning
    WenJun Huang
    PeiYun Zhang
    YuTong Chen
    MengChu Zhou
    Yusuf Al-Turki
    Abdullah Abusorrah
    IEEE/CAAJournalofAutomaticaSinica, 2022, 9 (03) : 564 - 566
  • [33] Deep Learning Based Customer Product Rating Prediction Model
    Park, Yongcheon
    Park, Jeongmin
    Lee, Eunkyong
    Lee, Kyoungchul
    Hong, Jiman
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 203 - 204
  • [34] Research on Rice Yield Prediction Model Based on Deep Learning
    Han, Xiao
    Liu, Fangbiao
    He, Xiaoliang
    Ling, Fenglou
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [35] Optimized deep learning-based prediction model for chiller performance prediction
    Sathesh, Tamilarasan
    Shih, Yang-Cheng
    DATA & KNOWLEDGE ENGINEERING, 2023, 144
  • [36] QoS Prediction Model of Cloud Services Based on Deep Learning
    Huang, WenJun
    Zhang, PeiYun
    Chen, YuTong
    Zhou, MengChu
    Al-Turki, Yusuf
    Abusorrah, Abdullah
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 564 - 566
  • [37] Research on soil moisture prediction model based on deep learning
    Cai, Yu
    Zheng, Wengang
    Zhang, Xin
    Zhangzhong, Lili
    Xue, Xuzhang
    PLOS ONE, 2019, 14 (04):
  • [38] A SHARED ECONOMY DATA PREDICTION MODEL BASED ON DEEP LEARNING
    Zhou, Min
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (06): : 5441 - 5450
  • [39] Blood cancer prediction model based on deep learning technique
    Shehta, Amr I.
    Nasr, Mona
    El Ghazali, Alaa El Din M.
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [40] ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction
    Tubiana, Jerome
    Schneidman-Duhovny, Dina
    Wolfson, Haim J.
    NATURE METHODS, 2022, 19 (06) : 730 - +