ACVPred: Enhanced prediction of anti-coronavirus peptides by transfer learning combined with data augmentation

被引:4
|
作者
Xu, Yi [1 ]
Liu, Tianyuan [2 ,3 ]
Yang, Yu
Kang, Juanjuan [3 ]
Ren, Liping [4 ]
Ding, Hui [5 ]
Zhang, Yang [3 ]
机构
[1] Tsinghua Univ, Tsinghua Peking Ctr Life Sci, Beijing 100084, Peoples R China
[2] Univ Tsukuba, Tsukuba Life Sci Innovat Program, Tsukuba 3058577, Japan
[3] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Acad Interdiscipline, Chengdu 611137, Peoples R China
[4] Chengdu Neusoft Univ, Sch Healthcare Technol, Chengdu 611844, Peoples R China
[5] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Anti-Coronavirus peptide; Transfer learning; Data augmentation; Model interpretation; Motif;
D O I
10.1016/j.future.2024.06.008
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Anti-coronavirus peptides (ACVPs) have garnered significant attention in COVID-19 therapeutic research due to their precise targeting, low risk of drug resistance, flexible synthesis, and effectiveness against viral mutations. Although some in-silico methods have been developed to predict ACVPs, they suffer from challenges such as limited datasets and a lack of interpretability. Hence, this study introduces ACVPred, an algorithm for ACVP prediction, based on two few -shot learning strategies: transfer learning and data augmentation strategies. Our experiments demonstrate that data augmentation can significantly enhance model performance, while transfer learning can effectively prevent overfitting and strengthen generalizability. Compared to existing methods, ACVPred exhibits superior performance and robust generalization both in training and independent test datasets. Moreover, the interpretability study of the model reveals that its transformer -based core can effectively capture key motifs on ACVP sequences, demonstrating strong feature learning capabilities. Additionally, the findings suggest that the sequence feature weights and key motif positions tend to be distributed towards the N -terminal end of ACVP sequences, providing vital clues for the design of ACVPs. In summary, ACVPred is not only a practical and valuable tool for aiding in the design of ACVPs, but its algorithmic concept also serves as an important reference for research on other small sample prediction problems.
引用
收藏
页码:305 / 315
页数:11
相关论文
共 50 条
  • [1] Generative Adversarial Network-Based Data Augmentation Method for Anti-coronavirus Peptides Prediction
    Xu, Jiliang
    Xu, Chungui
    Cao, Ruifen
    He, Yonghui
    Bin, Yannan
    Zheng, Chun-Hou
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 67 - 76
  • [2] A database of anti-coronavirus peptides
    Qianyue Zhang
    Xue Chen
    Bowen Li
    Chunying Lu
    Shanshan Yang
    Jinjin Long
    Heng Chen
    Jian Huang
    Bifang He
    Scientific Data, 9
  • [3] A database of anti-coronavirus peptides
    Zhang, Qianyue
    Chen, Xue
    Li, Bowen
    Lu, Chunying
    Yang, Shanshan
    Long, Jinjin
    Chen, Heng
    Huang, Jian
    He, Bifang
    SCIENTIFIC DATA, 2022, 9 (01)
  • [4] PACVP: Prediction of Anti-Coronavirus Peptides Using a Stacking Learning Strategy With Effective Feature Representation
    Chen, Shouzhi
    Liao, Yanhong
    Zhao, Jianping
    Bin, Yannan
    Zheng, Chunhou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3106 - 3116
  • [5] ACP-Dnnel: anti-coronavirus peptides’ prediction based on deep neural network ensemble learning
    Mingyou Liu
    Hongmei Liu
    Tao Wu
    Yingxue Zhu
    Yuwei Zhou
    Ziru Huang
    Changcheng Xiang
    Jian Huang
    Amino Acids, 2023, 55 : 1121 - 1136
  • [6] ACP-Dnnel: anti-coronavirus peptides' prediction based on deep neural network ensemble learning
    Liu, Mingyou
    Liu, Hongmei
    Wu, Tao
    Zhu, Yingxue
    Zhou, Yuwei
    Huang, Ziru
    Xiang, Changcheng
    Huang, Jian
    AMINO ACIDS, 2023, 55 (09) : 1121 - 1136
  • [7] ACVPICPred: Inhibitory activity prediction of anti-coronavirus peptides based on artificial neural network
    Li, Min
    Wu, Yifei
    Li, Bowen
    Lu, Chunying
    Jian, Guifen
    Shang, Xing
    Chen, Heng
    Huang, Jian
    He, Bifang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 3625 - 3633
  • [8] Identifying anti-coronavirus peptides by incorporating different negative datasets and imbalanced learning strategies
    Pang, Yuxuan
    Wang, Zhuo
    Jhong, Jhih-Hua
    Lee, Tzong-Yi
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (02) : 1085 - 1095
  • [9] Enhanced transfer learning with data augmentation
    Su, Jianjun
    Yu, Xuejiao
    Wang, Xiru
    Wang, Zhijin
    Chao, Guoqing
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [10] EACVP: An ESM-2 LM Framework Combined CNN and CBAM Attention to Predict Anti-coronavirus Peptides
    Zhang, Shengli
    Jing, Yuanyuan
    Liang, Yunyun
    CURRENT MEDICINAL CHEMISTRY, 2024,