ACVPred: Enhanced prediction of anti-coronavirus peptides by transfer learning combined with data augmentation

被引:4
|
作者
Xu, Yi [1 ]
Liu, Tianyuan [2 ,3 ]
Yang, Yu
Kang, Juanjuan [3 ]
Ren, Liping [4 ]
Ding, Hui [5 ]
Zhang, Yang [3 ]
机构
[1] Tsinghua Univ, Tsinghua Peking Ctr Life Sci, Beijing 100084, Peoples R China
[2] Univ Tsukuba, Tsukuba Life Sci Innovat Program, Tsukuba 3058577, Japan
[3] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Acad Interdiscipline, Chengdu 611137, Peoples R China
[4] Chengdu Neusoft Univ, Sch Healthcare Technol, Chengdu 611844, Peoples R China
[5] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Anti-Coronavirus peptide; Transfer learning; Data augmentation; Model interpretation; Motif;
D O I
10.1016/j.future.2024.06.008
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Anti-coronavirus peptides (ACVPs) have garnered significant attention in COVID-19 therapeutic research due to their precise targeting, low risk of drug resistance, flexible synthesis, and effectiveness against viral mutations. Although some in-silico methods have been developed to predict ACVPs, they suffer from challenges such as limited datasets and a lack of interpretability. Hence, this study introduces ACVPred, an algorithm for ACVP prediction, based on two few -shot learning strategies: transfer learning and data augmentation strategies. Our experiments demonstrate that data augmentation can significantly enhance model performance, while transfer learning can effectively prevent overfitting and strengthen generalizability. Compared to existing methods, ACVPred exhibits superior performance and robust generalization both in training and independent test datasets. Moreover, the interpretability study of the model reveals that its transformer -based core can effectively capture key motifs on ACVP sequences, demonstrating strong feature learning capabilities. Additionally, the findings suggest that the sequence feature weights and key motif positions tend to be distributed towards the N -terminal end of ACVP sequences, providing vital clues for the design of ACVPs. In summary, ACVPred is not only a practical and valuable tool for aiding in the design of ACVPs, but its algorithmic concept also serves as an important reference for research on other small sample prediction problems.
引用
收藏
页码:305 / 315
页数:11
相关论文
共 50 条
  • [21] Enhanced Nasal Deposition and Anti-Coronavirus Effect of Favipiravir-Loaded Mucoadhesive Chitosan-Alginate Nanoparticles
    Alcantara, Khent Primo
    Nalinratana, Nonthaneth
    Chutiwitoonchai, Nopporn
    Castillo, Agnes L.
    Banlunara, Wijit
    Vajragupta, Opa
    Rojsitthisak, Pornchai
    Rojsitthisak, Pranee
    PHARMACEUTICS, 2022, 14 (12)
  • [22] Enhanced phase prediction of high-entropy alloys through machine learning and data augmentation
    Wu, Song
    Song, Zihao
    Wang, Jianwei
    Niu, Xiaobin
    Chen, Haiyuan
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2025, 27 (02) : 717 - 729
  • [23] DATA AUGMENTATION FOR DISCHARGING TIME PREDICTION OF PARTICLE FLOW: A DEEP LEARNING AND STYLE TRANSFER APPROACH
    Wu, Mengqi
    Liu, Yang
    Li, Bin
    Zhang, Zhen
    Gui, Nan
    Tu, Jiyuan
    PROCEEDINGS OF 2024 31ST INTERNATIONAL CONFERENCE ON NUCLEAR ENGINEERING, VOL 6, ICONE31 2024, 2024,
  • [24] Transfer Learning and Data Augmentation in the Diagnosis of Knee MRI
    Haddadian, John
    Balamurali, Mehala
    AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 452 - 463
  • [25] Brain Tumor Diagnosis with Transfer Learning and Data Augmentation
    Kavi, Deniz
    2021 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2021), 2021, : 69 - 73
  • [26] Data Augmentation for Face Recognition with CNN Transfer Learning
    Uchoa, Valeska
    Aires, Kelson
    Veras, Rodrigo
    Paiva, Anselmo
    Britto, Laurindo
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 27TH EDITION, 2020, : 143 - 148
  • [27] Salinity Modeling Using Deep Learning with Data Augmentation and Transfer Learning
    Qi, Siyu
    He, Minxue
    Hoang, Raymond
    Zhou, Yu
    Namadi, Peyman
    Tom, Bradley
    Sandhu, Prabhjot
    Bai, Zhaojun
    Chung, Francis
    Ding, Zhi
    Anderson, Jamie
    Roh, Dong Min
    Huynh, Vincent
    WATER, 2023, 15 (13)
  • [28] Spectrum Prediction Based on GAN and Deep Transfer Learning: A Cross-Band Data Augmentation Framework
    Lin, Fandi
    Chen, Jin
    Ding, Guoru
    Jiao, Yutao
    Sun, Jiachen
    Wang, Haichao
    CHINA COMMUNICATIONS, 2021, 18 (01) : 18 - 32
  • [29] Spectrum Prediction Based on GAN and Deep Transfer Learning: A Cross-Band Data Augmentation Framework
    Fandi Lin
    Jin Chen
    Guoru Ding
    Yutao Jiao
    Jiachen Sun
    Haichao Wang
    中国通信, 2021, 18 (01) : 18 - 32
  • [30] Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients
    Deng, Yixiang
    Lu, Lu
    Aponte, Laura
    Angelidi, Angeliki M.
    Novak, Vera
    Karniadakis, George Em
    Mantzoros, Christos S.
    NPJ DIGITAL MEDICINE, 2021, 4 (01)