ACVPred: Enhanced prediction of anti-coronavirus peptides by transfer learning combined with data augmentation

被引:4
|
作者
Xu, Yi [1 ]
Liu, Tianyuan [2 ,3 ]
Yang, Yu
Kang, Juanjuan [3 ]
Ren, Liping [4 ]
Ding, Hui [5 ]
Zhang, Yang [3 ]
机构
[1] Tsinghua Univ, Tsinghua Peking Ctr Life Sci, Beijing 100084, Peoples R China
[2] Univ Tsukuba, Tsukuba Life Sci Innovat Program, Tsukuba 3058577, Japan
[3] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Acad Interdiscipline, Chengdu 611137, Peoples R China
[4] Chengdu Neusoft Univ, Sch Healthcare Technol, Chengdu 611844, Peoples R China
[5] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Anti-Coronavirus peptide; Transfer learning; Data augmentation; Model interpretation; Motif;
D O I
10.1016/j.future.2024.06.008
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Anti-coronavirus peptides (ACVPs) have garnered significant attention in COVID-19 therapeutic research due to their precise targeting, low risk of drug resistance, flexible synthesis, and effectiveness against viral mutations. Although some in-silico methods have been developed to predict ACVPs, they suffer from challenges such as limited datasets and a lack of interpretability. Hence, this study introduces ACVPred, an algorithm for ACVP prediction, based on two few -shot learning strategies: transfer learning and data augmentation strategies. Our experiments demonstrate that data augmentation can significantly enhance model performance, while transfer learning can effectively prevent overfitting and strengthen generalizability. Compared to existing methods, ACVPred exhibits superior performance and robust generalization both in training and independent test datasets. Moreover, the interpretability study of the model reveals that its transformer -based core can effectively capture key motifs on ACVP sequences, demonstrating strong feature learning capabilities. Additionally, the findings suggest that the sequence feature weights and key motif positions tend to be distributed towards the N -terminal end of ACVP sequences, providing vital clues for the design of ACVPs. In summary, ACVPred is not only a practical and valuable tool for aiding in the design of ACVPs, but its algorithmic concept also serves as an important reference for research on other small sample prediction problems.
引用
收藏
页码:305 / 315
页数:11
相关论文
共 50 条
  • [31] A surrogate model with data augmentation and deep transfer learning for temperature field prediction of heat source layout
    Xiaoyu Zhao
    Zhiqiang Gong
    Jun Zhang
    Wen Yao
    Xiaoqian Chen
    Structural and Multidisciplinary Optimization, 2021, 64 : 2287 - 2306
  • [32] A surrogate model with data augmentation and deep transfer learning for temperature field prediction of heat source layout
    Zhao, Xiaoyu
    Gong, Zhiqiang
    Zhang, Jun
    Yao, Wen
    Chen, Xiaoqian
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2021, 64 (04) : 2287 - 2306
  • [33] Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients
    Yixiang Deng
    Lu Lu
    Laura Aponte
    Angeliki M. Angelidi
    Vera Novak
    George Em Karniadakis
    Christos S. Mantzoros
    npj Digital Medicine, 4
  • [34] ENHANCED VEHICLE CLASSIFICATION USING TRANSFER LEARNING AND A NOVEL DUPLICATION-BASED DATA AUGMENTATION TECHNIQUE
    Harras, Abdelrahman
    Tsuji, Akinori
    Karungaru, Stephen
    Terada, Kenji
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2021, 17 (06): : 2201 - 2216
  • [35] Data Augmentation and Transfer Learning for Data Quality Assessment in Respiratory Monitoring
    Rozo, Andrea
    Moeyersons, Jonathan
    Morales, John
    Garcia van der Westen, Roberto
    Lijnen, Lien
    Smeets, Christophe
    Jantzen, Sjors
    Monpellier, Valerie
    Ruttens, David
    Van Hoof, Chris
    Van Huffel, Sabine
    Groenendaal, Willemijn
    Varon, Carolina
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [36] ACP-DA: Improving the Prediction of Anticancer Peptides Using Data Augmentation
    Chen, Xian-gan
    Zhang, Wen
    Yang, Xiaofei
    Li, Chenhong
    Chen, Hengling
    FRONTIERS IN GENETICS, 2021, 12
  • [37] Transfer Learning Enhanced Gaussian Process Model for Surface Roughness Prediction with Small Data
    Hu, L.
    Gao, R.
    Guo, Y. B.
    MANUFACTURING LETTERS, 2023, 35 : 1103 - 1108
  • [38] Transfer Learning Enhanced Gaussian Process Model for Surface Roughness Prediction with Small Data
    Hu, L.
    Gao, R.
    Guo, Y. B.
    MANUFACTURING LETTERS, 2023, 35 : 1103 - 1108
  • [39] Prediction of material toughness using ensemble learning and data augmentation
    Smyrnov, Mykyta
    Funcke, Florian
    Kabliman, Evgeniya
    PHILOSOPHICAL MAGAZINE LETTERS, 2024, 104 (01)
  • [40] MACHINE LEARNING FOR THE DOCUMENTATION, PREDICTION, AND AUGMENTATION OF HERITAGE STRUCTURE DATA
    Rihal, Satwant
    Assal, Hisham
    29TH CIPA SYMPOSIUM DOCUMENTING, UNDERSTANDING, PRESERVING CULTURAL HERITAGE. HUMANITIES AND DIGITAL TECHNOLOGIES FOR SHAPING THE FUTURE, VOL. 48-M-2, 2023, : 1301 - 1307