PGPointNovo: an efficient neural network-based tool for parallel de novo peptide sequencing

被引:2
|
作者
Xu, Xiaofang [1 ]
Yang, Chunde [1 ]
He, Qiang [3 ]
Shu, Kunxian [5 ]
Xinpu, Yuan [6 ]
Chen, Zhiguang [4 ]
Zhu, Yunping [2 ]
Chen, Tao [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Beijing Inst Life, Beijing Proteome Res Ctr, Natl Ctr Prot Sci Beijing, State Key Lab Prote, Beijing 102206, Peoples R China
[3] Swinburne Univ Technol, Sch Software & Elect Engn, Melbourne, Vic 3122, Australia
[4] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 26469, Peoples R China
[5] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Big Data Bio Intelligence, Chongqing 400065, Peoples R China
[6] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Gen Surg, Beijing, Peoples R China
来源
BIOINFORMATICS ADVANCES | 2023年 / 3卷 / 01期
关键词
CANCER; NUMBER; VALIDATION; CLUSTERS;
D O I
10.1093/bioadv/vbad057
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
De novo peptide sequencing for tandem mass spectrometry data is not only a key technology for novel peptide identification, but also a precedent task for many downstream tasks, such as vaccine and antibody studies. In recent years, neural network models for de novo peptide sequencing have manifested a remarkable ability to accommodate various data sources and outperformed conventional peptide identification tools. However, the excellent model is computationally expensive, taking up to 1 week to process about 400 000 spectrums. This article presents PGPointNovo, a novel neural network-based tool for parallel de novo peptide sequencing. PGPointNovo uses data parallelization technology to accelerate training and inference and optimizes the training obstacles caused by large batch sizes. The results of extensive experiments conducted on multiple datasets of different sizes demonstrate that compared with PointNovo the excellent neural network-based de novo peptide sequencing tool, PGPointNovo, accelerates de novo peptide sequencing by up to 7.35x without precision or recall compromises.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] An efficient algorithm for de novo peptide sequencing
    Brunetti, S
    Dutta, D
    Liberatori, S
    Mori, E
    Varrazzo, D
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2005, : 312 - 315
  • [2] UniNovo: a universal tool for de novo peptide sequencing
    Jeong, Kyowon
    Kim, Sangtae
    Pevzner, Pavel A.
    BIOINFORMATICS, 2013, 29 (16) : 1953 - 1962
  • [3] AUDENS:: A tool for automated peptide de novo sequencing
    Grossmann, J
    Roos, FF
    Cieliebak, M
    Lipták, Z
    Mathis, LK
    Müller, M
    Gruissem, W
    Baginsky, S
    JOURNAL OF PROTEOME RESEARCH, 2005, 4 (05) : 1768 - 1774
  • [4] MRUniNovo: an efficient tool for de novo peptide sequencing utilizing the hadoop distributed computing framework
    Li, Chuang
    Chen, Tao
    He, Qiang
    Zhu, Yunping
    Li, Kenli
    BIOINFORMATICS, 2017, 33 (06) : 944 - 946
  • [5] A residual network for de novo peptide sequencing with attention mechanism
    Liu, Zihang
    Zhao, Chunhui
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1165 - 1170
  • [6] An efficient parallel neural network-based multi-instance learning algorithm
    Li, Cheng Hua
    Gondra, Iker
    Liu, Lijun
    JOURNAL OF SUPERCOMPUTING, 2012, 62 (02): : 724 - 740
  • [7] An efficient parallel neural network-based multi-instance learning algorithm
    Cheng Hua Li
    Iker Gondra
    Lijun Liu
    The Journal of Supercomputing, 2012, 62 : 724 - 740
  • [8] PepNovo: De novo peptide sequencing via probabilistic network modeling
    Frank, A
    Pevzner, P
    ANALYTICAL CHEMISTRY, 2005, 77 (04) : 964 - 973
  • [9] SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis
    Li, Chuang
    Li, Kenli
    Li, Keqin
    Xie, Xianghui
    Lin, Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES, 2019, 15 (09): : 1787 - 1801
  • [10] Accurate de novo peptide sequencing using fully convolutional neural networks
    Liu, Kaiyuan
    Ye, Yuzhen
    Li, Sujun
    Tang, Haixu
    NATURE COMMUNICATIONS, 2023, 14 (01)