UniNovo: a universal tool for de novo peptide sequencing

Cited by: 61
Authors
Jeong, Kyowon [1]
Kim, Sangtae [2]
Pevzner, Pavel A. [2]
Affiliations
[1] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA 92093 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92093 USA
Keywords
TANDEM MASS-SPECTROMETER; PROTEIN IDENTIFICATION; LOW-ENERGY; DATABASE; SPECTRA; DISSOCIATION; ETD; PROBABILITY; SEARCH; MS/MS;
DOI
10.1093/bioinformatics/btt338
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependencies between different ion types, where such dependencies are learned automatically using a modified offset frequency function. Results: The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable to others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides).
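The abstract mentions a modified offset frequency function but does not define it. As a rough illustration only, the sketch below computes a plain (unmodified) offset frequency histogram: for spectra with known peptides, it counts how often a peak appears at each mass offset from a prefix mass, so that frequent offsets hint at ion types. This is not UniNovo's scoring function; the function names, the bin width, the residue-mass subset and the toy spectra are all assumptions made for the example.

```python
from collections import Counter

# Monoisotopic residue masses (Da); a small illustrative subset.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "L": 113.08406, "K": 128.09496,
}
PROTON = 1.00728  # mass of a proton (Da)


def prefix_masses(peptide):
    """Cumulative residue masses of all proper prefixes of the peptide."""
    total, masses = 0.0, []
    for residue in peptide[:-1]:
        total += RESIDUE_MASS[residue]
        masses.append(total)
    return masses


def offset_frequency(annotated_spectra, max_offset=30.0, bin_width=0.1):
    """Map each binned (peak m/z - prefix mass) offset to its frequency.

    annotated_spectra: iterable of (peptide, list of peak m/z values).
    The frequency is the bin count divided by the number of prefix masses.
    """
    counts, n_prefixes = Counter(), 0
    for peptide, peaks in annotated_spectra:
        for prefix in prefix_masses(peptide):
            n_prefixes += 1
            for mz in peaks:
                offset = mz - prefix
                if abs(offset) <= max_offset:
                    counts[round(offset / bin_width)] += 1
    return {round(b * bin_width, 6): c / n_prefixes for b, c in counts.items()}


# Hypothetical toy data: singly charged b-ion peaks plus one noise peak.
toy_spectra = [
    ("GASK", [m + PROTON for m in prefix_masses("GASK")] + [150.0]),
    ("PVLK", [m + PROTON for m in prefix_masses("PVLK")]),
]
frequencies = offset_frequency(toy_spectra)
best_offset = max(frequencies, key=frequencies.get)
print(best_offset, frequencies[best_offset])
```

On real training data, prominent bins of such a histogram would correspond to ion types (for example, b-ions near a +1 Da offset from the prefix mass), which is the kind of signal an offset-based scoring function can learn from.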
Pages: 1953-1962
Page count: 10
Related papers
50 in total
  • [1] AUDENS: A tool for automated peptide de novo sequencing
    Grossmann, J
    Roos, FF
    Cieliebak, M
    Lipták, Z
    Mathis, LK
    Müller, M
    Gruissem, W
    Baginsky, S
    JOURNAL OF PROTEOME RESEARCH, 2005, 4 (05) : 1768 - 1774
  • [2] Multiplex De Novo Sequencing of Peptide Antibiotics
    Mohimani, Hosein
    Liu, Wei-Ting
    Yang, Yu-Liang
    Gaudencio, Susana P.
    Fenical, William
    Dorrestein, Pieter C.
    Pevzner, Pavel A.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2011, 18 (11) : 1371 - 1381
  • [3] On peptide de novo sequencing: a new approach
    Bruni, R
    Gianfranceschi, G
    Koch, G
    JOURNAL OF PEPTIDE SCIENCE, 2005, 11 (04) : 225 - 234
  • [4] De novo peptide sequencing by deep learning
    Ngoc Hieu Tran
    Zhang, Xianglilan
    Xin, Lei
    Shan, Baozhen
    Li, Ming
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (31) : 8247 - 8252
  • [5] Multiplex De Novo Sequencing of Peptide Antibiotics
    Mohimani, Hosein
    Liu, Wei-Ting
    Yang, Yu-Liang
    Gaudencio, Susana P.
    Fenical, William
    Dorrestein, Pieter C.
    Pevzner, Pavel A.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, 2011, 6577 : 267 - +
  • [6] An efficient algorithm for de novo peptide sequencing
    Brunetti, S
    Dutta, D
    Liberatori, S
    Mori, E
    Varrazzo, D
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2005, : 312 - 315
  • [7] PGPointNovo: an efficient neural network-based tool for parallel de novo peptide sequencing
    Xu, Xiaofang
    Yang, Chunde
    He, Qiang
    Shu, Kunxian
    Xinpu, Yuan
    Chen, Zhiguang
    Zhu, Yunping
    Chen, Tao
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [8] MRUniNovo: an efficient tool for de novo peptide sequencing utilizing the hadoop distributed computing framework
    Li, Chuang
    Chen, Tao
    He, Qiang
    Zhu, Yunping
    Li, Kenli
    BIOINFORMATICS, 2017, 33 (06) : 944 - 946
  • [9] Peptide and protein de novo sequencing by mass spectrometry
    Standing, KG
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2003, 13 (05) : 595 - 601
  • [10] A model of random sequences for de novo peptide sequencing
    Jarman, KD
    Cannon, WR
    Jarman, KH
    Heredia-Langner, A
    THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 206 - 213