pNovo+: De Novo Peptide Sequencing Using Complementary HCD and ETD Tandem Mass Spectra

Cited by: 77
Authors
Chi, Hao [1, 2]
Chen, Haifeng [1, 2]
He, Kun [1, 2]
Wu, Long [1, 2]
Yang, Bing [3]
Sun, Rui-Xiang [1]
Liu, Jianyun [4]
Zeng, Wen-Feng [1, 2]
Song, Chun-Qing [3]
He, Si-Min [1]
Dong, Meng-Qiu [3]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
[3] Natl Inst Biol Sci, Beijing 102206, Peoples R China
[4] Beihang Univ, Beijing Key Lab Digital Media, Lab Intelligent Recognit & Image Proc, Beijing 100191, Peoples R China
Keywords
tandem mass spectrometry; de novo peptide sequencing; HCD; ETD; antisymmetry restriction; k longest paths; INDUCED DISSOCIATION SPECTRA; AUTOMATED INTERPRETATION; PROTEIN IDENTIFICATION; SPECTROMETRIC DATA; SOFTWARE AID; ALGORITHM; MIXTURES; SEARCH; SEQMS; MS/MS;
DOI
10.1021/pr3006843
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
De novo peptide sequencing is the only tool for extracting peptide sequences directly from tandem mass spectrometry (MS/MS) data without any protein database. However, neither the accuracy nor the efficiency of de novo sequencing has been satisfactory, mainly because experimental spectra contain incomplete fragmentation information. Recent advances in MS technology have enabled the acquisition of higher energy collisional dissociation (HCD) and electron transfer dissociation (ETD) spectra of the same precursor. These spectra contain complementary fragmentation information and can be collected with high resolution and high mass accuracy. Taking advantage of these properties, we have developed a new algorithm called pNovo+, which greatly improves the accuracy and speed of de novo sequencing. On tryptic peptides, 86% of the topmost candidate sequences deduced by pNovo+ from HCD + ETD spectral pairs matched the database search results, and the success rate reached 95% if the top three candidates were included, which was much higher than using only HCD (87%) or only ETD spectra (57%). On Asp-N, Glu-C, or Elastase digested peptides, 69-87% of the HCD + ETD spectral pairs were correctly identified by pNovo+ among the topmost candidates, or 84-95% among the top three. On average, pNovo+ takes only 0.018 s to extract the sequence from a spectrum or spectral pair on a common personal computer, which is more than three times as fast as other de novo sequencing programs. The increase in speed is mainly due to pDAG, a component algorithm of pNovo+. pDAG finds the k longest paths in a directed acyclic graph without the antisymmetry restriction. We have verified that the antisymmetry restriction is unnecessary for high resolution, high mass accuracy data. The extensive use of HCD and ETD spectral information and the pDAG algorithm make pNovo+ an excellent de novo sequencing tool.
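The abstract describes pDAG only as finding the k longest paths in a directed acyclic graph without the antisymmetry restriction. The sketch below illustrates that general idea with a textbook dynamic program over a topological order; it is not the authors' pDAG implementation. The function name `k_longest_paths`, the edge-list input, and the toy weights are all assumptions made for illustration, and the construction of a spectrum graph from HCD and ETD peaks is not modeled.

```python
import heapq
from collections import defaultdict, deque


def k_longest_paths(edges, source, sink, k):
    """Return up to k highest-scoring source-to-sink paths in a DAG.

    edges: iterable of (u, v, weight) tuples (hypothetical input format).
    Result: list of (score, path) pairs, best first.
    """
    graph = defaultdict(list)
    indegree = defaultdict(int)
    nodes = {source, sink}
    for u, v, w in edges:
        graph[u].append((v, w))
        indegree[v] += 1
        nodes.update((u, v))

    # Kahn's algorithm: produce a topological order and detect cycles.
    queue = deque(n for n in nodes if indegree[n] == 0)
    order = []
    while queue:
        u = queue.popleft()
        order.append(u)
        for v, _ in graph[u]:
            indegree[v] -= 1
            if indegree[v] == 0:
                queue.append(v)
    if len(order) != len(nodes):
        raise ValueError("graph contains a cycle")

    # best[n] collects candidate (score, path) pairs ending at node n.
    best = defaultdict(list)
    best[source] = [(0.0, (source,))]
    for u in order:
        # All predecessors of u are done, so its candidates are complete;
        # keep only the k best before extending them along outgoing edges.
        best[u] = heapq.nlargest(k, best[u])
        for v, w in graph[u]:
            for score, path in best[u]:
                best[v].append((score + w, path + (v,)))
    return [(score, list(path)) for score, path in best[sink]]


# Toy graph: nodes stand in for prefix masses, weights for edge scores.
toy_edges = [
    (0, 1, 1.0), (0, 2, 2.0), (1, 2, 2.0),
    (1, 3, 2.0), (2, 3, 0.5), (3, 4, 1.0),
]
top_two = k_longest_paths(toy_edges, source=0, sink=4, k=2)
```

Because each node keeps only its k best partial paths, the work per edge is bounded by k, which is one reason a k-longest-paths search on a DAG can be fast; any further speed gains claimed for pDAG depend on details not given in this record.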
Pages: 615-625
Number of pages: 11