A protein identification algorithm for tandem mass spectrometry by incorporating the abundance of mRNA into a binomial probability scoring model

Cited by: 2
Authors
Ma, Wen-Tai [1]
Liu, Zhao-Yu [1]
Chen, Xiao-Zhou [2]
Lin, Zhen-Liang [3]
Zheng, Zhong-Bing [1]
Miao, Wei-Guo [1]
Xie, Shang-Qian [1]
Affiliations
[1] Hainan Univ, Inst Trop Agr & Forestry, Haikou 570228, Hainan, Peoples R China
[2] Yunnan Minzu Univ, Sch Math & Comp Sci, Kunming 650031, Yunnan, Peoples R China
[3] Wenzhou Med Univ, Dept Gen Surg, Affiliated Cangnan Hosp, Wenzhou 325800, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Tandem mass spectrometry; RNA-seq; FPKM; Scoring model; Proteomics; SEQ; TRANSCRIPTOME; PROTEOMICS; DISCOVERY; CORRELATE; PEPTIDES;
DOI
10.1016/j.jprot.2019.02.010
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Scoring peptide-spectrum matches (PSMs) between experimental and theoretical spectra is a key step in identifying proteins in mass spectrometry (MS)-based proteomics, and efficient protein identification from MS/MS data remains a challenge. Strategies that use RNA-seq data increase the number of proteins identified by reconstructing a custom search database and by integrating mRNA abundance into false discovery rate estimation after PSM scoring. However, no existing algorithm incorporates mRNA abundance into the core PSM scoring model itself. We therefore developed a novel PSM scoring model that incorporates mRNA abundance to improve peptide and protein identification. In the new algorithm, mRNA abundance is transformed into a prior probability of protein identification and integrated into PSM re-scoring using a binomial probability distribution model. In comparisons with other algorithms on five MS/MS datasets (human, rat, zebrafish, yeast, and Arabidopsis thaliana), the minimum improvement ratios for peptides and protein groups were 3.39%-9.79% and 0.48%-8.16%, respectively, across the datasets. The new strategy offers an effective solution for MS-based identification of peptides and proteins.
Significance: The new algorithm quantifies mRNA abundance (as FPKM) and incorporates it into a scoring model for peptide-spectrum matches. Improving peptide and protein identification from MS/MS datasets is important for proteomics research.
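The abstract does not give the scoring formula, so the following is only a minimal sketch of how an mRNA-derived prior could be combined with a binomial PSM score. It is not the authors' implementation. The function names, the log-scaled FPKM-to-prior mapping, the `floor` parameter, and the additive log-prior term are all assumptions made for illustration.

```python
import math

def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p): chance of k or more random peak matches."""
    return sum(math.comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

def fpkm_to_prior(fpkm, max_fpkm, floor=0.01):
    """Hypothetical mapping from FPKM to a prior in [floor, 1] (log-scaled).

    Assumes max_fpkm > 0; FPKM values above max_fpkm are clamped.
    """
    scaled = math.log1p(min(fpkm, max_fpkm)) / math.log1p(max_fpkm)
    return floor + (1 - floor) * scaled

def psm_score(n_peaks, n_matched, p_random, prior):
    """Binomial score -10*log10(P_chance), shifted by an assumed log-prior term."""
    p_chance = max(binomial_tail(n_peaks, n_matched, p_random), 1e-300)
    return -10 * math.log10(p_chance) + 10 * math.log10(prior)

# Same spectrum evidence, different transcript abundance for the parent protein.
high = psm_score(20, 8, 0.04, fpkm_to_prior(150.0, 1000.0))
low = psm_score(20, 8, 0.04, fpkm_to_prior(0.5, 1000.0))
print(f"high-abundance score: {high:.2f}, low-abundance score: {low:.2f}")
```

In this sketch, two candidate peptides with identical peak-matching evidence are ranked by the transcript abundance of their parent proteins, which is the general idea the abstract describes. The actual transformation and weighting used in the paper may differ.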
Pages: 53-59
Number of pages: 7