A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra

被引:30
|
作者
Kou, Qiang [1 ]
Wu, Si [2 ]
Tolic, Nikola [3 ]
Pasa-Tolic, Ljiljana [3 ]
Liu, Yunlong [4 ,5 ]
Liu, Xiaowen [1 ,5 ]
机构
[1] Indiana Univ Purdue Univ, Dept BioHlth Informat, Indianapolis, IN 46202 USA
[2] Univ Oklahoma, Dept Chem & Biochem, Norman, OK 73019 USA
[3] Pacific Northwest Natl Lab, Environm Mol Sci Lab, Richland, WA 99354 USA
[4] Indiana Univ Sch Med, Dept Med & Mol Genet, Indianapolis, IN 46202 USA
[5] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
基金
美国国家卫生研究院;
关键词
PROTEIN IDENTIFICATION; POSTTRANSLATIONAL MODIFICATIONS; SITE LOCALIZATION; SPECTROMETRY; ALIGNMENT; PEPTIDES; SEARCH;
D O I
10.1093/bioinformatics/btw806
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a 'bird's eye view' of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. Results: We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry datasets showed that TopMG outperformed existing methods in identifying complex proteoforms.
引用
收藏
页码:1309 / 1316
页数:8
相关论文
共 50 条
  • [1] A graph-based approach for proteoform identification and quantification using top-down homogeneous multiplexed tandem mass spectra
    Zhu, Kaiyuan
    Liu, Xiaowen
    BMC BIOINFORMATICS, 2018, 19
  • [2] A graph-based approach for proteoform identification and quantification using top-down homogeneous multiplexed tandem mass spectra
    Kaiyuan Zhu
    Xiaowen Liu
    BMC Bioinformatics, 19
  • [3] A graph-based filtering method for top-down mass spectral identification
    Runmin Yang
    Daming Zhu
    BMC Genomics, 19
  • [4] A graph-based filtering method for top-down mass spectral identification
    Yang, Runmin
    Zhu, Daming
    BMC GENOMICS, 2018, 19
  • [5] Identification of Ultramodified Proteins Using Top-Down Tandem Mass Spectra
    Liu, Xiaowen
    Hengel, Shawna
    Wu, Si
    Tolic, Nikola
    Pasa-Tolic, Ljiljana
    Pevzner, Pavel A.
    JOURNAL OF PROTEOME RESEARCH, 2013, 12 (12) : 5830 - 5838
  • [6] Proteoform identification based on top-down tandem mass spectra with peak error corrections
    Zhan, Zhaohui
    Wang, Lusheng
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [7] Top-Down Mass Spectrometry: Proteomics to Proteoforms
    Patrie, Steven M.
    MODERN PROTEOMICS - SAMPLE PREPARATION, ANALYSIS AND PRACTICAL APPLICATIONS, 2016, 919 : 171 - 200
  • [8] Spectral probabilities of top-down tandem mass spectra
    Liu, Xiaowen
    Segar, Matthew W.
    Li, Shuai Cheng
    Kim, Sangtae
    BMC GENOMICS, 2014, 15 : 1 - 9
  • [9] Spectral probabilities of top-down tandem mass spectra
    Xiaowen Liu
    Matthew W Segar
    Shuai Cheng Li
    Sangtae Kim
    BMC Genomics, 15
  • [10] A Spectrum Graph-Based Protein Sequence Filtering Algorithm for Proteoform Identification by Top-Down Mass Spectrometry
    Yang, Runmin
    Zhu, Daming
    Kou, Qiang
    Bhat-Nakshatri, Poornima
    Nakshatri, Harikrishna
    Wu, Si
    Liu, Xiaowen
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 222 - 229