Statistical similarities between transcriptomics and quantitative shotgun proteomics data

被引:136
|
作者
Pavelka, Norman [1 ]
Fournier, Marjorie L. [1 ]
Swanson, Selene K. [1 ]
Pelizzola, Mattia [2 ]
Ricciardi-Castagnoli, Paola [3 ]
Florens, Laurence [1 ]
Washburn, Michael P. [1 ]
机构
[1] Stowers Inst Med Res, Kansas City, MO 64110 USA
[2] Univ Milano Bicocca, Dept Biosci & Biotechnol, I-20126 Milan, Italy
[3] Singapore Immunol Network, Singapore 138648, Singapore
关键词
D O I
10.1074/mcp.M700240-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
If the large collection of microarray-specific statistical tools was applicable to the analysis of quantitative shotgun proteomics datasets, it would certainly foster an important advancement of proteomics research. Here we analyze two large multidimensional protein identification technology datasets, one containing eight replicates of the soluble fraction of a yeast whole-cell lysate and one containing nine replicates of a human immunoprecipitate, to test whether normalized spectral abundance factor (NSAF) values share substantially similar statistical properties with transcript abundance values from Affymetrix GeneChip data. First we show similar dynamic range and distribution properties of these two types of numeric values. Next we show that the standard deviation (S.D.) of a protein's NSAF values was dependent on the average NSAF value of the protein itself, following a power law. This relationship can be modeled by a power law global error model (PLGEM), initially developed to describe the variance-versus-mean dependence that exists in GeneChip data. PLGEM parameters obtained from NSAF datasets proved to be surprisingly similar to the typical parameters observed in GeneChip datasets. The most important common feature identified by this approach was that, although in absolute terms the S.D. of replicated abundance values increases as a function of increasing average abundance, the coefficient of variation, a relative measure of variability, becomes progressively smaller under the same conditions. We next show that PLGEM parameters were reasonably stable to decreasing numbers of replicates. We finally illustrate one possible application of PLGEM in the identification of differentially abundant proteins that might potentially outperform standard statistical tests. In summary, we believe that this body of work lays the foundation for the application of microarray-specific tools in the analysis of NSAF datasets.
引用
收藏
页码:631 / 644
页数:14
相关论文
共 50 条
  • [31] Estimating relative abundances of proteins from shotgun proteomics data
    Sean McIlwain
    Michael Mathews
    Michael S Bereman
    Edwin W Rubel
    Michael J MacCoss
    William Stafford Noble
    BMC Bioinformatics, 13
  • [32] Focus on the spectra that matter by clustering of quantification data in shotgun proteomics
    The, Matthew
    Kall, Lukas
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [33] Targeted Feature Detection for Data-Dependent Shotgun Proteomics
    Weisser, Hendrik
    Choudhary, Jyoti S.
    JOURNAL OF PROTEOME RESEARCH, 2017, 16 (08) : 2964 - 2974
  • [34] Estimating relative abundances of proteins from shotgun proteomics data
    McIlwain, Sean
    Mathews, Michael
    Bereman, Michael S.
    Rubel, Edwin W.
    MacCoss, Michael J.
    Noble, William Stafford
    BMC BIOINFORMATICS, 2012, 13 : 308
  • [35] Focus on the spectra that matter by clustering of quantification data in shotgun proteomics
    Matthew The
    Lukas Käll
    Nature Communications, 11
  • [36] On Marathons and Sprints: An Integrated Quantitative Proteomics and Transcriptomics Analysis of Differences Between Slow and Fast Muscle Fibers
    Drexler, Hannes C. A.
    Ruhs, Aaron
    Konzer, Anne
    Mendler, Luca
    Bruckskotten, Mark
    Looso, Mario
    Guenther, Stefan
    Boettger, Thomas
    Krueger, Marcus
    Braun, Thomas
    MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (06)
  • [37] The role of statistical power analysis in quantitative proteomics
    Levin, Yishai
    PROTEOMICS, 2011, 11 (12) : 2565 - 2567
  • [38] mapDIA: Preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry
    Teo, Guoshou
    Kim, Sinae
    Tsou, Chih-Chiang
    Collins, Ben
    Gingras, Anne-Claude
    Nesvizhskii, Alexey I.
    Choi, Hyungwon
    JOURNAL OF PROTEOMICS, 2015, 129 : 108 - 120
  • [39] Quantitative and In-Depth Survey of the Isotopic Abundance Distribution Errors in Shotgun Proteomics
    Chang, Cheng
    Zhang, Jiyang
    Xu, Changming
    Zhao, Yan
    Ma, Jie
    Chen, Tao
    He, Fuchu
    Xie, Hongwei
    Zhu, Yunping
    ANALYTICAL CHEMISTRY, 2016, 88 (13) : 6844 - 6851
  • [40] Quantitative Shotgun Proteomics Analysis of Amyloid From Paraffom-Embedded Tissue
    Dai, Dao-Fu
    Yang, Han-Yin
    Alpers, Charles
    Maccoss, Michael
    Smith, Kelly
    MODERN PATHOLOGY, 2015, 28 : 407A - 407A