Statistical similarities between transcriptomics and quantitative shotgun proteomics data

被引:136
|
作者
Pavelka, Norman [1 ]
Fournier, Marjorie L. [1 ]
Swanson, Selene K. [1 ]
Pelizzola, Mattia [2 ]
Ricciardi-Castagnoli, Paola [3 ]
Florens, Laurence [1 ]
Washburn, Michael P. [1 ]
机构
[1] Stowers Inst Med Res, Kansas City, MO 64110 USA
[2] Univ Milano Bicocca, Dept Biosci & Biotechnol, I-20126 Milan, Italy
[3] Singapore Immunol Network, Singapore 138648, Singapore
关键词
D O I
10.1074/mcp.M700240-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
If the large collection of microarray-specific statistical tools was applicable to the analysis of quantitative shotgun proteomics datasets, it would certainly foster an important advancement of proteomics research. Here we analyze two large multidimensional protein identification technology datasets, one containing eight replicates of the soluble fraction of a yeast whole-cell lysate and one containing nine replicates of a human immunoprecipitate, to test whether normalized spectral abundance factor (NSAF) values share substantially similar statistical properties with transcript abundance values from Affymetrix GeneChip data. First we show similar dynamic range and distribution properties of these two types of numeric values. Next we show that the standard deviation (S.D.) of a protein's NSAF values was dependent on the average NSAF value of the protein itself, following a power law. This relationship can be modeled by a power law global error model (PLGEM), initially developed to describe the variance-versus-mean dependence that exists in GeneChip data. PLGEM parameters obtained from NSAF datasets proved to be surprisingly similar to the typical parameters observed in GeneChip datasets. The most important common feature identified by this approach was that, although in absolute terms the S.D. of replicated abundance values increases as a function of increasing average abundance, the coefficient of variation, a relative measure of variability, becomes progressively smaller under the same conditions. We next show that PLGEM parameters were reasonably stable to decreasing numbers of replicates. We finally illustrate one possible application of PLGEM in the identification of differentially abundant proteins that might potentially outperform standard statistical tests. In summary, we believe that this body of work lays the foundation for the application of microarray-specific tools in the analysis of NSAF datasets.
引用
收藏
页码:631 / 644
页数:14
相关论文
共 50 条
  • [1] Statistical similarities between transcriptomics and quantitative shotgun proteomics data
    Pavelka, N.
    Swanson, S.
    Fournier, M.
    Pelizzola, M.
    Ricciardi-Castagnoli, P.
    Florens, L.
    Washburn, M.
    MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (08) : 60 - 60
  • [2] Cancer proteomics by quantitative shotgun proteomics
    Chen, Emily I.
    Yates, John R., III
    MOLECULAR ONCOLOGY, 2007, 1 (02) : 144 - 159
  • [3] A correlation algorithm for the automated quantitative analysis of shotgun proteomics data
    MacCoss, MJ
    Wu, CC
    Liu, HB
    Sadygov, R
    Yates, JR
    ANALYTICAL CHEMISTRY, 2003, 75 (24) : 6912 - 6921
  • [4] Transcriptomics and quantitative proteomics in virology
    Berard, Alicia
    Kroeker, Andrea L.
    Coombs, Kevin M.
    FUTURE VIROLOGY, 2012, 7 (12) : 1193 - 1204
  • [5] Molecular similarities and dissimilarities between hellp syndrome and preeclampsia (PE) using shotgun and targeted proteomics
    Jing, Hongwu
    Heinze, Sten
    Hearne, Stephanie
    Zhao, Guomao
    Rumpf, Wolfgang
    Ray, William
    Buhimschi, Catalin S.
    Wysocki, Vicki
    Buhimschi, Irina A.
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2019, 220 (01) : S86 - S86
  • [6] Development of a bioinformatics platform for analysis of quantitative transcriptomics and proteomics data: the OMnalysis
    Tyagi, Punit
    Bhide, Mangesh
    PEERJ, 2021, 9
  • [7] Infrared Multiphoton Dissociation for Quantitative Shotgun Proteomics
    Ledvina, Aaron R.
    Lee, M. Violet
    McAlister, Graeme C.
    Westphall, Michael S.
    Coon, Joshua J.
    ANALYTICAL CHEMISTRY, 2012, 84 (10) : 4513 - 4519
  • [8] Optimization of Statistical Methods Impact on Quantitative Proteomics Data
    Pursiheimo, Anna
    Vehmas, Anni P.
    Afzal, Saira
    Suomi, Tomi
    Chand, Thaman
    Strauss, Leena
    Poutanen, Matti
    Rokka, Anne
    Corthals, Garry L.
    Elo, Laura L.
    JOURNAL OF PROTEOME RESEARCH, 2015, 14 (10) : 4118 - 4126
  • [9] Hierarchical Clustering of Shotgun Proteomics Data
    Koskinen, Ville R.
    Emery, Patrick A.
    Creasy, David M.
    Cottrell, John S.
    MOLECULAR & CELLULAR PROTEOMICS, 2011, 10 (06)
  • [10] Computational analysis of shotgun proteomics data
    MacCross, MJ
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2005, 9 (01) : 88 - 94