Discussion on common data analysis strategies used in MS-based proteomics

被引:17
|
作者
Matthiesen, Rune [1 ]
Azevedo, Luisa [1 ]
Amorim, Antonio [1 ,2 ]
Carvalho, Ana Sofia [1 ]
机构
[1] Univ Porto, Inst Mol Pathol & Immunol, P-4200465 Oporto, Portugal
[2] Univ Porto, Fac Sci, P-4200465 Oporto, Portugal
关键词
Algorithms; Bioinformatics; Interpretation of mass spectra; Tandem mass spectra; PEPTIDE MASS-SPECTRA; LIQUID-CHROMATOGRAPHY; STATISTICAL-MODEL; COLORECTAL-CANCER; CHARGE-STATE; POSTTRANSLATIONAL MODIFICATIONS; PROTEIN IDENTIFICATION; EFFICIENT CALCULATION; SIALYL LEWIS(A); BAYES FACTORS;
D O I
10.1002/pmic.201000404
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Current proteomics technology is limited in resolving the proteome complexity of biological systems. The main issue at stake is to increase throughput and spectra quality so that spatiotemporal dimensions, population parameters and the complexity of protein modifications on a quantitative scale can be considered. MS-based proteomics and protein arrays are the main players in large-scale proteome analysis and an integration of these two methodologies is powerful but presently not sufficient for detailed quantitative and spatiotemporal proteome characterization. Improvements of instrumentation for MS-based proteomics have been achieved recently resulting in data sets of approximately one million spectra which is a large step in the right direction. The corresponding raw data range from 50 to 100 Gb and are frequently made available. Multidimensional LC-MS data sets have been demonstrated to identify and quantitate 2000-8000 proteins from whole cell extracts. The analysis of the resulting data sets requires several steps from raw data processing, to database-dependent search, statistical evaluation of the search result, quantitative algorithms and statistical analysis of quantitative data. A large number of software tools have been proposed for the above-mentioned tasks. However, it is not the aim of this review to cover all software tools, but rather discuss common data analysis strategies used by various algorithms for each of the above-mentioned steps in a non-redundant approach and to argue that there are still some areas which need improvements.
引用
收藏
页码:604 / 619
页数:16
相关论文
共 50 条
  • [1] Application of wavelet transform to the MS-based proteomics data preprocessing
    Du, Pan
    Lin, Simon A.
    Kibbe, Warren A.
    Wang, Haihui
    [J]. PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 680 - +
  • [2] CYSTEINE TAGGING FOR MS-BASED PROTEOMICS
    Giron, Priscille
    Dayon, Loic
    Sanchez, Jean-Charles
    [J]. MASS SPECTROMETRY REVIEWS, 2011, 30 (03) : 366 - 395
  • [3] Application of Diffusion based framelet transform to the MS-Based Proteomics Data Preprocessing
    Amir, Saeed
    Wang, Haihui
    Sun, Fangtao
    [J]. 2013 10TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2013, : 109 - 112
  • [4] Affinity prefractionation for MS-based plasma proteomics
    Pernemalm, Maria
    Lewensohn, Rolf
    Lehtio, Janne
    [J]. PROTEOMICS, 2009, 9 (06) : 1420 - 1427
  • [5] A Galaxy of informatics resources for MS-based proteomics
    Mehta, Subina
    Bernt, Matthias
    Chambers, Matthew
    Fahrner, Matthias
    Foell, Melanie Christine
    Gruening, Bjoern
    Horro, Carlos
    Johnson, James E.
    Loux, Valentin
    Rajczewski, Andrew T.
    Schilling, Oliver
    Vandenbrouck, Yves
    Gustafsson, Ove Johan Ragnar
    Thang, W. C. Mike
    Hyde, Cameron
    Price, Gareth
    Jagtap, Pratik D.
    Griffin, Timothy J.
    [J]. EXPERT REVIEW OF PROTEOMICS, 2023, 20 (11) : 251 - 266
  • [6] A critical comparison of three MS-based approaches for quantitative proteomics analysis
    Taverna, Domenico
    Gaspari, Marco
    [J]. JOURNAL OF MASS SPECTROMETRY, 2021, 56 (01):
  • [7] Combining bioinformatics and MS-based proteomics: clinical implications
    Acosta-Martin, Adelina Elena
    Lane, Lydie
    [J]. EXPERT REVIEW OF PROTEOMICS, 2014, 11 (03) : 269 - 284
  • [8] AlphaPept: a modern and open framework for MS-based proteomics
    Strauss M.T.
    Bludau I.
    Zeng W.-F.
    Voytik E.
    Ammar C.
    Schessner J.P.
    Ilango R.
    Gill M.
    Meier F.
    Willems S.
    Mann M.
    [J]. Nature Communications, 15 (1)
  • [9] Glycobioinformatics: Current strategies and tools for data mining in MS-based glycoproteomics
    Li, Feng
    Glinskii, Olga V.
    Glinsky, Vladislav V.
    [J]. PROTEOMICS, 2013, 13 (02) : 341 - 354
  • [10] MS-Based Proteomics of Body Fluids: The End of the Beginning
    Bader, Jakob M.
    Albrecht, Vincent
    Mann, Matthias
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2023, 22 (07)