Ortho-proteogenomics: Multiple proteomes investigation through orthology and a new MS-based protocol

被引:86
|
作者
Gallien, Sebastien [1 ]
Perrodou, Emmanuel [2 ,3 ,4 ,5 ]
Carapito, Christine [1 ]
Deshayes, Caroline [6 ,7 ]
Reyrat, Jean-Marc [6 ,7 ]
Van Dorsselaer, Alain [1 ]
Poch, Olivier [2 ,3 ,4 ,5 ]
Schaeffer, Christine [1 ]
Lecompte, Odile [2 ,3 ,4 ,5 ]
机构
[1] CNRS, UMR7178, ULP, IPHC DSA,Lab Spectrometrie Mass Bioorgan, F-67087 Strasbourg, France
[2] IGBMC, Dept Struct Biol & Genom, F-67400 Illkirch Graffenstaden, France
[3] INSERM, U596, F-67400 Illkirch Graffenstaden, France
[4] CNRS, UMR7104, F-67400 Illkirch Graffenstaden, France
[5] Univ Strasbourg, Fac Sci Vie, F-67000 Strasbourg, France
[6] Univ Paris 05, Fac Med Rene Descartes, F-75730 Paris 15, France
[7] INSERM, U570, Unite Pathogenie Infect Syst, F-75730 Paris 15, France
关键词
TANDEM MASS-SPECTROMETRY; MYCOBACTERIUM-TUBERCULOSIS H37RV; FIXED-CHARGE DERIVATIZATION; N-TERMINAL PEPTIDES; ESCHERICHIA-COLI; POSITIONAL PROTEOMICS; GENOME SEQUENCE; IDENTIFICATION; ANNOTATION; PROTEINS;
D O I
10.1101/gr.081901.108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The progress in sequencing technologies irrigates biology with an ever-increasing number of genome sequences. In most cases, the gene repertoire is predicted in silico and conceptually translated into proteins. As recently highlighted, the predicted genes exhibit frequent errors, particularly in start codons, with a serious impact on subsequent biological studies. A new "ortho-proteogenomic" approach is presented here for the annotation refinement of multiple genomes at once. It combines comparative genomics with an original proteomic protocol that allows the characterization of both N-terminal and internal peptides in a single experiment. This strategy was applied to the Mycobacterium genus with Mycobacterium smegmatis as the reference, and identified 946 distinct proteins, including 443 characterized N termini. These experimental data allowed the correction of 19% of the characterized start codons, the identification of 29 proteins missed during the annotation process, and the curation, thanks to comparative genomics, of 4328 sequences of 16 other Mycobacterium proteomes.
引用
收藏
页码:128 / 135
页数:8
相关论文
共 1 条
  • [1] Protocol Metabolic labeling and LC-MS/MS-based identification of interleukin-1a-induced secreted proteomes from epithelial cells in the presence or absence of serum
    Priester, Jasmin
    Soelch, Johanna Meier-
    Weiser, Hendrik
    Heylmann, Daniel
    Weber, Axel
    Linne, Uwe
    Kracht, Michael
    [J]. STAR PROTOCOLS, 2023, 4 (02):