共 50 条
What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome?
被引:32
|作者:
Prensner, John R.
[1
,2
]
Abelin, Jennifer G.
[3
]
Kok, Leron W.
[4
]
Clauser, Karl R.
[3
]
Mudge, Jonathan M.
[5
]
Ruiz-Orera, Jorge
[6
]
-Sternberg, Michal Bassani
[7
,8
,9
]
Moritz, Robert L.
[10
]
Deutsch, Eric W.
[10
]
van Heesch, Sebastiaan
[4
]
机构:
[1] Univ Michigan, Dept Pediat, Div Pediat Hematol Oncol, Med Sch, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Med Sch, Dept Biol Chem, Ann Arbor, MI 48109 USA
[3] Broad Inst MIT & Harvard, Cambridge, MA USA
[4] Princess Maxima Ctr Pediat Oncol, Utrecht, Netherlands
[5] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Cambridge, England
[6] Helmholtz Assoc MDC, Cardiovasc & Metab Sci, Max Delbruck Ctr Mol Med, Berlin, Germany
[7] Univ Lausanne, Ludwig Inst Canc Res, Agora Ctr Bugnon 25A, Lausanne, Switzerland
[8] Ctr Hosp Univ Vaudois CHUV, Dept Oncol, Lausanne, Switzerland
[9] Agora Canc Res Ctr, Lausanne, Switzerland
[10] Inst Syst Biol ISB, Seattle, WA USA
基金:
英国惠康基金;
美国国家卫生研究院;
美国国家科学基金会;
关键词:
LONG NONCODING RNAS;
OPEN READING FRAMES;
MHC CLASS-I;
MASS-SPECTROMETRY;
SMALL ORFS;
TRANSLATION;
REVEALS;
ANNOTATION;
INITIATION;
DISCOVERY;
D O I:
10.1016/j.mcpro.2023.100631
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from similar to 19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
引用
收藏
页数:21
相关论文