Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures

Cited by: 41
Authors
Leong, Alyssa Zi-Xin [1]
Lee, Pey Yee [1]
Mohtar, M. Aiman [1]
Syafruddin, Saiful Effendi [1]
Pung, Yuh-Fen [2]
Low, Teck Yew [1]
Affiliations
[1] Univ Kebangsaan Malaysia, UKM Med Mol Biol Inst UMBI, Kuala Lumpur 56000, Malaysia
[2] Univ Nottingham Malaysia, Sch Pharm, Div Biomed Sci, Semenyih 43500, Selangor, Malaysia
Keywords
Short open reading frame (sORF); Small open reading frame (smORF); Microproteins; Ribosome profiling (RIBO-Seq); Mass spectrometry; Proteogenomics; RIBOSOME PROFILING REVEALS; MESSENGER-RNA; PROTEIN IDENTIFICATION; FUNCTIONAL ANNOTATION; ENCODED PEPTIDES; UPSTREAM ORFS; IN-VIVO; TRANSLATION; PROTEOMICS; DISCOVERY;
DOI
10.1186/s12929-022-00802-5
Chinese Library Classification (CLC) number
Q2 [Cell biology];
Discipline classification code
071009 ; 090102 ;
Abstract
A short open reading frame (sORF) comprises <= 300 bases and encodes a microprotein, or sORF-encoded protein (SEP), of <= 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were shown to possess coding potential by ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genome locations. Nonetheless, the existence of corresponding stable and functional microproteins was initially supported by little experimental evidence. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of the emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and omics approaches used for predicting, sequencing, validating, and characterising these recently discovered entities. These strategies include RIBO-Seq, which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Our discussion then extends to the functional characterisation of microproteins through CRISPR/Cas9 screens and protein-protein interaction (PPI) studies. Beyond detection methodologies, we also highlight the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies in its emphasis on validating the functional role of microproteins, which could contribute to the future landscape of microproteomics.
Pages: 15
Related papers
50 in total
  • [41] Genome-Wide Identification of Coding Small Open Reading Frames: The Unknown Transcriptome
    李红梅
    胡传圣
    白玲
    Journal of Shanghai Jiaotong University (Science), 2014, 19 (06) : 663 - 668
  • [42] Identification of three additional femAB-like open reading frames in Staphylococcus aureus
    Tschierske, M
    Mori, C
    Rohrer, S
    Ehlert, K
    Shaw, KJ
    Berger-Bächi, B
    FEMS MICROBIOLOGY LETTERS, 1999, 171 (02) : 97 - 102
  • [43] csORF-finder: an effective ensemble learning framework for accurate identification of multi-species coding short open reading frames
    Zhang, Meng
    Zhao, Jian
    Li, Chen
    Ge, Fang
    Wu, Jing
    Jiang, Bin
    Song, Jiangning
    Song, Xiaofeng
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
  • [44] Short internal open reading frames repress the translation of N-terminally truncated proteoforms
    Fettig, Raphael
    Gonda, Zita
    Walter, Niklas
    Sallmann, Paul
    Thanisch, Christiane
    Winter, Markus
    Bauer, Susanne
    Zhang, Lei
    Linden, Greta
    Litfin, Margarethe
    Khamanaeva, Marina
    Storm, Sarah
    Muenzing, Christina
    Etard, Christelle
    Armant, Olivier
    Vazquez, Olalla
    Kassel, Olivier
    EMBO REPORTS, 2025, : 1566 - 1589
  • [45] Proteomic analysis of meiosis and characterization of novel short open reading frames in the fission yeast Schizosaccharomyces pombe
    Huraiova, Barbora
    Kanovits, Judit
    Polakova, Silvia Bagelova
    Cipak, Lubos
    Benko, Zsigmond
    Sevcovicova, Andrea
    Anrather, Dorothea
    Ammerer, Gustav
    Duncan, Caia D. S.
    Mata, Juan
    Gregan, Juraj
    CELL CYCLE, 2020, 19 (14) : 1777 - 1785
  • [46] Three-nucleotide periodicity of nucleotide diversity in a population enables the identification of open reading frames
    Jiang, Mengyun
    Ning, Weidong
    Wu, Shishi
    Wang, Xingwei
    Zhu, Kun
    Li, Aomei
    Li, Yongyao
    Cheng, Shifeng
    Song, Bo
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [47] Identification of eukaryotic open reading frames in metagenomic cDNA libraries made from environmental samples
    Grant, S
    Grant, WD
    Cowan, DA
    Jones, BE
    Ma, YH
    Ventosa, A
    Heaphy, S
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2006, 72 (01) : 135 - 143
  • [48] Identification of small open reading frames in plant lncRNA using class-imbalance learning
    Zhao, Siyuan
    Meng, Jun
    Wekesa, Jael Sanyanda
    Luan, Yushi
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [49] Regulation of fungal gene expression via short open reading frames in the mRNA 5′ untranslated region
    Vilela, C
    McCarthy, JEG
    MOLECULAR MICROBIOLOGY, 2003, 49 (04) : 859 - 867
  • [50] Characterization of new proteins found by analysis of short open reading frames from the full yeast genome
    Andrade, MA
    Daruvar, A
    Casari, G
    Schneider, R
    Termier, M
    Sander, C
    YEAST, 1997, 13 (14) : 1363 - 1374