Small open reading frames: a comparative genetics approach to validation

被引:0
|
作者
Niyati Jain
Felix Richter
Ivan Adzhubei
Andrew J. Sharp
Bruce D. Gelb
机构
[1] Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute,Present Address: Committee On Genetics, Genomics, and Systems Biology
[2] Icahn School of Medicine at Mount,Department of Biomedical Informatics
[3] Hess Center for Science and Medicine,Division of Genetics
[4] The University of Chicago,undefined
[5] Department of Pediatrics,undefined
[6] Icahn School of Medicine at Mount Sinai,undefined
[7] Harvard Medical School,undefined
[8] Brigham and Women’s Hospital,undefined
来源
BMC Genomics | / 24卷
关键词
Micropeptides; Small open reading frames; Human genetic variation; Evolutionary conservation; Comparative genetics;
D O I
暂无
中图分类号
学科分类号
摘要
Open reading frames (ORFs) with fewer than 100 codons are generally not annotated in genomes, although bona fide genes of that size are known. Newer biochemical studies have suggested that thousands of small protein-coding ORFs (smORFs) may exist in the human genome, but the true number and the biological significance of the micropeptides they encode remain uncertain. Here, we used a comparative genomics approach to identify high-confidence smORFs that are likely protein-coding. We identified 3,326 high-confidence smORFs using constraint within human populations and evolutionary conservation as additional lines of evidence. Next, we validated that, as a group, our high-confidence smORFs are conserved at the amino-acid level rather than merely residing in highly conserved non-coding regions. Finally, we found that high-confidence smORFs are enriched among disease-associated variants from GWAS. Overall, our results highlight that smORF-encoded peptides likely have important functional roles in human disease.
引用
收藏
相关论文
共 50 条
  • [31] Computational discovery and annotation of conserved small open reading frames in fungal genomes
    Shuhaila Mat-Sharani
    Mohd Firdaus-Raih
    [J]. BMC Bioinformatics, 19
  • [32] Accurate annotation of human protein-coding small open reading frames
    Thomas F. Martinez
    Qian Chu
    Cynthia Donaldson
    Dan Tan
    Maxim N. Shokhirev
    Alan Saghatelian
    [J]. Nature Chemical Biology, 2020, 16 : 458 - 468
  • [33] Small open reading frames in plant research: from prediction to functional characterization
    Ong, Sheue Ni
    Tan, Boon Chin
    Al-Idrus, Aisyafaznim
    Teo, Chee How
    [J]. 3 BIOTECH, 2022, 12 (03)
  • [34] Accurate annotation of human protein-coding small open reading frames
    Martinez, Thomas F.
    Chu, Qian
    Donaldson, Cynthia
    Tan, Dan
    Shokhirev, Maxim N.
    Saghatelian, Alan
    [J]. NATURE CHEMICAL BIOLOGY, 2020, 16 (04) : 458 - +
  • [35] smORF-EP: predicting the effect of variants in small open reading frames
    Fernandes, Maria
    D'Souza, Elston Neil
    Geary, Alex
    Whiffin, Nicola
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 671 - 671
  • [36] Computational discovery and annotation of conserved small open reading frames in fungal genomes
    Mat-Sharani, Shuhaila
    Firdaus-Raih, Mohd
    [J]. BMC BIOINFORMATICS, 2019, 19 (Suppl 13)
  • [37] Small open reading frames in plant research: from prediction to functional characterization
    Sheue Ni Ong
    Boon Chin Tan
    Aisyafaznim Al-Idrus
    Chee How Teo
    [J]. 3 Biotech, 2022, 12
  • [38] SEARCHING FOR CLONES WITH OPEN READING FRAMES
    GRAY, MR
    MAZZARA, GP
    REDDY, P
    ROSBASH, M
    [J]. METHODS IN ENZYMOLOGY, 1987, 154 : 129 - 156
  • [39] OPEN READING FRAMES AND TRANSLATIONAL CONTROL
    KESSEL, M
    GRUSS, P
    [J]. NATURE, 1988, 332 (6160) : 117 - 118
  • [40] Generation of overlapping open reading frames
    Cebrat, S
    Dudek, MR
    [J]. TRENDS IN GENETICS, 1996, 12 (01) : 12 - 12