An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome

被引:1
|
作者
Song, Hongtao [1 ]
Lin, Kui [1 ]
Hu, Jinglu [2 ]
Pang, Erli [1 ]
机构
[1] Beijing Normal Univ, Coll Life Sci, MOE Key Lab Biodivers Sci & Ecol Engn, Beijing, Peoples R China
[2] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka, Japan
来源
基金
中国国家自然科学基金;
关键词
cucumber; gene functional annotation; collinear segments; orthology; protein-coding gene; INFORMATION RESOURCE TAIR; RNA-SEQ; PLANT GENOMES; DRAFT GENOME; SEQUENCE; IDENTIFICATION; DUPLICATION; ORTHOLOGY; SYNTENY; POTATO;
D O I
10.3389/fpls.2018.00325
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allows for better and more robust functional assignments of predicted genes. As one of the most reliable strategies, the determination of collinearity informationmay facilitate reliable orthology inferences among genes from multiple related genomes. Currently, the identification of collinear segments has mainly been based on conservation of gene order and orientation. Over the course of plant genome evolution, various evolutionary events have disrupted or distorted the order of genes along chromosomes, making it difficult to use those genes as genome-wide markers for plant genome comparisons. Results: Using the localized LASTZ/MULTIZ analysis pipeline, we aligned 15 genomes, including cucumber and other related angiosperm plants, and identified a set of genomic segments that are short in length, stable in structure, uniform in distribution and highly conserved across all 15 plants. Compared with protein-coding genes, these conserved segments were more suitable for use as genomic markers for detecting collinear segments among distantly divergent plants. Guided by this set of identified collinear genomic segments, we inferred 94,486 orthologous protein-coding gene pairs (OPPs) between cucumber and 14 other angiosperm species, which were used as proxies for transferring functional terms to cucumber genes from the annotations of the other 14 genomes. In total, 10,885 protein-coding genes were assigned Gene Ontology (GO) terms which was nearly 1,300more than results collected in Uniprot-proteomic database. Our results showed that annotation accuracy would been improved compared with other existing approaches. Conclusions: In this study, we provided an alternative resource for the functional annotation of predicted cucumber protein-coding genes, which we expect will be beneficial for the cucumber's biological study, accessible from http://cmb. bnu. edu. cn/functional_annotation. Meanwhile, using the cucumber reference genome as a case study, we presented an efficient strategy for transferring gene functional information from previously well-characterized protein-coding genes inmodel species to newly sequenced or "non-model" plant species.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Molecular and functional characterization of two DELLA protein-coding genes in litchi
    Wang, Yi
    He, Shae
    Wei, Yongzan
    Dong, Chen
    Liu, Liqin
    Jue, Dengwei
    Shi, Shengyou
    Li, Weicai
    [J]. GENE, 2020, 738
  • [32] Protein-Coding Genes' Retrocopies and Their Functions
    Kubiak, Magdalena Regina
    Makalowska, Izabela
    [J]. VIRUSES-BASEL, 2017, 9 (04):
  • [33] Origins of new protein-coding genes
    不详
    [J]. SCIENCE, 2021, 371 (6531) : 779 - 780
  • [34] Introns in protein-coding genes in Archaea
    Watanabe, Y
    Yokobori, S
    Inaba, T
    Yamagishi, A
    Oshima, T
    Kawarabayasi, Y
    Kikuchi, H
    Kita, K
    [J]. FEBS LETTERS, 2002, 510 (1-2) : 27 - 30
  • [35] Re-prediction of protein-coding genes in the genome of Amsacta moorei entomopoxvirus
    Guo, Feng-Biao
    Yub, Xiu-Juan
    [J]. JOURNAL OF VIROLOGICAL METHODS, 2007, 146 (1-2) : 389 - 392
  • [36] Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
    Richardson, Casey R.
    Luo, Qing-Jun
    Gontcharova, Viktoria
    Jiang, Ying-Wen
    Samanta, Manoj
    Youn, Eunseog
    Rock, Christopher D.
    [J]. PLOS ONE, 2010, 5 (05):
  • [37] An updated resource for the detection of protein-coding circRNA with CircProPlus
    Gong, Xue
    Liu, Yunchang
    Wu, Gengze
    Xu, Zheqi
    Zeng, Liping
    Tian, Miao
    Zhang, Runjun
    Zeng, Chunyu
    Chen, Yundai
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [38] Updated Gene Prediction of the Cucumber (9930) Genome through Manual Annotation
    Du, Weixuan
    Xia, Lei
    Li, Rui
    Zhao, Xiaokun
    Jin, Danna
    Wang, Xiaoning
    Pei, Yun
    Zhou, Rong
    Chen, Jinfeng
    Yu, Xiaqing
    [J]. PLANTS-BASEL, 2024, 13 (12):
  • [39] PROMOTER SEQUENCES OF EUKARYOTIC PROTEIN-CODING GENES
    CHAMBON, P
    [J]. HOPPE-SEYLERS ZEITSCHRIFT FUR PHYSIOLOGISCHE CHEMIE, 1981, 362 (04): : 381 - 381
  • [40] The mitochondrial genome of lberobaenia (Coleoptera: Iberobaeniidae): first rearrangement of protein-coding genes in the beetles
    Andujar, Carmelo
    Arribas, Paula
    Linard, Benjamin
    Kundrata, Robin
    Bocak, Ladislav
    Vogler, Alfried P.
    [J]. MITOCHONDRIAL DNA PART A, 2017, 28 (1-2) : 156 - 158