An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome

被引:1
|
作者
Song, Hongtao [1 ]
Lin, Kui [1 ]
Hu, Jinglu [2 ]
Pang, Erli [1 ]
机构
[1] Beijing Normal Univ, Coll Life Sci, MOE Key Lab Biodivers Sci & Ecol Engn, Beijing, Peoples R China
[2] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka, Japan
来源
基金
中国国家自然科学基金;
关键词
cucumber; gene functional annotation; collinear segments; orthology; protein-coding gene; INFORMATION RESOURCE TAIR; RNA-SEQ; PLANT GENOMES; DRAFT GENOME; SEQUENCE; IDENTIFICATION; DUPLICATION; ORTHOLOGY; SYNTENY; POTATO;
D O I
10.3389/fpls.2018.00325
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allows for better and more robust functional assignments of predicted genes. As one of the most reliable strategies, the determination of collinearity informationmay facilitate reliable orthology inferences among genes from multiple related genomes. Currently, the identification of collinear segments has mainly been based on conservation of gene order and orientation. Over the course of plant genome evolution, various evolutionary events have disrupted or distorted the order of genes along chromosomes, making it difficult to use those genes as genome-wide markers for plant genome comparisons. Results: Using the localized LASTZ/MULTIZ analysis pipeline, we aligned 15 genomes, including cucumber and other related angiosperm plants, and identified a set of genomic segments that are short in length, stable in structure, uniform in distribution and highly conserved across all 15 plants. Compared with protein-coding genes, these conserved segments were more suitable for use as genomic markers for detecting collinear segments among distantly divergent plants. Guided by this set of identified collinear genomic segments, we inferred 94,486 orthologous protein-coding gene pairs (OPPs) between cucumber and 14 other angiosperm species, which were used as proxies for transferring functional terms to cucumber genes from the annotations of the other 14 genomes. In total, 10,885 protein-coding genes were assigned Gene Ontology (GO) terms which was nearly 1,300more than results collected in Uniprot-proteomic database. Our results showed that annotation accuracy would been improved compared with other existing approaches. Conclusions: In this study, we provided an alternative resource for the functional annotation of predicted cucumber protein-coding genes, which we expect will be beneficial for the cucumber's biological study, accessible from http://cmb. bnu. edu. cn/functional_annotation. Meanwhile, using the cucumber reference genome as a case study, we presented an efficient strategy for transferring gene functional information from previously well-characterized protein-coding genes inmodel species to newly sequenced or "non-model" plant species.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] RNA-Seq improves annotation of protein-coding genes in the cucumber genome
    Li, Zhen
    Zhang, Zhonghua
    Yan, Pengcheng
    Huang, Sanwen
    Fei, Zhangjun
    Lin, Kui
    [J]. BMC GENOMICS, 2011, 12
  • [2] RNA-Seq improves annotation of protein-coding genes in the cucumber genome
    Zhen Li
    Zhonghua Zhang
    Pengcheng Yan
    Sanwen Huang
    Zhangjun Fei
    Kui Lin
    [J]. BMC Genomics, 12
  • [3] Genome-wide annotation of protein-coding genes in pig
    Max Karlsson
    Evelina Sjöstedt
    Per Oksvold
    Åsa Sivertsson
    Jinrong Huang
    María Bueno Álvez
    Muhammad Arif
    Xiangyu Li
    Lin Lin
    Jiaying Yu
    Tao Ma
    Fengping Xu
    Peng Han
    Hui Jiang
    Adil Mardinoglu
    Cheng Zhang
    Kalle von Feilitzen
    Xun Xu
    Jian Wang
    Huanming Yang
    Lars Bolund
    Wen Zhong
    Linn Fagerberg
    Cecilia Lindskog
    Fredrik Pontén
    Jan Mulder
    Yonglun Luo
    Mathias Uhlen
    [J]. BMC Biology, 20
  • [4] Genome-wide annotation of protein-coding genes in pig
    Karlsson, Max
    Sjostedt, Evelina
    Oksvold, Per
    Sivertsson, Asa
    Huang, Jinrong
    Alvez, Maria Bueno
    Arif, Muhammad
    Li, Xiangyu
    Lin, Lin
    Yu, Jiaying
    Ma, Tao
    Xu, Fengping
    Han, Peng
    Jiang, Hui
    Mardinoglu, Adil
    Zhang, Cheng
    von Feilitzen, Kalle
    Xu, Xun
    Wang, Jian
    Yang, Huanming
    Bolund, Lars
    Zhong, Wen
    Fagerberg, Linn
    Lindskog, Cecilia
    Ponten, Fredrik
    Mulder, Jan
    Luo, Yonglun
    Uhlen, Mathias
    [J]. BMC BIOLOGY, 2022, 20 (01)
  • [5] ANNOTATION OF PROTEIN-CODING GENES IN FUNGAL GENOMES
    Martinez, Diego
    Grigoriev, Igor
    Salamov, Asaf
    [J]. APPLIED AND COMPUTATIONAL MATHEMATICS, 2010, 9 : 56 - 65
  • [6] Accurate annotation of protein-coding genes in mitochondrial genomes
    Al Arab, Marwa
    zu Siederdissen, Christian Hoener
    Tout, Kifah
    Sahyoun, Abdullah H.
    Stadler, Peter F.
    Bernt, Matthias
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2017, 106 : 209 - 216
  • [7] Current methods for automated annotation of protein-coding genes
    Hoff, K. J.
    Stanke, M.
    [J]. CURRENT OPINION IN INSECT SCIENCE, 2015, 7 : 8 - 14
  • [8] ISOGO: Functional annotation of protein-coding splice variants
    Juan A Ferrer-Bonsoms
    Ignacio Cassol
    Pablo Fernández-Acín
    Carlos Castilla
    Fernando Carazo
    Angel Rubio
    [J]. Scientific Reports, 10
  • [9] ISOGO: Functional annotation of protein-coding splice variants
    Ferrer-Bonsoms, Juan A.
    Cassol, Ignacio
    Fernandez-Acin, Pablo
    Castilla, Carlos
    Carazo, Fernando
    Rubio, Angel
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [10] Computational analysis on two putative mitochondrial protein-coding genes from the Emydura subglobosa genome: A functional annotation approach
    Yu, Megan
    [J]. PLOS ONE, 2022, 17 (08):