Re-annotation of the woodland strawberry (Fragaria vesca) genome

被引:48
|
作者
Darwish, Omar [1 ]
Shahan, Rachel [2 ]
Liu, Zhongchi [2 ]
Slovin, Janet P. [3 ]
Alkharouf, Nadim W. [1 ]
机构
[1] Towson Univ, Dept Comp & Informat Sci, Towson, MD 21252 USA
[2] Univ Maryland, Dept Cell Biol & Mol Genet, College Pk, MD 20742 USA
[3] USDA ARS, Genet Improvement Fruits & Vegetables Lab, Beltsville, MD 20705 USA
来源
BMC GENOMICS | 2015年 / 16卷
基金
美国国家科学基金会;
关键词
Annotation; Strawberry; Fragaria vesca; Transcriptome; Genome; RNA-Seq; Gene; Rosaceae; HIDDEN MARKOV MODEL; RNA-SEQ; GENE PREDICTION; ALIGNMENT; IDENTIFICATION; TRANSCRIPTS; SEQUENCES; BLAST2GO; TOPHAT; TOOL;
D O I
10.1186/s12864-015-1221-1
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Fragaria vesca is a low-growing, small-fruited diploid strawberry species commonly called woodland strawberry. It is native to temperate regions of Eurasia and North America and while it produces edible fruits, it is most highly useful as an experimental perennial plant system that can serve as a model for the agriculturally important Rosaceae family. A draft of the F. vesca genome sequence was published in 2011 [Nat Genet 43:223,2011]. The first generation annotation (version 1.1) were developed using GeneMark-ES+[Nuc Acids Res 33:6494,2005] which is a self-training gene prediction tool that relies primarily on the combination of ab initio predictions with mapping high confidence ESTs in addition to mapping gene deserts from transposable elements. Based on over 25 different tissue transcriptomes, we have revised the F. vesca genome annotation, thereby providing several improvements over version 1.1. Results: The new annotation, which was achieved using Maker, describes many more predicted protein coding genes compared to the GeneMark generated annotation that is currently hosted at the Genome Database for Rosaceae (http://www.rosaceae.org/). Our new annotation also results in an increase in the overall total coding length, and the number of coding regions found. The total number of gene predictions that do not overlap with the previous annotations is 2286, most of which were found to be homologous to other plant genes. We have experimentally verified one of the new gene model predictions to validate our results. Conclusions: Using the RNA-Seq transcriptome sequences from 25 diverse tissue types, the re-annotation pipeline improved existing annotations by increasing the annotation accuracy based on extensive transcriptome data. It uncovered new genes, added exons to current genes, and extended or merged exons. This complete genome re-annotation will significantly benefit functional genomic studies of the strawberry and other members of the Rosaceae.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Re-annotation of the woodland strawberry (Fragaria vesca) genome
    Omar Darwish
    Rachel Shahan
    Zhongchi Liu
    Janet P Slovin
    Nadim W Alkharouf
    [J]. BMC Genomics, 16
  • [2] The genome of woodland strawberry (Fragaria vesca)
    Shulaev, Vladimir
    Sargent, Daniel J.
    Crowhurst, Ross N.
    Mockler, Todd C.
    Folkerts, Otto
    Delcher, Arthur L.
    Jaiswal, Pankaj
    Mockaitis, Keithanne
    Liston, Aaron
    Mane, Shrinivasrao P.
    Burns, Paul
    Davis, Thomas M.
    Slovin, Janet P.
    Bassil, Nahla
    Hellens, Roger P.
    Evans, Clive
    Harkins, Tim
    Kodira, Chinnappa
    Desany, Brian
    Crasta, Oswald R.
    Jensen, Roderick V.
    Allan, Andrew C.
    Michael, Todd P.
    Setubal, Joao Carlos
    Celton, Jean-Marc
    Rees, D. Jasper G.
    Williams, Kelly P.
    Holt, Sarah H.
    Rojas, Juan Jairo Ruiz
    Chatterjee, Mithu
    Liu, Bo
    Silva, Herman
    Meisel, Lee
    Adato, Avital
    Filichkin, Sergei A.
    Troggio, Michela
    Viola, Roberto
    Ashman, Tia-Lynn
    Wang, Hao
    Dharmawardhana, Palitha
    Elser, Justin
    Raja, Rajani
    Priest, Henry D.
    Bryant, Douglas W., Jr.
    Fox, Samuel E.
    Givan, Scott A.
    Wilhelm, Larry J.
    Naithani, Sushma
    Christoffels, Alan
    Salama, David Y.
    [J]. NATURE GENETICS, 2011, 43 (02) : 109 - 116
  • [3] The genome of woodland strawberry (Fragaria vesca)
    Vladimir Shulaev
    Daniel J Sargent
    Ross N Crowhurst
    Todd C Mockler
    Otto Folkerts
    Arthur L Delcher
    Pankaj Jaiswal
    Keithanne Mockaitis
    Aaron Liston
    Shrinivasrao P Mane
    Paul Burns
    Thomas M Davis
    Janet P Slovin
    Nahla Bassil
    Roger P Hellens
    Clive Evans
    Tim Harkins
    Chinnappa Kodira
    Brian Desany
    Oswald R Crasta
    Roderick V Jensen
    Andrew C Allan
    Todd P Michael
    Joao Carlos Setubal
    Jean-Marc Celton
    D Jasper G Rees
    Kelly P Williams
    Sarah H Holt
    Juan Jairo Ruiz Rojas
    Mithu Chatterjee
    Bo Liu
    Herman Silva
    Lee Meisel
    Avital Adato
    Sergei A Filichkin
    Michela Troggio
    Roberto Viola
    Tia-Lynn Ashman
    Hao Wang
    Palitha Dharmawardhana
    Justin Elser
    Rajani Raja
    Henry D Priest
    Douglas W Bryant
    Samuel E Fox
    Scott A Givan
    Larry J Wilhelm
    Sushma Naithani
    Alan Christoffels
    David Y Salama
    [J]. Nature Genetics, 2011, 43 : 109 - 116
  • [4] Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
    Li, Yongping
    Wei, Wei
    Feng, Jia
    Luo, Huifeng
    Pi, Mengting
    Liu, Zhongchi
    Kang, Chunying
    [J]. DNA RESEARCH, 2018, 25 (01) : 61 - 70
  • [5] The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry)
    Buti, Matteo
    Moretto, Marco
    Barghini, Elena
    Mascagni, Flavia
    Natali, Lucia
    Brilli, Matteo
    Lomsadze, Alexandre
    Sonego, Paolo
    Giongo, Lara
    Alonge, Michael
    Velasco, Riccardo
    Varotto, Claudio
    Surbanovski, Nada
    Borodovsky, Mark
    Ward, Judson A.
    Engelen, Kristof
    Cavallini, Andrea
    Cestaro, Alessandro
    Sargent, Daniel James
    [J]. GIGASCIENCE, 2018, 7 (04): : 1 - 14
  • [6] Updated annotation of the wild strawberry Fragaria vesca V4 genome
    Li, Yongping
    Pi, Mengting
    Gao, Qi
    Liu, Zhongchi
    Kang, Chunying
    [J]. HORTICULTURE RESEARCH, 2019, 6
  • [7] FragariaCyc: A Metabolic Pathway Database for Woodland Strawberry Fragaria vesca
    Naithani, Sushma
    Partipilo, Christina M.
    Raja, Rajani
    Elser, Justin L.
    Jaiswal, Pankaj
    [J]. FRONTIERS IN PLANT SCIENCE, 2016, 7
  • [8] Survey of simple sequence repeats in woodland strawberry (Fragaria vesca)
    Guan, L.
    Huang, J. F.
    Feng, G. Q.
    Wang, X. W.
    Wang, Y.
    Chen, B. Y.
    Qiao, Y. S.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2013, 12 (03): : 2637 - 2651
  • [9] Integrated Karyotyping of Woodland Strawberry (Fragaria vesca) with Oligopaint FISH Probes
    Qu, Manman
    Li, Kunpeng
    Han, Yanli
    Chen, Lei
    Li, Zongyun
    Han, Yonghua
    [J]. CYTOGENETIC AND GENOME RESEARCH, 2017, 153 (03) : 158 - 164
  • [10] Genome-wide identification and expression analysis of the SPL gene family in woodland strawberry Fragaria vesca
    Xiong, Jin-Song
    Zheng, Dan
    Zhu, Hong-Yu
    Chen, Jian-Qiu
    Na, Ran
    Cheng, Zong-Ming
    [J]. GENOME, 2018, 61 (09) : 675 - 683