The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes

被引:15
|
作者
Estill, James C. [1 ]
Bennetzen, Jeffrey L. [2 ]
机构
[1] Univ Georgia, Dept Plant Biol, Athens, GA 30602 USA
[2] Univ Georgia, Dept Genet, Athens, GA 30602 USA
关键词
DE-NOVO IDENTIFICATION; DATABASE; PREDICTION; ALIGNMENT; SEQUENCE; PROGRAM; FAMILIES; VISUALIZATION; RESOURCE; BROWSER;
D O I
10.1186/1746-4811-5-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: High quality annotation of the genes and transposable elements in complex genomes requires a human-curated integration of multiple sources of computational evidence. These evidences include results from a diversity of ab initio prediction programs as well as homology-based searches. Most of these programs operate on a single contiguous sequence at a time, and the results are generated in a diverse array of readable formats that must be translated to a standardized file format. These translated results must then be concatenated into a single source, and then presented in an integrated form for human curation. Results: We have designed, implemented, and assessed a Perl-based workflow named DAWGPAWS for the generation of computational results for human curation of the genes and transposable elements in plant genomes. The use of DAWGPAWS was found to accelerate annotation of 80-200 kb wheat DNA inserts in bacterial artificial chromosome (BAC) vectors by approximately twenty-fold and to also significantly improve the quality of the annotation in terms of completeness and accuracy. Conclusion: The DAWGPAWS genome annotation pipeline fills an important need in the annotation of plant genomes by generating computational evidences in a high throughput manner, translating these results to a common file format, and facilitating the human curation of these computational results. We have verified the value of DAWGPAWS by using this pipeline to annotate the genes and transposable elements in 220 BAC insertions from the hexaploid wheat genome (Triticum aestivum L.). DAWGPAWS can be applied to annotation efforts in other plant genomes with minor modifications of program-specific configuration files, and the modular design of the workflow facilitates integration into existing pipelines.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes
    James C Estill
    Jeffrey L Bennetzen
    Plant Methods, 5
  • [2] Arguments for standardizing transposable element annotation in plant genomes
    Ragupathy, Raja
    You, Frank M.
    Cloutier, Sylvie
    TRENDS IN PLANT SCIENCE, 2013, 18 (07) : 367 - 376
  • [3] Characterization and functional annotation of nested transposable elements in eukaryotic genomes
    Gao, Caihua
    Xiao, Meili
    Ren, Xiaodong
    Hayward, Alice
    Yin, Jiaming
    Wu, Likun
    Fu, Donghui
    Li, Jiana
    GENOMICS, 2012, 100 (04) : 222 - 230
  • [4] Impact of transposable elements on polyploid plant genomes
    Vicient, Carlos M.
    Casacuberta, Josep M.
    ANNALS OF BOTANY, 2017, 120 (02) : 195 - 207
  • [5] Transposable elements and the plant pan-genomes
    Morgante, Michele
    De Paoli, Emanuele
    Radovic, Slobodanka
    CURRENT OPINION IN PLANT BIOLOGY, 2007, 10 (02) : 149 - 155
  • [6] CAULIFINDER: a pipeline for the automated detection and annotation of caulimovirid endogenous viral elements in plant genomes
    Héléna Vassilieff
    Sana Haddad
    Véronique Jamilloux
    Nathalie Choisne
    Vikas Sharma
    Delphine Giraud
    Mariène Wan
    Saad Serfraz
    Andrew D. W. Geering
    Pierre-Yves Teycheney
    Florian Maumus
    Mobile DNA, 13
  • [7] CAULIFINDER: a pipeline for the automated detection and annotation of caulimovirid endogenous viral elements in plant genomes
    Vassilieff, Helena
    Haddad, Sana
    Jamilloux, Veronique
    Choisne, Nathalie
    Sharma, Vikas
    Giraud, Delphine
    Wan, Mariene
    Serfraz, Saad
    Geering, Andrew D. W.
    Teycheney, Pierre-Yves
    Maumus, Florian
    MOBILE DNA, 2022, 13 (01)
  • [8] What makes up plant genomes: The vanishing line between transposable elements and genes
    Zhao, Dongyan
    Ferguson, Ann A.
    Jiang, Ning
    BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS, 2016, 1859 (02): : 366 - 380
  • [9] The Functional Impact of Transposable Elements on the Diversity of Plant Genomes
    Grzebelus, Dariusz
    DIVERSITY-BASEL, 2018, 10 (02):
  • [10] Transposable elements associated with normal plant genes
    Wessler, SR
    PHYSIOLOGIA PLANTARUM, 1998, 103 (04) : 581 - 586