De Novo Annotation of Transposable Elements: Tackling the Fat Genome Issue

Cited by: 12
Authors
Jamilloux, Veronique [1]
Daron, Josquin [2,3]
Choulet, Frederic [2,3]
Quesneville, Hadi [1]
Affiliations
[1] INRA, INRA Versailles, URGI Res Unit Genom Info, UR1164, F-78026 Versailles, France
[2] INRA, Divers & Ecophysiol Cereals, UMR1095 Genet, F-63039 Clermont Ferrand, France
[3] Univ Blaise Pascal, Divers & Ecophysiol Cereals, UMR1095 Genet, F-63039 Clermont Ferrand, France
Keywords
Bioinformatics; genomics; SEQUENCED GENOMES; TANDEM REPEATS; DNA-SEQUENCES; EVOLUTION; REVEALS; IDENTIFICATION; ORGANIZATION; EFFICIENT; FAMILIES; FEATURES;
DOI
10.1109/JPROC.2016.2590833
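Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract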
Transposable elements (TEs) constitute the most dynamic and the largest component of large plant genomes: for example, 80% to 90% of the maize genome and the wheat genome may be TEs. De novo TE annotation is therefore a computational challenge, and we investigated, using current tools in the REPET package, new strategies to overcome the difficulties. We tested our methodological developments on the sequence of chromosome 3B of hexaploid wheat; at approximately 1 Gb, this chromosome is one of the "fattest" genomes ever sequenced. We successfully established various strategies for annotating TEs in such a complex dataset. Our analyses show that all of our strategies can overcome the current limitations for de novo TE discovery in large plant genomes. Relative to annotation based on a library of known TEs, our de novo approaches improved genome coverage (from 84% to 90%) and increased the number of full-length annotated copies (from 14,830 to 15,905). We also developed two new metrics for qualifying TE annotation: NTE50 is the number of annotations that cover 50% of the genome, and LTE50 is the smallest size among those annotations. NTE50 decreased from 124,868 to 93,633 annotations, and LTE50 increased from 1,839 to 2,659. This work shows how to obtain comprehensive and high-quality automatic TE annotation for a number of economically and agronomically important species.
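The two metrics named in the abstract can be illustrated with a minimal sketch. It rests on assumptions that the abstract does not spell out: annotations are taken from largest to smallest (by analogy with assembly statistics such as N50), annotations do not overlap, and lengths are given in base pairs. The function name `nte50_lte50` and the toy values are hypothetical and are not taken from the REPET package.

```python
from typing import Iterable, Optional, Tuple


def nte50_lte50(
    annotation_lengths: Iterable[int], genome_length: int
) -> Optional[Tuple[int, int]]:
    """Return (NTE50, LTE50) for a set of TE annotation lengths.

    Assumption: annotations are accumulated from largest to smallest
    until they cover at least 50% of the genome. NTE50 is the number
    of annotations used; LTE50 is the length of the smallest one used.
    Returns None when the annotations cover less than 50% of the genome.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")

    target = genome_length / 2  # 50% of the genome
    covered = 0
    ordered = sorted(annotation_lengths, reverse=True)
    for count, length in enumerate(ordered, start=1):
        covered += length
        if covered >= target:
            return count, length
    return None


# Toy example (hypothetical values, not data from the paper):
# the three largest annotations (500 + 300 + 200) reach half of 2000.
print(nte50_lte50([500, 300, 200, 100, 50], genome_length=2000))  # (3, 200)
```

Under this reading, a lower NTE50 and a higher LTE50 both indicate that the genome is covered by fewer, longer annotations, which matches the direction of the changes reported in the abstract.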
Pages: 474-481
Page count: 8
Related papers
50 items in total
  • [1] De Novo Annotation of Transposable Elements: Tackling the Fat Genome Issue (vol 105, pg 474, 2017)
    Jamilloux, Veronique
    Daron, Josquin
    Choulet, Frederic
    Quesneville, Hadi
    PROCEEDINGS OF THE IEEE, 2017, 105 (05) : 978 - 978
  • [2] Combined evidence annotation of transposable elements in genome sequences
    Quesneville, H
    Bergman, CM
    Andrieu, O
    Autard, D
    Nouaud, D
    Ashburner, M
    Anxolabehere, D
    PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (02) : 166 - 175
  • [3] Considering Transposable Element Diversification in De Novo Annotation Approaches
    Flutre, Timothee
    Duprat, Elodie
    Feuillet, Catherine
    Quesneville, Hadi
    PLOS ONE, 2011, 6 (01):
  • [4] De novo assembly and annotation of the mangrove cricket genome
    Satoh, Aya
    Takasu, Miwako
    Yano, Kentaro
    Terai, Yohey
    BMC RESEARCH NOTES, 2021, 14 (01)
  • [5] De novo assembly and annotation of the singing mouse genome
    Smith, Samantha K.
    Frazel, Paul W.
    Khodadadi-Jamayran, Alireza
    Zappile, Paul
    Marier, Christian
    Okhovat, Mariam
    Brown, Stuart
    Long, Michael A.
    Heguy, Adriana
    Phelps, Steven M.
    BMC GENOMICS, 2023, 24 (01)
  • [6] De novo assembly and annotation of the Ganoderma australe genome
    Agudelo-Valencia, Daniel
    Uribe-Echeverry, Paula Tatiana
    Betancur-Perez, John Fredy
    GENOMICS, 2020, 112 (01) : 930 - 933
  • [9] Transposable element annotation of the rice genome
    Juretic, N
    Bureau, TE
    Bruskiewich, RM
    BIOINFORMATICS, 2004, 20 (02) : 155 - 160
  • [10] A De Novo Whole Genome Assembly and Annotation of Parelaphostrongylus tenuis
    Garwood, Tyler J.
    Richards, Jessie E.
    Macchietto, Marissa G.
    Gerhold, Richard W.
    Kania, Stephen A.
    Garbe, John R.
    Fountain-Jones, Nicholas M.
    Larsen, Peter A.
    Wolf, Tiffany M.
    JOURNAL OF NEMATOLOGY, 2024, 56 (01)