PANNOTATOR: an automated tool for annotation of pan-genomes

Cited by: 14
Authors
Santos, A. R. [1, 5]
Barbosa, E. [1]
Fiaux, K. [1]
Zurita-Turk, M. [1]
Chaitankar, V. [2]
Kamapantula, B. [2]
Abdelzaher, A. [2]
Ghosh, P. [2]
Tiwari, S. [3]
Barve, N. [3]
Jain, N. [3]
Barh, D. [3]
Silva, A. [4]
Miyoshi, A. [1]
Azevedo, V. [1]
Affiliations
[1] Univ Fed Minas Gerais, Inst Ciencias Biol, Lab Genet Celular & Mol, Belo Horizonte, MG, Brazil
[2] Virginia Commonwealth Univ, Dept Comp Sci, Biol Networks Lab, Richmond, VA USA
[3] Inst Integrat Omics & Appl Biotechnol, Ctr Genom & Appl Gene Technol, Purba Medinipur, W Bengal, India
[4] Fed Univ Para, Lab Polimorfismo DNA, BR-66059 Belem, Para, Brazil
[5] Univ Fed Uberlandia, Fac Comp, BR-38400 Uberlandia, MG, Brazil
Funding
US National Science Foundation
Keywords
Bacterial pan-genomes; Cut-off value parameterized; Automatic annotation; Reference genome; Web interface; SEQUENCE; SERVER;
DOI
10.4238/2013.August.16.2
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Thanks to next-generation sequencing technologies, sequencing of bacterial genomes is no longer one of the main bottlenecks in bacterial research, and the number of new genomes deposited in public databases continues to increase at an accelerating rate. Among these new genomes, several belong to the same species and were generated for pan-genomic studies. A pan-genomic study allows investigation of phenotypic differences between strains based on their genotypic differences. Along with good assembly quality, good functional genome annotation of the different strains is also fundamental. To ensure quality and standards for functional genome annotation among different strains, we developed and made available PANNOTATOR (http://bnet.egr.vcu.edu/iioab/agenote.php), a web-based automated pipeline for the annotation of closely related genomes that are well suited to pan-genome studies, aiming to reduce the manual work needed to generate reports and corrections for the genomes of various strains. PANNOTATOR achieved 98% and 76% correctness for gene name and function, respectively, as a result of annotation transfer with a similarity cut-off of 70%, compared with a gold standard annotation for the same species. These results surpassed the RAST and BASys software by 41 and 21% and 66 and 17% for gene name and function annotation, respectively, when reliable genome annotations of closely related species were available. PANNOTATOR provides fast and reliable pan-genome annotation, thereby allowing us to keep the research focus on the main genotypic differences between strains.
Pages: 2982-2989
Number of pages: 8