nGASP - the nematode genome annotation assessment project

Cited by: 31
Authors
Coghlan, Avril [2]
Fiedler, Tristan J. [3]
McKay, Sheldon J. [1]
Flicek, Paul [4]
Harris, Todd W. [1]
Blasiar, Darin [5]
Stein, Lincoln D. [1]
Affiliations
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[2] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[3] Florida Inst Technol, Dept Biol Sci, Melbourne, FL 32901 USA
[4] European Bioinformat Inst, Cambridge CB10 1SD, England
[5] Washington Univ, Sch Med, St Louis, MO 63108 USA
Funding
Wellcome Trust (UK); National Institutes of Health (US)
DOI
10.1186/1471-2105-9-549
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase.

Results: The most accurate gene-finders were 'combiner' algorithms, which made use of transcript and protein alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene-level sensitivity of combiners was 78% and their specificity was 42%, nearly the same accuracy as that reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with unusually many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs, posed the greatest difficulty for gene-finders.

Conclusion: This experiment establishes a baseline of gene prediction accuracy in Caenorhabditis genomes, and has guided the choice of gene-finders for the annotation of newly sequenced genomes of Caenorhabditis and other nematode species. We have created new gene sets for C. briggsae, C. remanei, C. brenneri, C. japonica, and Brugia malayi using some of the best-performing gene-finders.
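
The gene-level sensitivity and specificity figures quoted in the abstract can be illustrated with a minimal sketch. It assumes the convention common in gene-prediction benchmarks, where sensitivity is the fraction of reference genes predicted exactly and "specificity" is the fraction of predicted genes that exactly match a reference gene. The abstract does not spell out the matching rules nGASP used, so the exact-match criterion, the function name `gene_level_accuracy`, and the toy coordinates are all assumptions made for illustration.

```python
from typing import FrozenSet, Set, Tuple

# Hypothetical representation: a gene model is the set of its exon
# (start, end) coordinates; two models match only if all exons agree.
Exon = Tuple[int, int]
GeneModel = FrozenSet[Exon]


def gene_level_accuracy(
    reference: Set[GeneModel], predicted: Set[GeneModel]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at the gene level.

    sensitivity = exactly matched genes / reference genes
    specificity = exactly matched genes / predicted genes
    (assumed definitions; the abstract does not state them)
    """
    true_positives = len(reference & predicted)
    sensitivity = true_positives / len(reference) if reference else 0.0
    specificity = true_positives / len(predicted) if predicted else 0.0
    return sensitivity, specificity


# Toy data, invented for illustration only.
reference = {
    frozenset({(100, 200), (300, 400)}),
    frozenset({(1000, 1200)}),
    frozenset({(2000, 2100), (2200, 2300)}),
    frozenset({(5000, 5500)}),
}
predicted = {
    frozenset({(100, 200), (300, 400)}),      # exact match
    frozenset({(1000, 1250)}),                # wrong exon boundary
    frozenset({(2000, 2100), (2200, 2300)}),  # exact match
    frozenset({(7000, 7100)}),                # no reference gene here
    frozenset({(8000, 8100)}),                # no reference gene here
}

sn, sp = gene_level_accuracy(reference, predicted)
print(f"sensitivity={sn:.2f} specificity={sp:.2f}")
```

Under these definitions, a predictor that reports many extra genes keeps its sensitivity but loses specificity, which is one way a set of predictions can combine high sensitivity with much lower specificity.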
Pages: 13