An Educational Bioinformatics Project to Improve Genome Annotation

Cited by: 5
Authors
Amatore, Zoie [1]
Gunn, Susan [2]
Harris, Laura K. [1]
Affiliations
[1] Davenport Univ, Sci Dept, Harris Interdisciplinary Res, Lansing, MI 49512 USA
[2] Davenport Univ, Coll Urban Educ, Grand Rapids, MI USA
Keywords
bioinformatics; hypothetical protein; genome annotation; education; classroom; undergraduate; HYPOTHETICAL PROTEINS; FUNCTIONAL ANNOTATION; PSI-BLAST; INTERACTION NETWORKS; ENRICHMENT ANALYSIS; I-TASSER; PREDICTION; IDENTIFICATION; SEQUENCE; SEARCH
DOI
10.3389/fmicb.2020.577497
Chinese Library Classification number
Q93 [Microbiology]
Discipline classification code
071005; 100705
Abstract
Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, manual reviews are often needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines whether HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure, including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or to prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos on YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter.
Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student's investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honors, and experiential learning projects.
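As a rough illustration of how a student might organize the results of the four analyses named in the abstract, the following minimal Python sketch collects findings per hypothetical protein and applies a simple triage rule. Everything in it is an assumption made for illustration: the class `HPEvidence`, the function `triage`, the cutoff `min_supporting`, and the identifier `HP_0001` are hypothetical and do not come from the paper, which does not specify a numeric rule.

```python
from dataclasses import dataclass, field

# The four analysis categories listed in the abstract.
ANALYSES = (
    "sequence_similarity",
    "domains",
    "tertiary_structure",
    "cellular_location",
)


@dataclass
class HPEvidence:
    """Findings collected for one hypothetical protein (HP)."""

    protein_id: str
    # Maps an analysis name to a short note; a missing key means
    # that analysis gave no informative result.
    findings: dict = field(default_factory=dict)

    def record(self, analysis: str, note: str) -> None:
        """Store a finding, rejecting analysis names not in ANALYSES."""
        if analysis not in ANALYSES:
            raise ValueError(f"unknown analysis: {analysis}")
        self.findings[analysis] = note

    def supported_analyses(self) -> list:
        """Return the analyses that produced a finding, in ANALYSES order."""
        return [name for name in ANALYSES if name in self.findings]


def triage(evidence: HPEvidence, min_supporting: int = 3) -> str:
    """Illustrative rule only: min_supporting is an assumed cutoff."""
    if len(evidence.supported_analyses()) >= min_supporting:
        return "re-annotation candidate"
    return "prioritize for experiment"


# Placeholder identifier and notes, not real data.
hp = HPEvidence("HP_0001")
hp.record("sequence_similarity", "similar to a characterized protein")
hp.record("domains", "one conserved domain found")
hp.record("cellular_location", "predicted cytoplasmic")
print(hp.protein_id, "->", triage(hp))
```

In practice the decision would rest on the quality of each finding rather than a simple count; the count is used here only to keep the sketch short.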
Pages: 15