An Educational Bioinformatics Project to Improve Genome Annotation

被引:5
|
作者
Amatore, Zoie [1 ]
Gunn, Susan [2 ]
Harris, Laura K. [1 ]
机构
[1] Davenport Univ, Sci Dept, Harris Interdisciplinary Res, Lansing, MI 49512 USA
[2] Davenport Univ, Coll Urban Educ, Grand Rapids, MI USA
关键词
bioinformatics; hypothetical protein; genome annotation; education; classroom; undergraduate; HYPOTHETICAL PROTEINS; FUNCTIONAL ANNOTATION; PSI-BLAST; INTERACTION NETWORKS; ENRICHMENT ANALYSIS; I-TASSER; PREDICTION; IDENTIFICATION; SEQUENCE; SEARCH;
D O I
10.3389/fmicb.2020.577497
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, often manual reviews are needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines if HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based, bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos in YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter. Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student's investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honor's, and experiential learning projects.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Applying negative rule mining to improve genome annotation
    Artamonova, Irena I.
    Frishman, Goar
    Frishman, Dmitrij
    BMC BIOINFORMATICS, 2007, 8
  • [22] A bioinformatics approach to reanalyze the genome annotation of kinetoplastid protozoan parasite Leishmania donovani
    Pawar, Harsh
    Kulkarni, Aditi
    Dixit, Tanwi
    Chaphekar, Deepa
    Patole, Milind S.
    GENOMICS, 2014, 104 (06) : 554 - 561
  • [23] Curated genome annotation of Oryza sativa ssp japonica and comparative genome analysis with Arabidopsis thaliana -: The Rice Annotation Project
    Gojobori, Takashi
    GENOME RESEARCH, 2007, 17 (02) : 175 - 183
  • [24] A biologist's view of the Drosophila genome annotation assessment project
    Ashburner, M
    GENOME RESEARCH, 2000, 10 (04) : 391 - 393
  • [25] Gene Re-annotation in Genome of the Extremophile Pyrobaculum Aerophilum by Using Bioinformatics Methods
    Du, Meng-Ze
    Guo, Feng-Biao
    Chen, Yue-Yun
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2011, 29 (02): : 391 - 401
  • [26] The Genome Solver Project: Faculty Training and Student Performance Gains in Bioinformatics
    Mathuri, Vinayak
    Arora, Gaurav S.
    McWilliams, Mindy
    Russell, Janet
    Rosenwald, Anne G.
    JOURNAL OF MICROBIOLOGY & BIOLOGY EDUCATION, 2019, 20 (01)
  • [27] EDUCATIONAL RESOURCES - TEACHING ABOUT THE HUMAN GENOME PROJECT
    BAUMILLER, RC
    AMERICAN JOURNAL OF HUMAN GENETICS, 1991, 49 (02) : 501 - 502
  • [28] An educational project to improve knowledge related to pulse oximetry
    Attin, M
    Cardin, S
    Dee, V
    Doering, L
    Dunn, D
    Ellstrom, K
    Erickson, V
    Etchepare, M
    Gawlinski, A
    Haley, T
    Henneman, E
    Keckeisen, M
    Malmet, M
    Olson, L
    AMERICAN JOURNAL OF CRITICAL CARE, 2002, 11 (06) : 529 - 534
  • [29] Genome annotation
    Aubourg, S
    Rouzé, P
    PLANT PHYSIOLOGY AND BIOCHEMISTRY, 2001, 39 (3-4) : 181 - 193
  • [30] Annotation confidence score for genome annotation: a genome comparison approach
    Yang, Youngik
    Gilbert, Donald
    Kim, Sun
    BIOINFORMATICS, 2010, 26 (01) : 22 - 29