ProtaBank: A repository for protein design and engineering data

被引:41
|
作者
Wang, Connie Y. [1 ]
Chang, Paul M. [1 ]
Ary, Marie L. [1 ]
Allen, Benjamin D. [1 ,2 ,3 ]
Chica, Roberto A. [4 ]
Mayo, Stephen L. [1 ,5 ,6 ]
Olafson, Barry D. [1 ]
机构
[1] Protabit LLC, 129 N Hill Ave,Suite 102, Pasadena, CA 91106 USA
[2] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
[3] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
[4] Univ Ottawa, Dept Chem & Biomol Sci, Ottawa, ON K1N 6N5, Canada
[5] CALTECH, Div Biol & Biol Engn, Pasadena, CA 91125 USA
[6] CALTECH, Div Chem & Chem Engn, Pasadena, CA 91125 USA
基金
美国国家卫生研究院;
关键词
protein engineering; protein design; relational database; protein mutants; data resource; protein stability; data sets; IMMUNOGLOBULIN BINDING DOMAIN; CRYSTAL-STRUCTURE; BETA-LACTAMASE; STABILITY; OPTIMIZATION; GENERATION; AFFINITY; MUTANTS; GENE;
D O I
10.1002/pro.3406
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present ProtaBank, a repository for storing, querying, analyzing, and sharing protein design and engineering data in an actively maintained and updated database. ProtaBank provides a format to describe and compare all types of protein mutational data, spanning a wide range of properties and techniques. It features a user-friendly web interface and programming layer that streamlines data deposition and allows for batch input and queries. The database schema design incorporates a standard format for reporting protein sequences and experimental data that facilitates comparison of results across different data sets. A suite of analysis and visualization tools are provided to facilitate discovery, to guide future designs, and to benchmark and train new predictive tools and algorithms. ProtaBank will provide a valuable resource to the protein engineering community by storing and safeguarding newly generated data, allowing for fast searching and identification of relevant data from the existing literature, and exploring correlations between disparate data sets. ProtaBank invites researchers to contribute data to the database to make it accessible for search and analysis. ProtaBank is available at .
引用
收藏
页码:1113 / 1124
页数:12
相关论文
共 50 条
  • [21] A metadata-driven approach to data repository design
    Harvey, Matthew J.
    McLean, Andrew
    Rzepa, Henry S.
    JOURNAL OF CHEMINFORMATICS, 2017, 9
  • [22] A metadata-driven approach to data repository design
    Matthew J. Harvey
    Andrew McLean
    Henry S. Rzepa
    Journal of Cheminformatics, 9
  • [23] Repository design
    Christopher St., M.J.
    Underground Space, 1981, 6 (4-5): : 247 - 258
  • [24] REPOSITORY DESIGN
    STJOHN, CM
    UNDERGROUND SPACE, 1982, 6 (4-5): : 247 - 258
  • [25] The Design and Implementation of a Repository for the Management of Spatial Data Integrity Constraints
    Sophie Cockcroft
    GeoInformatica, 2004, 8 : 49 - 69
  • [26] DESIGN AND SPECIFICATIONS OF A REPOSITORY FOR REAL-TIME OPEN DATA
    Lutchman, Sudesh
    Hosein, Patrick
    PROCEEDINGS OF THE 2014 ITU KALEIDOSCOPE ACADEMIC CONFERENCE: LIVING IN A CONVERGED WORLD: IMPOSSIBLE WITHOUT STANDARDS?, 2014,
  • [27] The Listening and Spoken Language Data Repository: Design and Project Overview
    Bradham, Tamala S.
    Fonnesbeck, Christopher
    Toll, Alice
    Hecht, Barbara F.
    LANGUAGE SPEECH AND HEARING SERVICES IN SCHOOLS, 2018, 49 (01) : 108 - 120
  • [28] Protein design for pathway engineering
    Eriksen, Dawn T.
    Lian, Jiazhang
    Zhao, Huimin
    JOURNAL OF STRUCTURAL BIOLOGY, 2014, 185 (02) : 234 - 242
  • [29] The design and implementation of a repository for the management of spatial data integrity constraints
    Cockcroft, S
    GEOINFORMATICA, 2004, 8 (01) : 49 - 69
  • [30] Design, development, and implementation of IsoBank: A centralized repository for isotopic data
    Shipley, Oliver N.
    Dabrowski, Anna J.
    Bowen, Gabriel J.
    Hayden, Brian
    Pauli, Jonathan N.
    Jordan, Christopher
    Anderson, Lesleigh
    Bailey, Adriana
    Bataille, Clement P.
    Cicero, Carla
    Close, Hilary G.
    Cook, Craig
    Cook, Joseph A.
    Desai, Ankur R.
    Evaristo, Jaivime
    Filley, Tim R.
    France, Christine A. M.
    Jackson, Andrew L.
    Kim, Sora Lee
    Kopf, Sebastian
    Loisel, Julie
    Manlick, Philip J.
    Mcfarlin, Jamie M.
    Mcmeans, Bailey C.
    O'Connell, Tamsin C.
    Pilaar Birch, Suzanne E.
    Putman, Annie L.
    Semmens, Brice X.
    Stantis, Chris
    Stricker, Craig A.
    Szejner, Paul
    Trammell, Tara L. E.
    Uhen, Mark D.
    Weintraub-Leff, Samantha
    Wooller, Matthew J.
    Williams, John W.
    Yarnes, Christopher T.
    Vander Zanden, Hannah B.
    Newsome, Seth D.
    PLOS ONE, 2024, 19 (09):