Integrating genomic information with protein sequence and 3D atomic level structure at the RCSB protein data bank

Cited by: 11
Authors
Prlic, Andreas [1]
Kalro, Tara [1]
Bhattacharya, Roshni [4]
Christie, Cole [1]
Burley, Stephen K. [1, 2, 3]
Rose, Peter W. [1]
Affiliations
[1] Univ Calif San Diego, San Diego Supercomp Ctr, RCSB Prot Data Bank, La Jolla, CA 92093 USA
[2] Rutgers State Univ, Inst Quantitat Biomed, Ctr Integrat Prote Res, RCSB Prot Data Bank,Dept Chem & Chem Biol, Piscataway, NJ 08854 USA
[3] Rutgers State Univ, Rutgers Canc Inst New Jersey, Piscataway, NJ 08854 USA
[4] San Diego State Univ, Bioinformat & Med Informat, San Diego, CA 92182 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
DOI
10.1093/bioinformatics/btw547
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The Protein Data Bank (PDB) now contains more than 120,000 three-dimensional (3D) structures of biological macromolecules. To allow an interpretation of how PDB data relates to other publicly available annotations, we developed a novel data integration platform that maps 3D structural information across various datasets. This integration bridges from the human genome across protein sequence to 3D structure space. We developed novel software solutions for data management and visualization, while incorporating new libraries for web-based visualization using SVG graphics.
Availability and Implementation: The new views are available from http://www.rcsb.org and software is available from https://github.com/rcsb/.
Contact: andreas.prlic@rcsb.org
Supplementary information: Supplementary data are available at Bioinformatics online.
Pages: 3833-3835
Page count: 3