PDBeCIF: an open-source mmCIF/CIF parsing and processing package

被引:1
|
作者
van Ginkel, Glen [1 ]
Pravda, Lukas [1 ]
Dana, Jose M. [1 ]
Varadi, Mihaly [1 ]
Keller, Peter [2 ]
Anyango, Stephen [1 ]
Velankar, Sameer [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, England
[2] Global Phasing Ltd, Sheraton House,Castle Pk, Cambridge CB3 0AX, England
基金
英国惠康基金;
关键词
Software; Parser; PDB; PDBx; mmCIF; Protein structure; CCD; Small molecule; MACROMOLECULAR STRUCTURE DATA; WEB SERVER; TOOLS; BIOLOGY;
D O I
10.1186/s12859-021-04271-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biomacromolecular structural data outgrew the legacy Protein Data Bank (PDB) format which the scientific community relied on for decades, yet the use of its successor PDBx/Macromolecular Crystallographic Information File format (PDBx/ mmCIF) is still not widespread. Perhaps one of the reasons is the availability of easy to use tools that only support the legacy format, but also the inherent difficulties of processing mmCIF files correctly, given the number of edge cases that make efficient parsing problematic. Nevertheless, to fully exploit macromolecular structure data and their associated annotations such as multiscale structures from integrative/hybrid methods or large macromolecular complexes determined using traditional methods, it is necessary to fully adopt the new format as soon as possible. Results: To this end, we developed PDBeCIF, an open-source Python project for manipulating mmCIF and CIF files. It is part of the official list of mmCIF parsers recorded by the wwPDB and is heavily employed in the processes of the Protein Data Bank in Europe. The package is freely available both from the PyPI repository (http:// pypi.org/project/pdbecif) and from GitHub (https://github.com/pdbeurope/pdbecif) along with rich documentation and many ready-to-use examples. Conclusions: PDBeCIF is an efficient and lightweight Python 2.6+/3+ package with no external dependencies. It can be readily integrated with 3rd party libraries as well as adopted for broad scientific analyses.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Iibcrn, an Open-Source Document Image Processing Library
    Leydier, Yann
    Duong, Jean
    Bres, Stephane
    Eglin, Veronique
    Lebourgeois, Frank
    Tola, Martial
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 211 - 215
  • [32] PuPl: an open-source tool for processing pupillometry data
    Kinley, Isaac
    Levy, Yarden
    [J]. BEHAVIOR RESEARCH METHODS, 2022, 54 (04) : 2046 - 2069
  • [33] HASPPP: an open-source Galileo HAS embeddable RTKLIB decoding package
    Zhang, Runzhi
    Tu, Rui
    Lu, Xiaochun
    [J]. GPS SOLUTIONS, 2024, 28 (04)
  • [34] TAPAS: An Open-Source Software Package for Translational Neuromodeling and Computational Psychiatry
    Fraessle, Stefan
    Aponte, Eduardo A.
    Bollmann, Saskia
    Brodersen, Kay H.
    Do, Cao T.
    Harrison, Olivia K.
    Harrison, Samuel J.
    Heinzle, Jakob
    Iglesias, Sandra
    Kasper, Lars
    Lomakina, Ekaterina, I
    Mathys, Christoph
    Mueller-Schrader, Matthias
    Pereira, Ines
    Petzschner, Frederike H.
    Raman, Sudhir
    Schoebi, Dario
    Toussaint, Birte
    Weber, Lilian A.
    Yao, Yu
    Stephan, Klaas E.
    [J]. FRONTIERS IN PSYCHIATRY, 2021, 12
  • [35] Cyclone: open-source package for simulation and analysis of finite dynamical systems
    Dimitrova, Elena S.
    Knapp, Adam C.
    Stigler, Brandilyn
    Stillman, Michael E.
    [J]. BIOINFORMATICS, 2023, 39 (11)
  • [36] An Open-Source Package for Thermal and Multispectral Image Analysis for Plants in Glasshouse
    Sharma, Neelesh
    Banerjee, Bikram Pratap
    Hayden, Matthew
    Kant, Surya
    [J]. PLANTS-BASEL, 2023, 12 (02):
  • [37] On the Use of Package Managers by the C plus plus Open-Source Community
    Miranda, Andre
    Pimentel, Joao
    [J]. 33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 1483 - 1491
  • [38] STOPGAP: an open-source package for template matching, subtomogram alignment and classification
    Wan, William
    Khavnekar, Sagar
    Wagner, Jonathan
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2024, 80 : 336 - 349
  • [39] Rxnat: An Open-Source R Package for XNAT-Based Repositories
    Gherman, Adrian
    Muschelli, John
    Caffo, Brian
    Crainiceanu, Ciprian
    [J]. FRONTIERS IN NEUROINFORMATICS, 2020, 14
  • [40] A Comparative Evaluation of Open-Source Graph Processing Platforms
    Pan, Xiaohui
    [J]. 2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016, : 325 - 330