PDBeCIF: an open-source mmCIF/CIF parsing and processing package

被引:0
|
作者
van Ginkel, Glen [1 ]
Pravda, Lukas [1 ]
Dana, Jose M. [1 ]
Varadi, Mihaly [1 ]
Keller, Peter [2 ]
Anyango, Stephen [1 ]
Velankar, Sameer [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, England
[2] Global Phasing Ltd, Sheraton House,Castle Pk, Cambridge CB3 0AX, England
基金
英国惠康基金;
关键词
Software; Parser; PDB; PDBx; mmCIF; Protein structure; CCD; Small molecule; MACROMOLECULAR STRUCTURE DATA; WEB SERVER; TOOLS; BIOLOGY;
D O I
10.1186/s12859-021-04271-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biomacromolecular structural data outgrew the legacy Protein Data Bank (PDB) format which the scientific community relied on for decades, yet the use of its successor PDBx/Macromolecular Crystallographic Information File format (PDBx/ mmCIF) is still not widespread. Perhaps one of the reasons is the availability of easy to use tools that only support the legacy format, but also the inherent difficulties of processing mmCIF files correctly, given the number of edge cases that make efficient parsing problematic. Nevertheless, to fully exploit macromolecular structure data and their associated annotations such as multiscale structures from integrative/hybrid methods or large macromolecular complexes determined using traditional methods, it is necessary to fully adopt the new format as soon as possible. Results: To this end, we developed PDBeCIF, an open-source Python project for manipulating mmCIF and CIF files. It is part of the official list of mmCIF parsers recorded by the wwPDB and is heavily employed in the processes of the Protein Data Bank in Europe. The package is freely available both from the PyPI repository (http:// pypi.org/project/pdbecif) and from GitHub (https://github.com/pdbeurope/pdbecif) along with rich documentation and many ready-to-use examples. Conclusions: PDBeCIF is an efficient and lightweight Python 2.6+/3+ package with no external dependencies. It can be readily integrated with 3rd party libraries as well as adopted for broad scientific analyses.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] PDBeCIF: an open-source mmCIF/CIF parsing and processing package
    Glen van Ginkel
    Lukáš Pravda
    José M. Dana
    Mihaly Varadi
    Peter Keller
    Stephen Anyango
    Sameer Velankar
    [J]. BMC Bioinformatics, 22
  • [2] ParsCit: An open-source CRF reference string parsing package
    Councill, Isaac G.
    Giles, C. Lee
    Kan, Min-Yen
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 661 - 667
  • [3] SigMT: An open-source Python']Python package for magnetotelluric data processing
    Ajithabh, K. S.
    Patro, Prasanta K.
    [J]. COMPUTERS & GEOSCIENCES, 2023, 171
  • [4] XMIPP:: a new generation of an open-source image processing package for electron microscopy
    Sorzano, COS
    Marabini, R
    Velázquez-Muriel, J
    Bilbao-Castro, JR
    Scheres, SHW
    Carazo, JM
    Pascual-Montano, A
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2004, 148 (02) : 194 - 204
  • [5] Open-Source package for PJVS testing and calibration
    Durandetto, P.
    Sosso, A.
    [J]. 2018 CONFERENCE ON PRECISION ELECTROMAGNETIC MEASUREMENTS (CPEM 2018), 2018,
  • [6] MSIGen: An Open-Source Python Package for Processing and Visualizing Mass Spectrometry Imaging Data
    Hernly, Emerson
    Hu, Hang
    Laskin, Julia
    [J]. Journal of the American Society for Mass Spectrometry, 1600, 35 (10): : 2315 - 2323
  • [7] Open-source solutions for SPIMage processing
    Schmied, Christopher
    Stamataki, Evangelia
    Tomancak, Pavel
    [J]. QUANTITATIVE IMAGING IN CELL BIOLOGY, 2014, 123 : 505 - 529
  • [8] nlmixr: an open-source package for pharmacometric modeling in R
    Schoemaker, Rik
    Fidler, Matt
    Xiong, Yuan
    Wilkins, Justin
    Trame, Mirjame
    Laveille, Christian
    Wang, Wenping
    [J]. JOURNAL OF PHARMACOKINETICS AND PHARMACODYNAMICS, 2017, 44 : S60 - S61
  • [9] An open-source System Identification Package for multivariable processes
    Armenise, Giuseppe
    Vaccari, Marco
    di Capaci, Riccardo Bacci
    Pannocchia, Gabriele
    [J]. 2018 UKACC 12TH INTERNATIONAL CONFERENCE ON CONTROL (CONTROL), 2018, : 152 - 157
  • [10] ViXeN: An open-source package for managing multimedia data
    Ramachandran, Prabhu
    Devarajan, Kadambari
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2018, 9 (03): : 785 - 792