HDMF: Hierarchical Data Modeling Framework for Modern Science Data Standards

Cited by: 0
Authors
Tritt, Andrew J. [1]
Rubel, Oliver [1]
Dichter, Benjamin [2]
Ly, Ryan [1]
Kang, Donghe [3]
Chang, Edward E. [5, 6]
Frank, Loren M. [4]
Bouchard, Kristofer [2]
Affiliations
[1] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Natl Lab, Biol Syst & Engn, Berkeley, CA USA
[3] Ohio State Univ, Comp Sci & Engn, Columbus, OH 43210 USA
[4] Univ Calif San Francisco, Howard Hughes Med Inst, Kavli Inst Fundamental Neurosci, Dept Physiol, San Francisco, CA USA
[5] Univ Calif San Francisco, Dept Neurol Surg, San Francisco, CA USA
[6] Univ Calif San Francisco, Ctr Integrat Neurosci, San Francisco, CA 94143 USA
Funding
U.S. National Institutes of Health (NIH)
Keywords
data standards; data modeling; data formats; HDF5; neurophysiology
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A ubiquitous problem in aggregating data across different experimental and observational data sources is a lack of software infrastructure that enables flexible and extensible standardization of data and metadata. To address this challenge, we developed HDMF, a hierarchical data modeling framework for modern science data standards. With HDMF, we separate the process of data standardization into three main components: (1) data modeling and specification, (2) data I/O and storage, and (3) data interaction and data APIs. To enable standards to support the complex requirements and varying use cases throughout the data life cycle, HDMF provides object mapping infrastructure to insulate and integrate these various components. This approach supports the flexible development of data standards and extensions, optimized storage backends, and data APIs, while allowing the other components of the data standards ecosystem to remain stable. To meet the demands of modern, large-scale science data, HDMF provides advanced data I/O functionality for iterative data write, lazy data load, and parallel I/O. It also supports optimization of data storage through chunking, compression, linking, and modular data storage. We demonstrate the application of HDMF in practice to design NWB 2.0 [13], a modern data standard for collaborative science across the neurophysiology community.
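The three-component separation described in the abstract can be illustrated with a toy sketch. This is not HDMF's API: every name below (`TIME_SERIES_SPEC`, `TimeSeries`, `build`, `chunked`, `JsonLinesBackend`) is hypothetical, and an in-memory list of JSON lines stands in for a real storage backend such as HDF5. It only shows, under those assumptions, how a specification, a user-facing container, and a storage backend can be kept apart by an object-mapping step, with data written chunk by chunk and read back lazily.

```python
import json
from dataclasses import dataclass
from typing import Iterable, Iterator, List

# (1) Specification: a hypothetical, minimal schema for one data type.
TIME_SERIES_SPEC = {
    "data_type": "TimeSeries",
    "attributes": ["name", "unit"],
    "datasets": ["data"],
}


# (3) User-facing API: a container class that knows nothing about storage.
@dataclass
class TimeSeries:
    name: str
    unit: str
    data: Iterable[float]  # may be a list or a lazy generator


def build(container: TimeSeries, spec: dict) -> dict:
    """Object mapping (toy version): translate a container into a
    storage-neutral 'builder' dictionary, driven by the specification."""
    builder = {"data_type": spec["data_type"], "attributes": {}, "datasets": {}}
    for key in spec["attributes"]:
        builder["attributes"][key] = getattr(container, key)
    for key in spec["datasets"]:
        builder["datasets"][key] = getattr(container, key)
    return builder


def chunked(values: Iterable[float], size: int) -> Iterator[List[float]]:
    """Yield fixed-size chunks so the full dataset never sits in memory."""
    chunk: List[float] = []
    for value in values:
        chunk.append(value)
        if len(chunk) == size:
            yield chunk
            chunk = []
    if chunk:
        yield chunk


# (2) Storage backend: consumes builders only, never containers.
class JsonLinesBackend:
    """Hypothetical backend that stores one JSON record per line in memory."""

    def __init__(self, chunk_size: int = 2) -> None:
        self.chunk_size = chunk_size
        self.lines: List[str] = []

    def write(self, builder: dict) -> None:
        header = {
            "data_type": builder["data_type"],
            "attributes": builder["attributes"],
        }
        self.lines.append(json.dumps(header, sort_keys=True))
        # Iterative write: each chunk is serialized as soon as it is produced.
        for name, values in builder["datasets"].items():
            for chunk in chunked(values, self.chunk_size):
                self.lines.append(json.dumps({"dataset": name, "chunk": chunk}))

    def read(self, dataset: str) -> Iterator[float]:
        # Lazy load: records are decoded only as the caller iterates.
        for line in self.lines:
            record = json.loads(line)
            if record.get("dataset") == dataset:
                yield from record["chunk"]


# Usage: the data source is a generator, so values are produced on demand.
series = TimeSeries(name="lfp", unit="volts", data=(x * 0.5 for x in range(5)))
backend = JsonLinesBackend(chunk_size=2)
backend.write(build(series, TIME_SERIES_SPEC))
values = list(backend.read("data"))
```

In this sketch, replacing `JsonLinesBackend` with another backend would leave `TimeSeries` and `TIME_SERIES_SPEC` untouched, which is the kind of insulation between components that the abstract attributes to the object mapping infrastructure.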
Pages: 165-179
Page count: 15