Efficient data management in a large-scale epidemiology research project

被引:27
|
作者
Meyer, Jens [1 ]
Ostrzinski, Stefan [1 ]
Fredrich, Daniel [1 ]
Havemann, Christoph [1 ]
Krafczyk, Janina [1 ]
Hoffmann, Wolfgang [1 ]
机构
[1] Inst Community Med, Sect Epidemiol Hlth Care & Community Hlth, D-17487 Greifswald, Germany
关键词
Central Data Management; Electronic Data Capture; Electronic Case Report Forms; Individualized medicine; Personalized Medicine; ELECTRONIC DATA-COLLECTION; SUPPORT; SYSTEM;
D O I
10.1016/j.cmpb.2010.12.016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This article describes the concept of a "Central Data Management" (CDM) and its implementation within the large-scale population-based medical research project "Personalized Medicine". The CDM can be summarized as a conjunction of data capturing, data integration, data storage, data refinement, and data transfer. A wide spectrum of reliable "Extract Transform Load" (ETL) software for automatic integration of data as well as "electronic Case Report Forms" (eCRFs) was developed, in order to integrate decentralized and heterogeneously captured data. Due to the high sensitivity of the captured data, high system resource availability, data privacy, data security and quality assurance are of utmost importance. A complex data model was developed and implemented using an Oracle database in high availability cluster mode in order to integrate different types of participant-related data. Intelligent data capturing and storage mechanisms are improving the quality of data. Data privacy is ensured by a multi-layered role/right system for access control and de-identification of identifying data. A well defined backup process prevents data loss. Over the period of one and a half year, the CDM has captured a wide variety of data in the magnitude of approximately 5 terabytes without experiencing any critical incidents of system breakdown or loss of data. The aim of this article is to demonstrate one possible way of establishing a Central Data Management in large-scale medical and epidemiological studies. (C) 2011 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:425 / 435
页数:11
相关论文
共 50 条
  • [21] A research agenda for query processing in large-scale Peer Data Management Systems
    Hose, Katja
    Roth, Armin
    Zeitz, Andre
    Sattler, Kai-Uwe
    Naumann, Felix
    [J]. INFORMATION SYSTEMS, 2008, 33 (7-8) : 597 - 610
  • [22] Hierarchical Management of Large-Scale Malware Data
    Kellogg, Lee
    Ruttenberg, Brian
    O'Connor, Alison
    Howard, Michael
    Pfeffer, Avi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 666 - 674
  • [23] Data sharing and intellectual property in a genomic epidemiology network: policies for large-scale research collaboration
    Chokshi, Dave A.
    Parker, Michael
    Kwiatkowski, Dominic P.
    [J]. BULLETIN OF THE WORLD HEALTH ORGANIZATION, 2006, 84 (05) : 382 - 387
  • [24] NewBalance: Efficient Data Space Management and Algorithmic Optimization for Large-Scale Storage Systems
    Xu Guangping
    Lin Sheng
    Shi Kai
    Zhang Hua
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (03) : 493 - 501
  • [25] Efficient group key management for secure big data in predictable large-scale networks
    He, Shuangyu
    Wu, Qianhong
    Qin, Bo
    Liu, Jianwei
    Li, Yan
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (04): : 1174 - 1192
  • [26] NewBalance: Efficient Data Space Management and Algorithmic Optimization for Large-Scale Storage Systems
    XU Guangping
    LIN Sheng
    SHI Kai
    ZHANG Hua
    [J]. Chinese Journal of Electronics, 2017, 26 (03) : 493 - 501
  • [27] Efficient large-scale data analysis using mapreduce
    [J]. Kubo, R., 1600, Nippon Telegraph and Telephone Corp. (10):
  • [28] An Efficient Strategy for Large-Scale CORS Data Processing
    Xiong, Bolin
    Huang, Dingfa
    [J]. CHINA SATELLITE NAVIGATION CONFERENCE (CSNC) 2016 PROCEEDINGS, VOL I, 2016, 388 : 213 - 225
  • [29] Efficient bioinformatics approaches for large-scale data analysis
    Hautaniemi, S.
    [J]. FEBS JOURNAL, 2011, 278 : 27 - 27
  • [30] An Efficient Large-Scale Volume Data Compression Algorithm
    Xiao, Degui
    Zhao, Liping
    Yang, Lei
    Li, Zhiyong
    Li, Kenli
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 567 - 575