Comparing the Performance of NoSQL Approaches for Managing Archetype-Based Electronic Health Record Data

被引:23
|
作者
Freire, Sergio Miranda [1 ,2 ]
Teodoro, Douglas [1 ]
Wei-Kleiner, Fang [3 ]
Sundvall, Erik [2 ,4 ]
Karlsson, Daniel [2 ]
Lambrix, Patrick [3 ,5 ]
机构
[1] Univ Estado Rio de Janeiro, Dept Tecnol Informacao & Educ Saude, BR-20550011 Rio De Janeiro, RJ, Brazil
[2] Linkoping Univ, Dept Biomed Engn, Linkoping, Sweden
[3] Linkoping Univ, Dept Comp & Informat Sci, S-58183 Linkoping, Sweden
[4] Reg Ostergotland, Linkoping, Sweden
[5] Swedish E Sci Res Ctr, Linkoping, Sweden
来源
PLOS ONE | 2016年 / 11卷 / 03期
关键词
D O I
10.1371/journal.pone.0150069
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study provides an experimental performance evaluation on population-based queries of NoSQL databases storing archetype-based Electronic Health Record (EHR) data. There are few published studies regarding the performance of persistence mechanisms for systems that use multilevel modelling approaches, especially when the focus is on population-based queries. A healthcare dataset with 4.2 million records stored in a relational database (MySQL) was used to generate XML and JSON documents based on the openEHR reference model. Six datasets with different sizes were created from these documents and imported into three single machine XML databases (BaseX, eXistdb and Berkeley DB XML) and into a distributed NoSQL database system based on the MapReduce approach, Couchbase, deployed in different cluster configurations of 1, 2, 4, 8 and 12 machines. Population-based queries were submitted to those databases and to the original relational database. Database size and query response times are presented. The XML databases were considerably slower and required much more space than Couchbase. Overall, Couchbase had better response times than MySQL, especially for larger datasets. However, Couchbase requires indexing for each differently formulated query and the indexing time increases with the size of the datasets. The performances of the clusters with 2, 4, 8 and 12 nodes were not better than the single node cluster in relation to the query response time, but the indexing time was reduced proportionally to the number of nodes. The tested XML databases had acceptable performance for openEHR-based data in some querying use cases and small datasets, but were generally much slower than Couchbase. Couchbase also outperformed the response times of the relational database, but required more disk space and had a much longer indexing time. Systems like Couchbase are thus interesting research targets for scalable storage and querying of archetype-based EHR data when population-based use cases are of interest.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Archetype-based data warehouse environment to enable the reuse of electronic health record data
    Marco-Ruiz, Luis
    Moner, David
    Maldonado, Jose A.
    Kolstrup, Nils
    Bellika, Johan G.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2015, 84 (09) : 702 - 714
  • [2] Dynamic Generation of Archetype-Based User Interfaces for Queries on Electronic Health Record Databases
    Sachdeva, Shelly
    Yaginuma, Daigo
    Chu, Wanming
    Bhalla, Subhash
    [J]. DATABASES IN NETWORKED INFORMATION SYSTEMS, 2011, 7108 : 109 - 125
  • [3] An Archetype Query Language interpreter into MongoDB: Managing NoSQL standardized Electronic Health Record extracts systems
    Ramos, Miguel
    Sanchez-de-Madariaga, Ricardo
    Barros, Jesus
    Carrajo, Lino
    Vazquez, Guillermo
    Perez, Santiago
    Pascual, Mario
    Martin-Sanchez, Fernando
    Munoz-Carrero, Adolfo
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 101
  • [4] Applying representational state transfer (REST) architecture to archetype-based electronic health record systems
    Erik Sundvall
    Mikael Nyström
    Daniel Karlsson
    Martin Eneling
    Rong Chen
    Håkan Örman
    [J]. BMC Medical Informatics and Decision Making, 13
  • [5] Applying representational state transfer (REST) architecture to archetype-based electronic health record systems
    Sundvall, Erik
    Nystrom, Mikael
    Karlsson, Daniel
    Eneling, Martin
    Chen, Rong
    Orman, Hakan
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2013, 13
  • [6] Archetype-Based Knowledge Management for Semantic Interoperability of Electronic Health Records
    Garde, Sebastian
    Chen, Rong
    Leslie, Heather
    Beale, Thomas
    McNicoll, Ian
    Heard, Sam
    [J]. MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 1007 - 1011
  • [7] Archetype-based electronic health records: a literature review and evaluation of their applicability to health data interoperability and access
    Wollersheim, Dennis
    Sari, Anny
    Rahayu, Wenny
    [J]. HEALTH INFORMATION MANAGEMENT JOURNAL, 2009, 38 (02) : 7 - 17
  • [8] Archetype-based semantic integration and standardization of clinical data
    Moner, David
    Maldonado, Jose A.
    Bosca, Diego
    Fernandez, Jesualdo T.
    Angulo, Carlos
    Crespo, Pere
    Vivancos, Pedro J.
    Robles, Montserrat
    [J]. 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 1314 - +
  • [9] Toward an Adaptive Software Architecture for Archetype-Based Health-Care Applications
    Pereira da Silva, Marcio Alexandre
    Times, Valeria Cesario
    Costa de Araujo, Andre Magno
    da Silva, Paulo Caetano
    [J]. IEEE SOFTWARE, 2022, 39 (02) : 89 - 96
  • [10] Bridging the Gap between Clinical Practice Guidelines and Archetype-Based Electronic Health Records: A Novel Model Proposal
    Garcia, Diego
    Moro, Claudia Maria C.
    Cintho, Lilian Mie M.
    [J]. MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 952 - 952