Transforming Estonian health data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model: lessons learned

被引:5
|
作者
Oja, Marek [1 ,3 ]
Tamm, Sirli [1 ]
Mooses, Kerli [1 ]
Pajusalu, Maarja [1 ]
Talvik, Harry-Anton [1 ,2 ]
Ott, Anne [1 ]
Laht, Marianna [1 ]
Malk, Maria [1 ]
Loo, Marcus [1 ]
Holm, Johannes [1 ]
Haug, Markus [1 ]
Suvalov, Hendrik [1 ]
Saerg, Dage [1 ,2 ]
Vilo, Jaak [1 ,2 ]
Laur, Sven [1 ]
Kolde, Raivo [1 ]
Reisberg, Sulev [1 ,2 ]
机构
[1] Univ Tartu, Inst Comp Sci, Tartu 51009, Estonia
[2] STACC, Tartu 51009, Estonia
[3] Univ Tartu, Inst Comp Sci, Narva mnt 18, Tartu 51009, Estonia
基金
欧盟地平线“2020”;
关键词
OMOP; electronic health record; EHR; ETL; mapping; FEASIBILITY; RECORDS;
D O I
10.1093/jamiaopen/ooad100
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective To describe the reusable transformation process of electronic health records (EHR), claims, and prescriptions data into Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM), together with challenges faced and solutions implemented.Materials and Methods We used Estonian national health databases that store almost all residents' claims, prescriptions, and EHR records. To develop and demonstrate the transformation process of Estonian health data to OMOP CDM, we used a 10% random sample of the Estonian population (n = 150 824 patients) from 2012 to 2019 (MAITT dataset). For the sample, complete information from all 3 databases was converted to OMOP CDM version 5.3. The validation was performed using open-source tools.Results In total, we transformed over 100 million entries to standard concepts using standard OMOP vocabularies with the average mapping rate 95%. For conditions, observations, drugs, and measurements, the mapping rate was over 90%. In most cases, SNOMED Clinical Terms were used as the target vocabulary.Discussion During the transformation process, we encountered several challenges, which are described in detail with concrete examples and solutions.Conclusion For a representative 10% random sample, we successfully transferred complete records from 3 national health databases to OMOP CDM and created a reusable transformation process. Our work helps future researchers to transform linked databases into OMOP CDM more efficiently, ultimately leading to better real-world evidence. Health data can be found in various sources and formats, making it challenging for researchers. To address this issue, one possible approach is to transform the data into a standardized common data model (CDM). In this study, we describe the process of converting electronic health records (EHR), claims, and prescriptions data into the Observational Medical Outcome Partnership (OMOP) CDM, along with the challenges faced and solutions implemented. We used Estonian national health databases containing information on claims, prescriptions, and EHR records of 10% of Estonian residents (MAITT dataset). The study describes how data were mapped to standardized vocabulary and successfully converted to the OMOP CDM. We discuss the encountered difficulties and problems and propose solutions to help future researchers transform linked databases into OMOP CDM more efficiently, leading to better real-world evidence.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [31] Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model
    Klann, Jeffrey G.
    Joss, Matthew A. H.
    Embree, Kevin
    Murphy, Shawn N.
    PLOS ONE, 2019, 14 (02):
  • [32] Characterizing VA Users with the OMOP Common Data Model
    Viernes, Benjamin
    Lynch, Kristine E.
    South, Brett
    Coronado, Gregorio
    DuVall, Scott L.
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 1614 - 1615
  • [33] Developing a perinatal extension for the OMOP common data model
    Abellan, Alicia
    Burn, Edward
    Trinh, Nhung
    Burkard, Theresa
    Fernandez-Bertolin, Sergio
    Hurley, Eimir
    Rodriguez, Clara
    Segundo, Elena
    Morales, Daniel R.
    Nordeng, Hedvig
    Duarte-Salles, Talita
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 419 - 419
  • [34] An Evaluation of the THIN Database in the OMOP Common Data Model
    Zhou, Xiaofeng
    Murugesan, Sundaresan
    Bhullar, Harshvinder
    Liu, Qing
    Wentworth, Chuck
    Bate, Andrew
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2011, 20 : S232 - S232
  • [35] Implementation of a Cohort Retrieval System for Clinical Data Repositories Using the Observational Medical Outcomes Partnership Common Data Model: Proof-of-Concept System Validation
    Liu, Sijia
    Wang, Yanshan
    Wen, Andrew
    Wang, Liwei
    Hong, Na
    Shen, Feichen
    Bedrick, Steven
    Hersh, William
    Liu, Hongfang
    JMIR MEDICAL INFORMATICS, 2020, 8 (10)
  • [36] Transforming a Large-Scale Prostate Cancer Outcomes Dataset to the OMOP Common Data Model-Experiences from a Scientific Data Holder's Perspective
    Sibert, Nora Tabea
    Soff, Johannes
    La Ferla, Sebastiano
    Quaranta, Maria
    Kremer, Andreas
    Kowalski, Christoph
    CANCERS, 2024, 16 (11)
  • [37] Conceptual design of a generic data harmonization process for OMOP common data model
    Elisa Henke
    Michele Zoch
    Yuan Peng
    Ines Reinecke
    Martin Sedlmayr
    Franziska Bathelt
    BMC Medical Informatics and Decision Making, 24
  • [38] Transforming and evaluating the UK Biobank to the OMOP Common Data Model for COVID-19 research and beyond
    Papez, Vaclav
    Moinat, Maxim
    Voss, Erica A.
    Bazakou, Sofia
    Van Winzum, Anne
    Peviani, Alessia
    Payralbe, Stefan
    Kallfelz, Michael
    Asselbergs, Folkert W.
    Prieto-Alhambra, Daniel
    Dobson, Richard J. B.
    Denaxas, Spiros
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 30 (01) : 103 - 111
  • [39] Comparison of family health history in surveys vs electronic health record data mapped to the observational medical outcomes partnership data model in the All of Us Research Program
    Cronin, Robert M.
    Halvorson, Alese E.
    Springer, Cassie
    Feng, Xiaoke
    Sulieman, Lina
    Loperena-Cortes, Roxana
    Mayo, Kelsey
    Carroll, Robert J.
    Chen, Qingxia
    Ahmedani, Brian K.
    Karnes, Jason
    Korf, Bruce
    O'Donnell, Christopher J.
    Qian, Jun
    Ramirez, Andrea H.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (04) : 695 - 703
  • [40] Conceptual design of a generic data harmonization process for OMOP common data model
    Henke, Elisa
    Zoch, Michele
    Peng, Yuan
    Reinecke, Ines
    Sedlmayr, Martin
    Bathelt, Franziska
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)