mosaicQA - A General Approach to Facilitate Basic Data Quality Assurance for Epidemiological Research

Cited by: 6
Authors
Bialke, Martin [1]
Rau, Henriette [1]
Schwaneberg, Thea [1]
Walk, Rene [2]
Bahls, Thomas [1]
Hoffmann, Wolfgang [1]
Affiliations
[1] Univ Med Greifswald, Sect Epidemiol Hlth Care & Community Hlth, Inst Community Med, Ellernholzstr 1-2, D-17487 Greifswald, Germany
[2] Univ Med Greifswald, Sect GANI MED, Inst Community Med, Greifswald, Germany
Keywords
Medical data management; data quality assurance; health services research
DOI
10.3414/ME16-01-0123
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: Epidemiological studies are based on a considerable amount of personal, medical and socio-economic data. To answer research questions with reliable results, epidemiological research projects face the challenge of providing high-quality data. Consequently, gathered data has to be reviewed continuously during the data collection period.
Objectives: This article describes the development of the mosaicQA-library for non-statistical experts, consisting of a set of reusable R functions that support basic data quality assurance for a wide range of application scenarios in epidemiological research.
Methods: To generate valid quality reports for various scenarios and data sets, a general and flexible development approach was needed. As a first step, a set of quality-related questions, targeting quality aspects on a more general level, was identified. The next step included the design of specific R-scripts to produce proper reports for metric and categorical data. For more flexibility, the third development step focussed on the generalization of the developed R-scripts, e.g. extracting characteristics and parameters. As a last step, the generic characteristics of the developed R functionalities and generated reports were evaluated using different metric and categorical datasets.
Results: The developed mosaicQA-library generates basic data quality reports for multivariate input data. If needed, more detailed results for single-variable data, including definition of units, variables, descriptions, code lists and categories of qualified missings, can easily be produced.
Conclusions: The mosaicQA-library enables researchers to generate reports for various kinds of metric and categorical data without the need for computational or scripting knowledge.
At the moment, the library focusses on the data structure quality and supports the assessment of several quality indicators, including frequency, distribution and plausibility of research variables as well as the occurrence of missing and extreme values. To simplify the installation process, mosaicQA has been released as an official R-package.
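The quality indicators named above (frequency, missing values, extreme values, plausibility) can be illustrated with a minimal sketch. This is not the mosaicQA API and is written in Python rather than R; the function names `basic_quality_report` and `category_frequencies`, the use of `None` for missing values, and the 1.5 x IQR rule for extreme values are all assumptions made for illustration, since the abstract does not state how the library computes these indicators.

```python
from collections import Counter
from statistics import mean, median, quantiles


def basic_quality_report(values, plausible_min=None, plausible_max=None):
    """Summarize one metric variable (hypothetical helper, not mosaicQA).

    `None` entries are treated as missing values.
    """
    observed = [v for v in values if v is not None]
    if len(observed) < 2:
        raise ValueError("need at least two observed values")

    # Quartiles of the observed values; the fences below follow the
    # common 1.5 x IQR convention (an assumption, not from the source).
    q1, _, q3 = quantiles(observed, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr

    def implausible(v):
        # A value is implausible if it violates a user-supplied bound.
        below = plausible_min is not None and v < plausible_min
        above = plausible_max is not None and v > plausible_max
        return below or above

    return {
        "n_total": len(values),
        "n_missing": len(values) - len(observed),
        "mean": mean(observed),
        "median": median(observed),
        "extreme": [v for v in observed if v < low_fence or v > high_fence],
        "implausible": [v for v in observed if implausible(v)],
    }


def category_frequencies(values):
    """Count categories of one categorical variable, missing included."""
    return dict(Counter("<missing>" if v is None else v for v in values))
```

For example, `basic_quality_report([120, 118, None, 135, 122, 400, 128, None, 125], plausible_min=50, plausible_max=300)` reports two missing values and flags 400 as both extreme and implausible.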
Pages: E67-E73
Number of pages: 7