Applying Domain Knowledge for Data Quality Assessment in Dermatology

被引:0
|
作者
Igic, Nemanja [1 ]
Terzic, Branko [1 ]
Matic, Milan [2 ]
Ivancevic, Vladimir [1 ]
Lukovic, Ivan [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad, Serbia
[2] Univ Novi Sad, Fac Med, Novi Sad, Serbia
关键词
Dermatology; Data quality assessment; Domain knowledge application; INFORMATION-SYSTEMS;
D O I
10.1007/978-3-319-59424-8_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Dermatology Clinic at the Clinical Center of Vojvodina, Novi Sad, Serbia, has actively collected data regarding patients' treatment, health insurance and examinations. These data were stored in documents in the comma-separated values (CSV) format. Since many fields in these documents were presented as free form text or allow null values, there are many data records that are inconsistent with the real-world system. Currently, there is a large need for an analytic system that can analyze these data and find relevant patterns. Since such an analytic system would require clean and accurate data, there is a need to assess data quality. Therefore, a data quality system should be designed and built with a goal of identifying inaccurate records so that they can be aligned with the real-world state. In our approach to data quality assessment, the domain knowledge about data is used to define rules which are then used to evaluate the quality of the data. In this paper, we present the architecture of a data quality system that is used to define and apply these rules. The rules are first defined by a domain expert and then applied to data in order to determine the number of records that do not match the defined rules and identify the exact anomalies in the given records. Also, we present a case study in which we applied this data quality system to the data collected by the Dermatology Clinic.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 50 条
  • [31] Revolutionizing dermatology residency: artificial intelligence for knowledge and clinical milestones assessment
    Ramachandran, Vignesh
    Jairath, Neil
    Cheraghlou, Shayan
    Pahalyants, Vartan
    CLINICAL AND EXPERIMENTAL DERMATOLOGY, 2023, 49 (07) : 732 - 733
  • [32] Measuring Data Quality When Applying Data Swapping and Perturbation
    Canfora, G.
    Visaggio, C. A.
    INFORMATION SYSTEMS: PEOPLE, ORGANIZATIONS, INSTITUTIONS, AND TECHNOLOGIES, 2010, : 157 - 163
  • [33] Applying Domain Knowledge in Association Rules Mining Process - First Experience
    Rauch, Jan
    Simunek, Milan
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 113 - 122
  • [34] Applying domain knowledge to the discovery of operating situations in wastewater treatment plants
    Bejar, J
    Cortes, U
    Sanchez, M
    Gimeno, JM
    Poch, M
    INTELLIGENT INFORMATION SYSTEMS, (IIS'97) PROCEEDINGS, 1997, : 360 - 364
  • [35] An assessment of anxiety and dermatology life quality in patients with atopic dermatitis
    Linnet, J
    Jemec, GBE
    BRITISH JOURNAL OF DERMATOLOGY, 1999, 140 (02) : 268 - 272
  • [36] Applying quality of service architectures to the field-bus domain
    Rossler, F
    Geppert, B
    WFCS '97 - 1997 IEEE INTERNATIONAL WORKSHOP ON FACTORY COMMUNICATION SYSTEMS, PROCEEDINGS, 1997, : 39 - 48
  • [37] Knowledge Management in Website Quality Evaluation Domain
    Ziemba, Pawel
    Jankowski, Jaroslaw
    Watrobski, Jaroslaw
    Becker, Jaroslaw
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 : 75 - 85
  • [38] Enhancing cardiovascular risk assessment with advanced data balancing and domain knowledge-driven explainability
    Yang, Fan
    Qiao, Yanan
    Hajek, Petr
    Abedin, Mohammad Zoynul
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [39] Assessment of Catastrophic Risk Using Bayesian Network Constructed from Domain Knowledge and Spatial Data
    Li, Lianfa
    Wang, Jinfeng
    Leung, Hareton
    Jiang, Chengsheng
    RISK ANALYSIS, 2010, 30 (07) : 1157 - 1175
  • [40] Improving the quality of manually acquired data: Applying the theory of planned behaviour to data quality
    Murphy, Glen D.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2009, 94 (12) : 1881 - 1886