A novel approach to assess and improve syntactic interoperability in data integration

Cited by: 1
Authors
Nasfi, Rihem [1]
Bronselaer, Antoon [1]
De Tre, Guy [1]
Affiliations
[1] Univ Ghent, Dept Telecommun & Informat Proc, Sint Pietersnieuwstr 41, B-9000 Ghent, Belgium
Keywords
Relational databases; Interoperability; Data quality; Foundation; Quality
DOI
10.1016/j.ipm.2023.103522
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Data integration is essential to enrich a database with external information. One effective approach is to match shared identifiers across diverse databases. However, a lack of syntactic interoperability, which refers to the ability to match data based on their syntax, can pose challenges. In this paper, we present a novel method to evaluate and enhance syntactic interoperability while considering the associated costs. First, we introduce the linking index and the completeness index as generic measures of fine-grained syntactic interoperability. Second, we analyze the data consistency level of the identifiers using a rule-based framework for data quality assessment. Third, we propose a data integration strategy that balances the cost of fixing data inconsistencies against the resulting benefits, as measured by the linking and completeness indices. The approach is illustrated through two use cases: bibliographic databases and clinical trial registries. The results demonstrate that standardizing the representation of identifiers can significantly improve syntactic interoperability in certain scenarios, while in others the standardization process yields no improvement and thus argues against integration. By providing a cost-benefit analysis of improving data interoperability, the approach enables data integrators to make informed decisions about the feasibility and advantages of proceeding with data integration.
Pages: 23