KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data

被引:0
|
作者
Gutiérrez, Yoan [1 ]
Salas, José I. Abreu [1 ]
Montoyo, Andrés [1 ]
Muñoz, Rafael [1 ]
Estévez-Velarde, Suilan [2 ]
机构
[1] University Institute for Computing Research, University of Alicante, Carretera San Vicente del Raspeig s/n, 03690, Alicante, Spain
[2] Artificial Intelligence and Computing Systems, University of Havana, San Lázaro y L. Edificio Felipe Poey. Plaza de la Revolución, Havana, Cuba
关键词
Data integration - Data mining - Decision making - Digital storage - Economic analysis - Merging - Natural language processing systems - Population statistics - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents KD SENSO-MERGER, a novel Knowledge Discovery (KD) architecture that is capable of semantically integrating heterogeneous data from various sources of structured and unstructured data (i.e. geolocations, demographic, socio-economic, user reviews, and comments). This goal drives the main design approach of the architecture. It works by building internal representations that adapt and merge knowledge across multiple domains, ensuring that the knowledge base is continuously updated. To deal with the challenge of integrating heterogeneous data, this proposal puts forward the corresponding solutions: (i) knowledge extraction, addressed via a plugin-based architecture of knowledge sensors; (ii) data integrity, tackled by an architecture designed to deal with uncertain or noisy information; (iii) scalability, this is also supported by the plugin-based architecture as only relevant knowledge to the scenario is integrated by switching-off non-relevant sensors. Also, we minimize the expert knowledge required, which may pose a bottleneck when integrating a fast-paced stream of new sources. As proof of concept, we developed a case study that deploys the architecture to integrate population census and economic data, municipal cartography, and Google Reviews to analyze the socio-economic contexts of educational institutions. The knowledge discovered enables us to answer questions that are not possible through individual sources. Thus, companies or public entities can discover patterns of behavior or relationships that would otherwise not be visible and this would allow extracting valuable information for the decision-making process. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data
    Gutierrez, Yoan
    Salas, Jose I. Abreu
    Montoyo, Andres
    Munoz, Rafael
    Estevez-Velarde, Suilan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [2] The mediated integration architecture for heterogeneous data integration
    Chirathamjaree, C
    Mukviboonchai, S
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 77 - 80
  • [3] Semantic data integration: overall architecture
    Paiano, Roberto
    Guido, Anna Lisa
    [J]. INNOVATION AND KNOWLEDGE MANAGEMENT IN TWIN TRACK ECONOMIES: CHALLENGES & SOLUTIONS, VOLS 1-3, 2009, : 430 - 436
  • [4] A Data Model for Heterogeneous Data Integration Architecture
    Chromiak, Michal
    Stencel, Krzysztof
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2014, 2014, 424 : 547 - 556
  • [5] A Semantic Integration System for Heterogeneous Bioinformatics Data
    Dai, Weidi
    Cheng, Jianlai
    Wang, Qiuwen
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1072 - 1076
  • [6] An approach for semantic integration of heterogeneous data sources
    Fusco, Giuseppe
    Aversano, Lerina
    [J]. PEERJ COMPUTER SCIENCE, 2020, PeerJ Inc. (2020): : 1 - 30
  • [7] Semantic integration of XML heterogeneous data sources
    Reynaud, C
    Sirot, JP
    Vodislav, D
    [J]. 2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2001, : 199 - 208
  • [8] Semantic integration of heterogeneous XML data sources
    Kim, HH
    Park, SS
    [J]. OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2002, 2425 : 95 - 107
  • [9] An architecture for semantic integration of data and medical images
    Millan, Marta
    Trujillo, Maria
    Valencia, Daniel
    [J]. INGENIERIA Y COMPETITIVIDAD, 2014, 16 (02): : 11 - 22
  • [10] An architecture for the integration of multimedia heterogeneous data sources
    Chianese, A
    Moscato, V
    Picariello, A
    Rinaldi, AM
    [J]. MSV'04 & AMCS'04, PROCEEDINGS, 2004, : 45 - 51