Matching disparate dimensions for analytical integration of heterogeneous data sources

被引:0
|
作者
Korobko, Anna [1 ]
Korobko, Aleksei [1 ]
机构
[1] RAS, ICM SB, Dept Appl Comp Sci, Krasnoyarsk, Russia
关键词
Analytical Data Integration; OLAP; FCA; Heterogeneous Data; Semantic Analysis; EXPLORATORY OLAP;
D O I
10.1145/3297662.3365809
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper presents the first steps towards an authorial integration methodology for heterogeneous data. Exposing information from multiple heterogeneous data sources demands a global (mediated) schema. We need a model to couple with the mismatches between schemata of different sources and to provide uniform access to the data. The virtual global schema is apparently more convenient for assembling big data sources because of useless time consumption during the processes of materialization and synchronization. Thus, an integral analytical model has been proposed as the global schema of heterogeneous data sources. The suggested model provides virtual integration of complex and diverse information for further analytical processing. It combines the original multidimensional design and lattice structure according to the formal conceptual analysis. The main goal of the paper is to suggest an approach to automatic mapping between the schemata of the disparate data sources and virtual integral analytical model with human moderation.
引用
收藏
页码:66 / 72
页数:7
相关论文
共 50 条
  • [1] Embedding-Based Data Matching for Disparate Data Sources
    Kired, Nour Elhouda
    Ravat, Franck
    Song, Jiefu
    Teste, Olivier
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2024, 2024, 14912 : 66 - 71
  • [2] Matching Data Fragments with Imperfect Identifiers from Disparate Sources
    Craig, Michael B.
    Moody, Benjamin E.
    Jia, Sherman
    Villarroel, Mauricio C.
    Mark, Roger G.
    [J]. COMPUTING IN CARDIOLOGY 2010, VOL 37, 2010, 37 : 793 - 796
  • [3] Semantic matching across heterogeneous data sources
    Zhao, Huimin
    [J]. COMMUNICATIONS OF THE ACM, 2007, 50 (01) : 45 - 50
  • [4] An approach for semantic integration of heterogeneous data sources
    Fusco, Giuseppe
    Aversano, Lerina
    [J]. PEERJ COMPUTER SCIENCE, 2020, PeerJ Inc. (2020): : 1 - 30
  • [5] An architecture for the integration of multimedia heterogeneous data sources
    Chianese, A
    Moscato, V
    Picariello, A
    Rinaldi, AM
    [J]. MSV'04 & AMCS'04, PROCEEDINGS, 2004, : 45 - 51
  • [6] Semantic integration of XML heterogeneous data sources
    Reynaud, C
    Sirot, JP
    Vodislav, D
    [J]. 2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2001, : 199 - 208
  • [7] Semantic integration of heterogeneous XML data sources
    Kim, HH
    Park, SS
    [J]. OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2002, 2425 : 95 - 107
  • [8] Integration of Multimodal Data from Disparate Sources for Identifying Disease Subtypes
    Zhou, Kaiyue
    Kottoori, Bhagya Shree
    Munj, Seeya Awadhut
    Zhang, Zhewei
    Draghici, Sorin
    Arslanturk, Suzan
    [J]. BIOLOGY-BASEL, 2022, 11 (03):
  • [9] Contextualized linguistic matching for heterogeneous data source integration
    Idrissi, Youssef Bououlid
    Vachon, Julie
    [J]. 2008 INTERNATIONAL MCETECH CONFERENCE ON E-TECHNOLOGIES, PROCEEDINGS, 2007, : 136 - 147
  • [10] Data Integration of Heterogeneous Data Sources Using QR Decomposition
    Sandhya, Harikumar
    Roy, Mekha Meriam
    [J]. INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 2, 2016, 385 : 333 - 344