A Variety-Sensitive ETL Processes

被引:2
|
作者
Berkani, Nabila [1 ]
Bellatreche, Ladjel [2 ]
机构
[1] Ecole Natl Super Informat ESI, Algiers, Algeria
[2] Poitiers Univ, ENSMA, ISAE, LIAS, Poitiers, France
来源
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II | 2017年 / 10439卷
关键词
DESIGN; WAREHOUSES;
D O I
10.1007/978-3-319-64471-4_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, small, medium and large companies need advanced data integration techniques supported by tools to analyse data in order to deliver real-time alerts and trigger automated actions, etc. In the context of rapidly technology changing, these techniques have to consider two main issues: (a) the variety of the huge amount of data sources (ex. traditional, semantic, and graph databases) and (b) the variety of storage platforms, where a data integration system may have several stores, where one hosts a particular type. These issues directly impact the efficiency and the deployment flexibility of ETL (Extract, Transform, Load). In this paper, we consider these issues. Firstly, thanks to Model Driven Engineering, we make generic different types of data sources. This genericity allows overloading the ETL operators. To show the benefit of this genericity, several examples of instantiation are described covering relational, semantic and graph databases. Secondly, a Web-service-driven approach for orchestrating the ETL flows is given. Thirdly, we present a fusion procedure that merges the set of heterogeneous instances and deployed according their favorite stores. Finally, our finding is validated through a proof of concept tool using the LUBM benchmark and YAGO KB and deployed in Oracle RDF Semantic Graph 12c.
引用
收藏
页码:201 / 216
页数:16
相关论文
共 50 条
  • [1] ETL Processes Security Modeling
    Dammak, Salma
    Ghozzi, Faiza
    Gargouri, Faiez
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2019, 10 (01) : 60 - 84
  • [2] E-ETL: Framework For Managing Evolving ETL Processes
    Wojciechowski, Artur
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 441 - 449
  • [3] GENUS: an ETL tool treating the Big Data Variety
    Souissi, Salwa
    BenAyed, Mounir
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [4] Using Shannon Entropy in ETL Processes
    Balta, Marian
    Felea, Victor
    NINTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 151 - 156
  • [5] Security Measures for Web ETL Processes
    Dammak, Salma
    Jedidi, Faiza Ghozzi
    Gargouri, Faiez
    COMPUTER AND INFORMATION SCIENCE 2015, 2016, 614 : 13 - 26
  • [6] An approach to conceptual modelling of ETL processes
    Dupor, Sasa
    Jovanovic, Vladan
    2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 1485 - 1490
  • [7] A Method for Modelling and Organazing ETL Processes
    Kabiri, Ahmed
    Chiadmi, Dalila
    2012 SECOND INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2012, : 138 - 143
  • [8] Generating incremental ETL processes automatically
    Zhang, Xufeng
    Sun, Weiwei
    Wang, Wei
    Feng, Yahui
    Shi, Baile
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 516 - +
  • [9] Optimizing ETL processes in data warehouses
    Simitsis, A
    Vassiliadis, P
    Sellis, T
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 564 - 575
  • [10] Modeling Agents Working on ETL Processes
    Gomes, Nuno
    Oliveira, Bruno
    Belo, Orlando
    ADVANCES IN PRACTICAL APPLICATIONS OF SCALABLE MULTI-AGENT SYSTEMS: THE PAAMS COLLECTION, 2016, 9662 : 265 - 268