ETL workflows: From formal specification to optimization

被引:0
|
作者
Sellis, Timos K. [1 ]
Simitsis, Alkis [2 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-10682 Athens, Greece
[2] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present our work on a framework towards the modeling and optimization of Extraction-Transformation-Loading (ETL) workflows. The goal of this research was to facilitate, manage, and optimize the design and implementation of the ETL workflows both during the initial design and deployment stage, as well as, during the continuous evolution of a data warehouse. In particular, we present our results which include: (a) the provision of a novel conceptual model for the tracing of inter-attribute relationships and the respective ETL transformations in the early stages of a data warehouse project, along with an attempt to use ontology-based mechanisms to semi-automatically capture the semantics and the relationships among the various sources; (b) the provision of a novel logical model for the representation of ETL workflows with two main characteristics: genericity and customization; (c) the semi-automatic transition from the conceptual to the logical model for ETL workflows; and (d) the tuning of an ETL workflow for the optimization of the execution order of its operations. Finally, we discuss some issues on future work in the area that we consider important and a step towards the incorporation of the above research results to other areas as well.
引用
收藏
页码:1 / +
页数:3
相关论文
共 50 条
  • [1] State-space optimization of ETL workflows
    Simitsis, A
    Vassiliadis, P
    Sellis, T
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) : 1404 - 1419
  • [2] An Efficient Heuristic for Logical Optimization of ETL Workflows
    Kumar, Nitin
    Kumar, P. Sreenivasa
    ENABLING REAL-TIME BUSINESS INTELLIGENCE, 2011, 84 : 68 - 83
  • [3] Benchmarking ETL Workflows
    Simitsis, Alkis
    Vassiliadis, Patios
    Dayal, Umeshwar
    Karagiannis, Anastasios
    Tziovara, Vasiliki
    PERFORMANCE EVALUATION AND BENCHMARKING, 2009, 5895 : 199 - +
  • [4] Blueprints and measures for ETL workflows
    Vassiliadis, P
    Simitsis, A
    Terrovitis, M
    Skiadopoulos, S
    CONCEPTUAL MODELING - ER 2005, 2005, 3716 : 385 - 400
  • [5] From conceptual design to performance optimization of ETL workflows: current state of research and open problems
    Syed Muhammad Fawad Ali
    Robert Wrembel
    The VLDB Journal, 2017, 26 : 777 - 801
  • [6] Automatic Composition of ETL Workflows from Business Intents
    Deneke, Wesley
    Li, Wing-Ning
    Thompson, Craig
    2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1036 - 1042
  • [7] From conceptual design to performance optimization of ETL workflows: current state of research and open problems
    Ali, Syed Muhammad Fawad
    Wrembel, Robert
    VLDB JOURNAL, 2017, 26 (06): : 777 - 801
  • [8] E-ETL: Framework for Managing Evolving ETL Workflows
    Wojciechowski, Artur
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2013, 38 (02) : 131 - 142
  • [9] Frequent patterns in ETL workflows: An empirical approach
    Theodorou, Vasileios
    Abello, Alberto
    Thiele, Maik
    Lehner, Wolfgang
    DATA & KNOWLEDGE ENGINEERING, 2017, 112 : 1 - 16
  • [10] Optimizing ETL Workflows for Fault-Tolerance
    Simitsis, Alkis
    Wilkinson, Kevin
    Dayal, Umeshwar
    Castellanos, Malu
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 385 - 396