ETL workflow reparation by means of case-based reasoning

Cited by: 10
Authors
Wojciechowski, Artur [1]
Affiliations
[1] Poznan Univ Tech, Inst Comp Sci, Poznan, Poland
Keywords
Data source evolution; ETL evolution; ETL repair; Case-based reasoning; SCHEMA EVOLUTION; DATA WAREHOUSES; ECOSYSTEMS; MANAGEMENT; METRICS; IMPACT;
DOI
10.1007/s10796-016-9732-0
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Data sources (DSs) integrated in a data warehouse frequently change their structures/schemas. As a consequence, in many cases, an already deployed ETL workflow stops its execution, yielding errors. Since in big companies the number of ETL workflows may reach tens of thousands, and since structural changes of DSs are frequent, automatic repair of an ETL workflow after such changes is of high practical importance. In our approach, we developed a framework, called E-ETL, for handling the evolution of an ETL layer. In the framework, an ETL workflow is repaired semi-automatically or automatically (depending on the case) as the result of structural changes in DSs, so that it works with the changed DSs. E-ETL supports two different repair methods, namely: (1) user-defined rules and (2) Case-Based Reasoning. In this paper, we present how Case-Based Reasoning may be applied to repairing ETL workflows. In particular, we contribute an algorithm for selecting the most suitable case for a given ETL evolution problem. The algorithm applies a technique for reducing cases in order to make them more universal and capable of solving more problems. The algorithm has been implemented in the E-ETL prototype and evaluated experimentally. The obtained results are also discussed in this paper.
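The case selection idea described in the abstract can be illustrated with a minimal sketch. It is not the algorithm from the paper: the `Case` structure, the change labels, the subset-based applicability rule, and the Jaccard similarity measure are all assumptions chosen only to show how a Case-Based Reasoning lookup might pick a stored repair for a new set of data source changes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    """Hypothetical stored case: DS changes seen before and the repair applied."""
    name: str
    changes: frozenset  # data source changes the case was recorded for
    repair: tuple       # repair steps that were applied to the ETL workflow


def similarity(problem, case):
    """Jaccard similarity between the problem's changes and the case's changes."""
    union = problem | case.changes
    return len(problem & case.changes) / len(union) if union else 0.0


def select_case(problem, cases):
    """Return the applicable case most similar to the problem, or None.

    Assumption: a case is applicable only if every change it was recorded
    for also occurs in the current problem.
    """
    applicable = [c for c in cases if c.changes and c.changes <= problem]
    return max(applicable, key=lambda c: similarity(problem, c), default=None)


# Illustrative case base; all labels are made up for this example.
cases = [
    Case("rename_only", frozenset({"rename_column"}),
         ("update_column_reference",)),
    Case("rename_and_drop", frozenset({"rename_column", "drop_column"}),
         ("update_column_reference", "remove_column_mapping")),
    Case("retype", frozenset({"change_type"}), ("insert_cast",)),
]

problem = frozenset({"rename_column", "drop_column"})
best = select_case(problem, cases)
no_match = select_case(frozenset({"add_table"}), cases)
```

Under the assumed applicability rule, a case recorded for fewer changes matches more problems, which is one way to read the abstract's remark that reducing cases makes them more universal.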
Pages: 21-43
Number of pages: 23
Related papers
50 in total
  • [1] ETL workflow reparation by means of case-based reasoning
    Artur Wojciechowski
    Information Systems Frontiers, 2018, 20 : 21 - 43
  • [2] E-ETL Framework: ETL Process Reparation Algorithms Using Case-Based Reasoning
    Wojciechowski, Artur
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2015), 2015, 539 : 321 - 333
  • [3] A case-based reasoning framework for workflow model management
    Madhusudan, T
    Zhao, JL
    Marshall, B
    DATA & KNOWLEDGE ENGINEERING, 2004, 50 (01) : 87 - 115
  • [4] Document-based workflow modeling: a case-based reasoning approach
    Kim, J
    Suh, W
    Lee, H
    EXPERT SYSTEMS WITH APPLICATIONS, 2002, 23 (02) : 77 - 93
  • [5] CASE-BASED REASONING
    EHRENBERG, D
    PETERSOHN, H
    WIRTSCHAFTSINFORMATIK, 1994, 36 (02): : 166 - 168
  • [6] CASE-BASED REASONING
    LEHNERT, W
    AI MAGAZINE, 1990, 11 (03) : 29 - 29
  • [7] CASE-BASED REASONING
    LEAKE, DB
    KNOWLEDGE ENGINEERING REVIEW, 1994, 9 (01): : 61 - 64
  • [8] Case-Based Reasoning
    Aha, DW
    AI MAGAZINE, 1995, 17 (01) : 92 - 92
  • [9] CBRFlow: Enabling adaptive workflow management through conversational case-based reasoning
    Weber, B
    Wild, W
    Breu, R
    ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2004, 3155 : 434 - 448
  • [10] Distributed case-based reasoning
    Plaza, Enric
    Mcginty, Lorraine
    KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (03): : 261 - 265