A UML based approach for modeling ETL processes in data warehouses

被引:0
|
作者
Trujillo, J [1 ]
Luján-Mora, S [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, Alicante, Spain
关键词
ETL processes; data warehouses; conceptual modeling; UML;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data warehouses (DWs) are complex computer systems whose main goal is to facilitate the decision making process of knowledge workers. ETL (Extraction-Transformation-Loading) processes are responsible for the extraction of data from heterogeneous operational data sources, their transformation (conversion, cleaning, normalization, etc.) and their loading into DWs. ETL processes are a key component of DWs because incorrect or misleading data will produce wrong business decisions, and therefore, a correct design of these processes at early stages of a DW project is absolutely necessary to improve data quality. However, not much research has dealt with the modeling of ETL processes. In this paper, we present our approach, based on the Unified Modeling Language (UML), which allows us to accomplish the conceptual modeling of these ETL processes. We provide the necessary mechanisms for an easy and quick specification of the common operations defined in these ETL processes such as, the integration of different data sources, the transformation between source and target attributes, the generation of surrogate keys and so on. Another advantage of our proposal is the use of the UML (standardization, ease-of-use and functionality) and the seamless integration of the design of the ETL processes with the DW conceptual schema.
引用
收藏
页码:307 / 320
页数:14
相关论文
共 50 条
  • [31] Physical modeling of data warehouses using UML component and deployment diagrams:: Design and implementation issues
    Luján-Mora, S
    Trujillo, J
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2006, 17 (02) : 12 - 42
  • [32] Applying the UML and the unified process to the design of data warehouses
    Lujan-Mora, Sergio
    Trujillo, Juan
    [J]. JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2006, 46 (30-58) : 30 - 58
  • [33] Modeling of ETL-Processes and Processed Information in Clinical Data Warehousing
    Tute, Erik
    Steiner, Jochen
    [J]. HEALTH INFORMATICS MEETS EHEALTH: BIOMEDICAL MEETS EHEALTH - FROM SENSORS TO DECISIONS, 2018, 248 : 204 - 211
  • [34] Towards a conceptualization of ETL and physical storage of semantic data warehouses as a service
    Nabila Berkani
    Ladjel Bellatreche
    Selma Khouri
    [J]. Cluster Computing, 2013, 16 : 915 - 931
  • [35] Towards a conceptualization of ETL and physical storage of semantic data warehouses as a service
    Berkani, Nabila
    Bellatreche, Ladjel
    Khouri, Selma
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (04): : 915 - 931
  • [36] Modeling Agents Working on ETL Processes
    Gomes, Nuno
    Oliveira, Bruno
    Belo, Orlando
    [J]. ADVANCES IN PRACTICAL APPLICATIONS OF SCALABLE MULTI-AGENT SYSTEMS: THE PAAMS COLLECTION, 2016, 9662 : 265 - 268
  • [37] Multidimensional data modeling for data warehouses
    Harbin Inst of Technology, Harbin, China
    [J]. Ruan Jian Xue Bao/Journal of Software, 2000, 11 (07): : 908 - 917
  • [38] Instant-On Scientific Data Warehouses Lazy ETL for Data-Intensive Research
    Kargin, Yagiz
    Pirk, Holger
    Ivanova, Milena
    Manegold, Stefan
    Kersten, Martin
    [J]. ENABLING REAL-TIME BUSINESS INTELLIGENCE, VLDB 2012, 2013, 154 : 60 - 75
  • [39] An approach to conceptual modelling of ETL processes
    Dupor, Sasa
    Jovanovic, Vladan
    [J]. 2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 1485 - 1490
  • [40] A UML 2.0 profile to design Association Rule mining models in the multidimensional conceptual modeling of data warehouses
    Zubcoff, Jose
    Trujillo, Juan
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 63 (01) : 44 - 62