A natural language-based approach for a semi-automatic data mart design and ETL generation

被引:2
|
作者
Bargui, Fahmi [1 ]
Ben-Abdallah, Hanene [2 ]
Feki, Jamel [3 ]
机构
[1] Univ Sfax, Mir Cl Lab, Sfax, Tunisia
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
[3] Univ Jeddah, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
Decision support system; data warehouse; data mart; ETL; analytical requirements; multidimensional model;
D O I
10.1080/12460125.2016.1158066
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Data warehousing projects still face challenges in the various phases of the development life cycle. In particular, the success of the design phase, the focus of this paper, is hindered by the cross-disciplinary competences it requires. This paper presents a natural language-based method for the design of data mart schemas. Compared to existing approaches, our method has three main advantages: first, it facilitates requirements specification through a template covering all concepts of the decision-making process while providing for the acquisition of analytical requirements written in a structured natural language format. Second, it supports requirement validation with respect to a data source used in the ETL process. Third, it provides for a semi-automatic generation of conceptual data mart schemas that are directly mapped onto the data source; this mapping assists the definition of ETL procedures. The performance of the proposed method is illustrated through a software prototype used in an empirical comparison of our method with top-down methods.
引用
收藏
页码:309 / 344
页数:36
相关论文
共 50 条
  • [1] A Semi-automatic Approach to Identify Business Process Elements in Natural Language Texts
    Borges Ferreira, Renato Cesar
    Thom, Lucineia Heloisa
    Fantinato, Marcelo
    [J]. ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 3, 2017, : 250 - 261
  • [2] An approach to the semi-automatic generation of mediator specifications
    König-Ries, B
    [J]. ADVANCES IN DATABSE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 101 - 117
  • [3] Semi-automatic Lexicalized Tree Adjoining Grammar Extraction towards Natural Language Generation
    Qiu, Wei
    Yao, Tianfang
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 229 - 234
  • [4] Semi-Automatic Generation of Competency Maps Based on Educational Data Mining
    Alfonso, David
    Manjarres, Angeles
    Pickin, Simon
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 744 - 760
  • [5] Semi-Automatic Generation of Competency Maps Based on Educational Data Mining
    David Alfonso
    Angeles Manjarrés
    Simon Pickin
    [J]. International Journal of Computational Intelligence Systems, 2019, 12 : 744 - 760
  • [6] Natural Language-Based Approach for Helping in the Reuse of Ontology Design Patterns
    de Cea, Guadalupe Aguado
    Gomez-Perez, Asuncion
    Montiel-Ponsoda, Elena
    Suarez-Figueroa, Mari Carmen
    [J]. KNOWLEDGE ENGINEERING: PRACTICE AND PATTERNS, PROCEEDINGS, 2008, 5268 : 32 - 47
  • [7] Natural Language-Based Automatic Programming for Industrial Robots
    Haiyang Hu
    Jie Chen
    Hanwen Liu
    Zhongjin Li
    Liguo Huang
    [J]. Journal of Grid Computing, 2022, 20
  • [8] Data Minimisation: A Language-Based Approach
    Antignac, Thibaud
    Sands, David
    Schneider, Gerardo
    [J]. ICT SYSTEMS SECURITY AND PRIVACY PROTECTION, SEC 2017, 2017, 502 : 442 - 456
  • [9] Data Mining Techniques for Semi-Automatic Signature Generation
    Tylman, Wojciech
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DEPENDABILITY OF COMPUTER SYSTEMS, 2009, : 210 - 217
  • [10] Natural language-based automatic qualitative description of clusters
    Sanchez-Hernandez, German
    Agell, Nuria
    Carlos Aguado, Juan
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 225 - 234