A Global and Comprehensive Approach for XML Data Warehouse design

Cited by: 0
Authors
Ouaret, Zoubir [1]
Boussaid, Omar [2]
Chalal, Rachid [1]
Affiliations
[1] High Natl Sch Comp Sci, ESI, BP 68M, Algiers 16309, Algeria
[2] Univ Lyon 2, ERIC, F-69676 Lyon, France
Keywords
XML data warehouse; multiple XML data sources; star-join schema; MDA approach
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The increasing amount of interesting data stored in XML format is the most challenging issue for the BI community. It is therefore desirable to extract, store, and integrate these large sources of information into special-purpose systems called "data warehouses" for further analysis and decision-making. However, compared with the well-structured relational databases of a company, XML data has a complex hierarchical structure, which renders existing traditional data warehouse approaches and techniques inappropriate. In this paper, we propose a semi-automatic approach for XML data warehouse design that starts from XML schemas as data sources. The first step consists of automatically generating a UML class diagram from a W3C XML Schema (XSD). However, the resulting diagram can be very large and hard to understand. To overcome this problem, we use a set of rules based on basic techniques for object-oriented design quality to develop a simplification algorithm that efficiently generates high-quality diagrams with a limited number of classes. Then, we propose a multidimensional (MD) element extraction algorithm to automatically identify facts, measures, and their corresponding dimensions. We also present a new metric for ranking the resulting MD schemas according to their relevance. The final step consists of automatically generating the star XML schema that corresponds to the XML data warehouse schema. Finally, we have implemented our approach in Java and evaluated the tool on several XML schemas.
Pages: 578-585
Page count: 8
Related papers
50 in total
  • [41] XML-Based Heterogeneous Database Integration For Data Warehouse Creation
    Tseng, Frank S. C.
    [J]. PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS 2005, SECTIONS 1-8 AND POSTER SESSIONS 1-6, 2005, : 590 - 603
  • [42] Building an XML document warehouse
    Feki, Jamel
    Ben Messaoud, Ines
    Zurfluh, Gilles
    [J]. JOURNAL OF DECISION SYSTEMS, 2013, 22 (02) : 122 - 148
  • [43] XML Warehouse Modelling and Querying
    Abdelhedi, Fatma
    Ntsama, Landry
    Zurfluh, Gilles
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2014, 2014, 424 : 72 - 81
  • [44] An XML Document Warehouse model
    Nassis, Vicky
    Dillon, Tharam S.
    Rajagopalapillai, Rajugan
    Rahayu, Wenny
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 513 - 529
  • [45] XWeB: The XML Warehouse Benchmark
    Mahboubi, Hadj
    Darmont, Jerome
    [J]. PERFORMANCE EVALUATION, MEASUREMENT AND CHARACTERIZATION OF COMPLEX SYSTEMS, 2011, 6417 : 185 - +
  • [46] Past Indeterminacy in Data Warehouse Design
    Khnaisser, Christina
    Lavoie, Luc
    Burgun, Anita
    Ethier, Jean-Francois
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II, 2017, 10439 : 90 - 100
  • [47] View selection for designing the global data warehouse
    Theodoratos, D
    Ligoudistianos, S
    Sellis, T
    [J]. DATA & KNOWLEDGE ENGINEERING, 2001, 39 (03) : 219 - 240
  • [48] Research on warehouse design and performance evaluation: A comprehensive review
    Gu, Jinxiang
    Goetschalckx, Marc
    McGinnis, Leon F.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 203 (03) : 539 - 549
  • [49] Design Metrics for Data Warehouse Evolution
    Papastefanatos, George
    Vassiliadis, Panos
    Simitsis, Alkis
    Vassiliou, Yannis
    [J]. CONCEPTUAL MODELING - ER 2008, PROCEEDINGS, 2008, 5231 : 440 - +
  • [50] XML Schema Design Approach
    Quang, Nguyen Hong
    Rahayu, Wenny
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2005, 1 (03) : 161 - +