Active XML-based Web data integration

被引:16
|
作者
Salem, Rashed [1 ]
Boussaid, Omar [1 ]
Darmont, Jerome [1 ]
机构
[1] Univ Lyon, ERIC Lyon 2, F-69676 Bron, France
关键词
Real-time Web data integration; Metadata; Integration services; Active rules; Event mining; DATA WAREHOUSES; ISSUES; OLAP;
D O I
10.1007/s10796-012-9405-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today, the Web is the largest source of information worldwide. There is currently a strong trend for decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) to move onto the Web, especially in the cloud. Integrating data into DW/BI applications is a critical and time-consuming task. To make better decisions in DW/BI applications, next generation data integration poses new requirements to data integration systems, over those posed by traditional data integration. In this paper, we propose a generic, metadata-based, service-oriented, and event-driven approach for integrating Web data timely and autonomously. Beside handling data heterogeneity, distribution and interoperability, our approach satisfies near real-time requirements and realize active data integration. For this sake, we design and develop a framework that utilizes Web standards (e.g., XML and Web services) for tackling data heterogeneity, distribution and interoperability issues. Moreover, our framework utilizes Active XML (AXML) to warehouse passive data as well as services to integrate active and dynamic data on-the-fly. AXML embedded services and changes detection services ensure near real-time data integration. Furthermore, the idea of integrating Web data actively and autonomously revolves around mining events logged by the data integration environment. Therefore, we propose an incremental XML-based algorithm for mining association rules from logged events. Then, we define active rules dynamically upon mined data to automate and reactivate integration tasks. Finally, as a proof of concept, we implement a framework prototype as a Web application using open-source tools.
引用
收藏
页码:371 / 398
页数:28
相关论文
共 50 条
  • [41] An XML-based distributed spatial data engine
    Tan, YM
    Chi, TH
    Tang, ZS
    [J]. WAVELET ANALYSIS AND ITS APPLICATIONS, AND ACTIVE MEDIA TECHNOLOGY, VOLS 1 AND 2, 2004, : 881 - 886
  • [42] XBiT: An XML-based bitemporal data model
    Wang, FS
    Zaniolo, C
    [J]. CONCEPTUAL MODELING - ER 2004, PROCEEDINGS, 2004, 3288 : 810 - 824
  • [43] XML-BASED AUTOMATIC TEST DATA GENERATION
    Bulbul, Halil Ibrahim
    Bakir, Turgut
    [J]. COMPUTING AND INFORMATICS, 2008, 27 (04) : 681 - 698
  • [44] Toward XML-based data warehouse architecture
    Rifaieh, R
    Benkat, NA
    [J]. INFORMATION TECHNOLOGY AND ORGANIZATIONS: TRENDS, ISSUES, CHALLENGES AND SOLUTIONS, VOLS 1 AND 2, 2003, : 552 - 555
  • [45] Maintaining data consistency in XML-based applications
    Pardede, E
    Rahayu, JW
    Taniar, D
    [J]. 2005 3rd IEEE International Conference on Industrial Informatics (INDIN), 2005, : 510 - 515
  • [46] The development of an XML-Based data warehouse system
    Huang, SM
    Su, CH
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 206 - 212
  • [47] Virtual XML:a New Approach to an XML-based Virtualization of Data Resources
    吴吉义
    [J]. 四川大学学报(工程科学版), 2007, (工程科学版) : 260 - 263
  • [48] An XML-based agent model for supporting user activities on the Web
    DIMET Università Mediterranea di Reggio Calabria, Via Graziella, Localita Feo di Vito, 89060 Reggio Calabria, Italy
    不详
    [J]. Web Intell. Agent Syst., 2006, 2 (181-207):
  • [49] XML-based Web Information Extraction System Design and Implementation
    Jun, Ma
    Li Tihong
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 8, 2010, : 551 - 554
  • [50] AVSML: An XML-Based markup language for web information integration in 3D virtual space
    Kitamura, Yasuhiko
    Shibata, Yatsuho
    Tokuda, Keisuke
    Kobayashi, Kazuki
    Nagata, Noriko
    [J]. INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2007, 4722 : 385 - +