A rewrite/merge approach for supporting real-time data warehousing via lightweight data integration

被引:7
|
作者
Cuzzocrea, Alfredo [1 ]
Ferreira, Nickerson [2 ]
Furtado, Pedro [2 ]
机构
[1] Univ Trieste, DIA Dept, Trieste, Italy
[2] Univ Coimbra, DEI Dept, Coimbra, Portugal
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 05期
关键词
Real-time data warehousing; Data warehouse optimization; Data warehouse performance; OLAP DATA; BIG DATA; ARCHITECTURE; ALGORITHMS;
D O I
10.1007/s11227-018-2707-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes and experimentally assesses a rewrite/merge approach for supporting real-time data warehousing via lightweight data integration. Real-time data warehouses are becoming more and more relevant actually, due to emerging research challenges such as Big Data and Cloud Computing. Our contribution fulfills limitations of actual data warehousing architectures, which are no suitable to perform classical operations (e.g., loading, aggregation, indexing, OLAP query answering, and so forth) under real-time constraints. The proposed approach is based on intelligent manipulation of SQL statements of input queries, which are decomposed in suitable sub-queries (the rewrite phase) that are finally submitted as (final) input queries to an ad hoc component responsible for the cooperative query answering via a parallel query processing inspired method (the merge phase). This method induces in a novel data warehousing framework where the static phase is separated by the dynamic phase, in order to achieve the real-time processing features. We complete our analytical contributions by means of an extensive experimental campaign where we stress the performance of our proposed real-time data warehousing framework against a popular data warehouse benchmark, and in comparison with traditional architectures, which finally confirms the benefits deriving from our proposal.
引用
收藏
页码:3898 / 3922
页数:25
相关论文
共 50 条
  • [1] A rewrite/merge approach for supporting real-time data warehousing via lightweight data integration
    Alfredo Cuzzocrea
    Nickerson Ferreira
    Pedro Furtado
    [J]. The Journal of Supercomputing, 2020, 76 : 3898 - 3922
  • [2] A continuous data integration methodology for supporting real-time data warehousing
    Santos, Ricardo Jorge
    Bernardino, Jorge
    [J]. ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2007, : 589 - 595
  • [3] Towards Near Real-Time Data Warehousing
    Chen, Li
    Rahayu, Wenny
    Taniar, David
    [J]. 2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 1150 - 1157
  • [4] Bioterrorism surveillance with real-time data warehousing
    Berndt, DJ
    Hevner, AR
    Studnicki, J
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2003, 2665 : 322 - 335
  • [5] An architecture for real-time warehousing of scientific data
    Lawrence, R
    Kruger, A
    [J]. CSC '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON SCIENTIFIC COMPUTING, 2005, : 151 - 156
  • [6] A Robust Join Operator to Process Streaming Data in Real-Time Data Warehousing
    Naeem, M. Asif
    [J]. 2013 EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2013, : 119 - 124
  • [7] A Supporting Framework for Real-time Data Mining
    Fan Aijing
    Fan Aiwan
    [J]. ADVANCED MEASUREMENT AND TEST, PARTS 1 AND 2, 2010, 439-440 : 1499 - +
  • [9] Data Warehousing Massive Real-time Elevator Signals and Maintenance Records
    Yang, Yi-Yang
    Si, Yain-Whar
    Leong, Wai-Leong
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, 2008, : 1260 - 1267
  • [10] Real-Time Integration of Building Energy Data
    Anjos, Diogo
    Carreira, Paulo
    Francisco, Alexandre P.
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 250 - 257