Opportunistic Physical Design for Big Data Analytics

Cited by: 15
Authors
LeFevre, Jeff [1, 2]
Sankaranarayanan, Jagan [1 ]
Hacigumus, Hakan [1 ]
Tatemura, Junichi [1 ]
Polyzotis, Neoklis [2 ]
Carey, Michael J. [3 ]
Affiliations
[1] NEC Labs Amer, Cupertino, CA USA
[2] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[3] Univ Calif Irvine, Irvine, CA USA
DOI
10.1145/2588555.2610512
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Big data analytical systems, such as MapReduce, perform aggressive materialization of intermediate job results in order to support fault tolerance. When jobs correspond to exploratory queries submitted by data analysts, these materializations yield a large set of materialized views that we propose to treat as an opportunistic physical design. We present a semantic model for UDFs that enables effective reuse of views containing UDFs along with a rewrite algorithm that provably finds the minimum-cost rewrite under certain assumptions. An experimental study on real-world datasets using our prototype based on Hive shows that our approach can result in dramatic performance improvements.
Pages: 851-862
Page count: 12