Towards Big Data Analytics across Multiple Clusters

被引:4
|
作者
Wu, Dongyao [1 ,2 ]
Sakr, Sherif [1 ,2 ,3 ]
Zhu, Liming [1 ,2 ]
Wu, Huijun [1 ,2 ]
机构
[1] CSIRO, Data61, Sydney, NSW, Australia
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[3] King Saud Bin Abdulaziz Univ Hlth Sci, Natl Guard, Riyadh, Saudi Arabia
关键词
D O I
10.1109/CCGRID.2017.73
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Big data are increasingly collected and stored in a highly distributed infrastructures due to the development of sensor network, cloud computing, IoT and mobile computing among many other emerging technologies. In practice, the majority of existing big-data-processing frameworks (e.g., Hadoop and Spark) are designed based on the single-cluster setup with the assumptions of centralized management and homogeneous connectivity which makes them sub-optimal and sometimes infeasible to apply for scenarios that require implementing data analytics jobs on highly distributed data sets (across racks, data centers or multi-organizations). In order to tackle this challenge, we present HDM-MC, a multi-cluster big data processing framework which is designed to enable the capability of performing large scale data analytics across multi-clusters with minimum extra overhead due to additional scheduling requirements. In this paper, we present the architecture and realization of the system. In addition, we evaluate the performance of our framework in comparison to other state-of-art single cluster big data processing frameworks.
引用
收藏
页码:218 / 227
页数:10
相关论文
共 50 条
  • [1] HDM-MC in-Action: A Framework for Big Data Analytics across Multiple Clusters
    Wu, Dongyao
    Sakr, Sherif
    Zhu, Liming
    Lee, Sung Une
    Wu, Huijun
    [J]. 2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1547 - 1550
  • [2] Analytics towards big data
    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing
    100876, China
    不详
    100876, China
    不详
    100876, China
    [J]. Beijing Youdian Daxue Xuebao, 3 (1-12):
  • [3] Towards Automatic Memory Tuning for In-Memory Big Data Analytics in Clusters
    Koliopoulos, Aris-Kyriakos
    Yiapanis, Paraskevas
    Tekiner, Firat
    Nenadic, Goran
    Keane, John
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 353 - 356
  • [4] Towards Streamlined Big Data Analytics
    Benczur, Andras A.
    Palovics, Robert
    Balassi, Marton
    Markl, Volker
    Rabl, Tilmann
    Soto, Juan
    Hovstadius, Bjorn
    Dowling, Jim
    Haridi, Seif
    [J]. ERCIM NEWS, 2016, (107): : 31 - 32
  • [5] Visual analytics towards big data
    Ren, Lei
    Du, Yi
    Ma, Shuai
    Zhang, Xiao-Long
    Dai, Guo-Zhong
    [J]. Ruan Jian Xue Bao/Journal of Software, 2014, 25 (09): : 1909 - 1936
  • [6] Towards Efficient Big Data and Data Analytics: A Review
    Qureshi, Salim Raza
    Gupta, Ankur
    [J]. 2014 CONFERENCE ON IT IN BUSINESS, INDUSTRY AND GOVERNMENT (CSIBIG), 2014,
  • [7] Towards a 'Big' Health Data Analytics Platform
    Cha, Sangwhan
    Abusharekh, Ashraf
    Abidi, Syed S. R.
    [J]. 2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 233 - 241
  • [8] Data-Less Big Data Analytics (Towards Intelligent Data Analytics Systems)
    Triantafillou, Peter
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1666 - 1667
  • [9] Big Data Analytics Towards a Framework for a Smart City
    Srivastava, Devesh Kumar
    Singh, Ayush
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 1, 2018, 83 : 225 - 232
  • [10] Towards Seamless Configuration Tuning of Big Data Analytics
    Fekry, Ayat
    Carata, Lucian
    Pasquier, Thomas
    Rice, Andrew
    Hopper, Andy
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1912 - 1919