Components and Rationale of a Big Data Toolkit Spanning HPC, Grid, Edge and Cloud Computing

被引:6
|
作者
Bridge, Derek [1 ]
机构
[1] Indiana Univ, Dept Intelligent Syst Engn, Bloomington, IN 47405 USA
关键词
Cloud Computing; MapReduce; MPI; HPC; Dataflow; Edge Computing; Global Machine Learning;
D O I
10.1145/3126858.3133312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We look again at Big Data Programming environments such as Hadoop, Spark, Flink, Heron, Pregel; HPC concepts such as MPI and Asynchronous Many-Task runtimes and Cloud/Grid/Edge ideas such as event-driven computing, serverless computing, workflow, and Services. These cross many research communities including distributed systems, databases, cyberphysical systems and parallel computing which sometimes have inconsistent worldviews. There are many common capabilities across these systems which are often implemented differently in each packaged environment. For example, communication can be bulk synchronous processing or data flow; scheduling can be dynamic or static; state and fault-tolerance can have different models; execution and data can be streaming or batch, distributed or local. We suggest that one can usefully build a toolkit (called Twister2 by us) that supports these different choices and allows fruitful customization for each application area. We illustrate the design of Twister2 by several point studies. We stress the many open questions in very traditional areas including scheduling, messaging and checkpointing.
引用
收藏
页码:1 / 1
页数:1
相关论文
共 50 条
  • [1] Emerging intelligent big data analytics for cloud and edge computing
    Dong, Fang
    Yong, Jianming
    Fei, Xiang
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (23):
  • [2] Hybrid Computing-Where HPC meets grid and Cloud Computing
    Mateescu, Gabriel
    Gentzsch, Wolfgang
    Ribbens, Calvin J.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (05): : 440 - 453
  • [3] Application Of Cloud Computing In Biomedicine Big Data Analysis Cloud Computing In Big Data
    Yang, Tianyi
    Zhao, Yang
    [J]. 2017 INTERNATIONAL CONFERENCE ON ALGORITHMS, METHODOLOGY, MODELS AND APPLICATIONS IN EMERGING TECHNOLOGIES (ICAMMAET), 2017,
  • [4] Editorial: Convergency of AI and Cloud/Edge Computing for Big Data Applications
    Xuyun Zhang
    Lianyong Qi
    Yuan Yuan
    [J]. Mobile Networks and Applications, 2022, 27 : 2292 - 2294
  • [5] Editorial: Convergency of AI and Cloud/Edge Computing for Big Data Applications
    Zhang, Xuyun
    Qi, Lianyong
    Yuan, Yuan
    [J]. MOBILE NETWORKS & APPLICATIONS, 2022, 27 (06): : 2292 - 2294
  • [6] Performance evaluation of edge cloud computing system for big data applications
    Femminella, Mauro
    Pergolesi, Matteo
    Reali, Gianluca
    [J]. 2016 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET), 2016, : 170 - 175
  • [7] Seamless Computing for Industrial Systems spanning Cloud and Edge
    Mueller, Harald
    Gogouvitis, Spyridon V.
    Seitz, Andreas
    Bruegge, Bernd
    [J]. 2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 209 - 216
  • [8] Cloud Computing and Big Data
    Hsu, Ching-Hsien
    Tang, Chunming
    Esteves, Rui M.
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (06): : 995 - 997
  • [9] Big data and cloud computing
    Shrestha, Rasu B.
    [J]. APPLIED RADIOLOGY, 2014, 43 (03) : 32 - 34
  • [10] Chiminey: Connecting Scientists to HPC, Cloud and Big Data
    Yusuf, Iman I.
    Thomas, Ian E.
    Spichkova, Maria
    Schmidt, Heinz W.
    [J]. BIG DATA RESEARCH, 2017, 8 : 39 - 49