Traffic-aware Task Placement with Guaranteed Job Completion Time for Geo-distributed Big Data

被引:0
|
作者
Li, Peng [1 ]
Miyazaki, Toshiaki [1 ]
Guo, Song [2 ]
机构
[1] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima, Japan
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
关键词
MAPREDUCE;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Big data analysis is usually casted into parallel jobs running on geo-distributed data centers. Different from a single data center, geo-distributed environment imposes big challenges for big data analytics due to the limited network bandwidth between data centers located in different regions. Although research efforts have been devoted to geo-distributed big data, the results are still far from being efficient because of their suboptimal performance or high complexity. In this paper, we propose a traffic-aware task placement to minimize job completion time of big data jobs. We formulate the problem as a non-convex optimization problem and design an algorithm to solve it with proved performance gap. Finally, extensive simulations are conducted to evaluate the performance of our proposal. The simulation results show that our algorithm can reduce job completion time by 40%, compared to a conventional approach that aggregates all data for centralized processing. Meanwhile, it has only 10% performance gap with the optimal solution, but its problem-solving time is extremely small.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Traffic-aware and Reliability-guaranteed Virtual Machine Placement Optimization in Cloud Datacenters
    Liu, Xuan
    Cheng, Bo
    Yue, Yi
    Wang, Meng
    Li, Biyi
    Chen, Junliang
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 91 - 98
  • [42] A LAHC-based Job Scheduling Strategy to Improve Big Data Processing in Geo-distributed Contexts
    Cavallo, Marco
    Di Modica, Giuseppe
    Polito, Carmelo
    Tomarchio, Orazio
    IOTBDS: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY, 2017, : 92 - 101
  • [43] Blender: A Traffic-Aware Container Placement for Containerized Data Centers
    Wu, Zhaorui
    Deng, Yuhui
    Feng, Hao
    Zhou, Yi
    Min, Geyong
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 986 - 989
  • [44] Scalable and Adaptive Data Replica Placement for Geo-Distributed Cloud Storages
    Liu, Kaiyang
    Peng, Jun
    Wang, Jingrong
    Liu, Weirong
    Huang, Zhiwu
    Pan, Jianping
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (07) : 1575 - 1587
  • [45] On Traffic-Aware Partition and Aggregation in MapReduce for Big Data Applications
    Ke, Huan
    Li, Peng
    Guo, Song
    Guo, Minyi
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 818 - 828
  • [46] Scalable Data Placement of Data-intensive Services in Geo-distributed Clouds
    Atrey, Ankita
    Van Seghbroeck, Gregory
    Volckaert, Bruno
    De Turck, Filip
    CLOSER: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2018, : 497 - 508
  • [47] A Network Cost-aware Geo-distributed Data Analytics System
    Oh, Kwangsung
    Chandra, Abhishek
    Weissman, Jon
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 649 - 658
  • [48] DRASH: A Data Replication-Aware Scheduler in Geo-distributed Data Centers
    Convolbo, Moise W.
    Chou, Jerry
    Lu, Shihyu
    Chung, Yeh Ching
    2016 8TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2016), 2016, : 302 - 309
  • [49] Workload-Aware Scheduling Across Geo-distributed Data Centers
    Jin, Yibo
    Gao, Yuan
    Qian, Zhuzhong
    Zhai, Mingyu
    Peng, Hui
    Lu, Sanglu
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1455 - 1462
  • [50] Placement of High Availability Geo-Distributed Data Centers in Emerging Economies
    Liu, Ruiyun
    Sun, Weiqiang
    Hu, Weisheng
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 3274 - 3288