Traffic-aware Task Placement with Guaranteed Job Completion Time for Geo-distributed Big Data

Cited by: 0
Authors
Li, Peng [1 ]
Miyazaki, Toshiaki [1 ]
Guo, Song [2 ]
Affiliations
[1] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima, Japan
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
Keywords
MAPREDUCE;
DOI
Not available
Chinese Library Classification
TN [Electronic technology, communication technology];
Discipline classification code
0809 ;
Abstract
Big data analysis is usually cast into parallel jobs running on geo-distributed data centers. Unlike a single data center, a geo-distributed environment poses significant challenges for big data analytics because of the limited network bandwidth between data centers located in different regions. Although research efforts have been devoted to geo-distributed big data, existing solutions remain inefficient because of their suboptimal performance or high complexity. In this paper, we propose a traffic-aware task placement scheme to minimize the completion time of big data jobs. We formulate the problem as a non-convex optimization problem and design an algorithm to solve it with a proven performance gap. Finally, extensive simulations are conducted to evaluate the performance of our proposal. The simulation results show that our algorithm can reduce job completion time by 40% compared to a conventional approach that aggregates all data for centralized processing. Meanwhile, it has only a 10% performance gap from the optimal solution, while its solving time is extremely small.
Pages: 6
Related papers
50 in total
  • [1] Traffic-Aware Geo-Distributed Big Data Analytics with Predictable Job Completion Time
    Li, Peng
    Guo, Song
    Miyazaki, Toshiaki
    Liao, Xiaofei
    Jin, Hai
    Zomaya, Albert Y.
    Wang, Kun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1785 - 1796
  • [2] Yugong: Geo-Distributed Data and Job Placement at Scale
    Huang, Yuzhen
    Shi, Yingjie
    Zhong, Zheng
    Feng, Yihui
    Cheng, James
    Li, Jiwei
    Fang, Haochuan
    Li, Chao
    Guan, Tao
    Zhou, Jingren
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12) : 2155 - 2169
  • [3] Time Optimization Modeling for Big Data Placement and Analysis for Geo-Distributed Data Centers
    Khan, Awais
    Attique, Muhammad
    Chung, Tae-Sun
    Kim, Youngjae
    2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 140 - 141
  • [4] Congestion-Aware Traffic Allocation for Geo-Distributed Data Centers
    Tao, Xiaoyi
    Ota, Kaoru
    Dong, Mianxiong
    Borjigin, Wuyunzhaola
    Qi, Heng
    Li, Keqiu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (03) : 1675 - 1687
  • [5] Type-aware Task Placement in Geo-distributed Data Centers with Low OPEX using Data Center Resizing
    Gu, Lin
    Zeng, Deze
    Guo, Song
    Yu, Shui
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 211 - 215
  • [6] QoS-Aware Data Placement for MapReduce Applications in Geo-Distributed Data Centers
    Chen, Wuhui
    Liu, Baichuan
    Paik, Incheon
    Li, Zhenni
    Zheng, Zibin
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2021, 68 (01) : 120 - 136
  • [7] Location-Aware Data Placement for Geo-distributed Online Social Networks
    Zhou, Jingya
    Fan, Jianxi
    Jia, Juncheng
    Cheng, Baolei
    Liu, Zhao
    2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 234 - 239
  • [8] Multi-job Hadoop scheduling to process Geo-distributed big data
    Cavallo, Marco
    Di Modica, Giuseppe
    Polito, Carmelo
    Tomarchio, Orazio
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 1175 - 1181
  • [9] Cost-Aware Big Data Processing Across Geo-Distributed Datacenters
    Xiao, Wenhua
    Bao, Weidong
    Zhu, Xiaomin
    Liu, Ling
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (11) : 3114 - 3127
  • [10] Location-aware Associated Data Placement for Geo-distributed Data-intensive Applications
    Yu, Boyang
    Pan, Jianping
    2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM), 2015,