Traffic-aware Task Placement with Guaranteed Job Completion Time for Geo-distributed Big Data

Cited by: 0
Authors
Li, Peng [1]
Miyazaki, Toshiaki [1]
Guo, Song [2]
Affiliations
[1] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima, Japan
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
Keywords
MAPREDUCE;
DOI
Not available
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Big data analysis is usually cast as parallel jobs running on geo-distributed data centers. Unlike a single data center, a geo-distributed environment poses major challenges for big data analytics because of the limited network bandwidth between data centers located in different regions. Although research efforts have been devoted to geo-distributed big data, existing solutions remain inefficient because of their suboptimal performance or high complexity. In this paper, we propose a traffic-aware task placement scheme to minimize the completion time of big data jobs. We formulate the problem as a non-convex optimization problem and design an algorithm to solve it with a proven performance gap. Finally, extensive simulations are conducted to evaluate the performance of our proposal. The simulation results show that our algorithm can reduce job completion time by 40% compared to a conventional approach that aggregates all data for centralized processing. Meanwhile, it has only a 10% performance gap relative to the optimal solution, while its solving time is extremely small.
Pages: 6