Improving Network Traffic in MapReduce for Big Data Applications

被引:0
|
作者
Gawande, Priya [1 ]
Shaikh, Nuzhaft [1 ]
机构
[1] MES Coll Engn, Dept Comp Engn, Pune, Maharashtra, India
关键词
Aggregator; distributed system; locality; scheduling;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Improving the performance of network traffic in shuffle phase is important to improve the performance of MapReduce. The goal of enhancement of network traffic is achieved by using partition and aggregation. According to traditional method a hash function is used to partition intermediate data among reduce tasks but the traditional function is not efficient to handle network traffic. A novel intermediate data partition scheme is designed to reduce network traffic cost in MapReduce. The aggregator placement problem is considered, where each aggregator can reduce merged traffic from multiple map tasks. A decomposition-based distributed algorithm is proposed to deal with the large-scale optimization problem for big data applications. Also an online algorithm is designed to adjust data partition and aggregation in a dynamic manner. Network traffic cost under both offline and online cases is significantly reduced as demonstrated by the stimulation results by the various proposal considered and used.
引用
收藏
页码:2979 / 2983
页数:5
相关论文
共 50 条
  • [21] Dache: A Data Aware Caching for Big-Data Applications Using the MapReduce Framework
    Yaxiong Zhao
    Jie Wu
    Cong Liu
    [J]. Tsinghua Science and Technology, 2014, 19 (01) : 39 - 50
  • [22] Dache: A data aware caching for big-data applications using the MapReduce framework
    [J]. Zhao, Y. (yaxiongzhao@google.com), 1600, Tsinghua University (19):
  • [23] MapReduce: Simplified Data Analysis of Big Data
    Maitrey, Seema
    Jha, C. K.
    [J]. 3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 563 - 571
  • [24] A Survey on Big Data for Network Traffic Monitoring and Analysis
    D'Alconzo, Alessandro
    Drago, Idilio
    Morichetta, Andrea
    Mellia, Marco
    Casas, Pedro
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (03): : 800 - 813
  • [25] Improving the approaches of traffic demand forecasting in the big data era
    Zhao, Yongmei
    Zhang, Hongmei
    An, Li
    Liu, Quan
    [J]. CITIES, 2018, 82 : 19 - 26
  • [26] Analysis of the Big Data based on MapReduce
    Tian, Zi-de
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 224 - 228
  • [27] Architecture of Efficient Word Processing using Hadoop MapReduce for Big Data Applications
    Mandal, Bichitra
    Sahoo, Ramesh Kumar
    Sethi, Srinivas
    [J]. PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON MAN AND MACHINE INTERFACING (MAMI), 2015,
  • [28] Improving the performance of GIS polygon overlay computation with MapReduce for spatial big data processing
    Wang, Yong
    Liu, Zhenling
    Liao, Hongyan
    Li, Chengjun
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 507 - 516
  • [29] MapReduce Algorithms for Big Data Analysis
    Shim, Kyuseok
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2016 - 2017
  • [30] Improving the performance of GIS polygon overlay computation with MapReduce for spatial big data processing
    Yong Wang
    Zhenling Liu
    Hongyan Liao
    Chengjun Li
    [J]. Cluster Computing, 2015, 18 : 507 - 516