Improving Network Traffic in MapReduce for Big Data Applications

被引:0
|
作者
Gawande, Priya [1 ]
Shaikh, Nuzhaft [1 ]
机构
[1] MES Coll Engn, Dept Comp Engn, Pune, Maharashtra, India
关键词
Aggregator; distributed system; locality; scheduling;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Improving the performance of network traffic in shuffle phase is important to improve the performance of MapReduce. The goal of enhancement of network traffic is achieved by using partition and aggregation. According to traditional method a hash function is used to partition intermediate data among reduce tasks but the traditional function is not efficient to handle network traffic. A novel intermediate data partition scheme is designed to reduce network traffic cost in MapReduce. The aggregator placement problem is considered, where each aggregator can reduce merged traffic from multiple map tasks. A decomposition-based distributed algorithm is proposed to deal with the large-scale optimization problem for big data applications. Also an online algorithm is designed to adjust data partition and aggregation in a dynamic manner. Network traffic cost under both offline and online cases is significantly reduced as demonstrated by the stimulation results by the various proposal considered and used.
引用
收藏
页码:2979 / 2983
页数:5
相关论文
共 50 条
  • [1] LARGE SCALE OPTIMIZATION TO MINIMIZE NETWORK TRAFFIC USING MAPREDUCE IN BIG DATA APPLICATIONS
    Neelakandan, S.
    Divyabharathi, S.
    Rahini, S.
    Vijayalakshmi, G.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY INFORMATION AND COMMUNICATION (ICCPEIC), 2016, : 193 - 199
  • [2] On Traffic-Aware Partition and Aggregation in MapReduce for Big Data Applications
    Ke, Huan
    Li, Peng
    Guo, Song
    Guo, Minyi
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 818 - 828
  • [3] TRAFFIC FLOW PROPHECY WITH MAPREDUCE JOB FOR BIG DATA DRIVEN
    Abirami, U.
    Sridevi, S.
    [J]. 2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 13 - 18
  • [4] Towards MapReduce based Bayesian Deep Learning Network for Monitoring Big Data Applications
    Shafiq, M. Omair
    Torunski, Eric
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2112 - 2121
  • [5] Investigation and Characterization of MapReduce Applications for Big Data Analytics
    Li, Y.
    Lam, T. B. V.
    Do, T. V. Van
    Chakka, R.
    Rotter, C.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2018, 77 (09): : 493 - 498
  • [6] The Scalability of Volunteer Computing for MapReduce Big Data Applications
    Li, Wei
    Guo, William
    [J]. DATA SCIENCE, PT 1, 2017, 727 : 153 - 165
  • [7] Big network traffic data visualization
    Ruan, Zichan
    Miao, Yuantian
    Pan, Lei
    Xiang, Yang
    Zhang, Jun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (09) : 11459 - 11487
  • [8] Big network traffic data visualization
    Zichan Ruan
    Yuantian Miao
    Lei Pan
    Yang Xiang
    Jun Zhang
    [J]. Multimedia Tools and Applications, 2018, 77 : 11459 - 11487
  • [9] MapReduce based Classification for Fault Detection in Big Data Applications
    Shafiq, M. Omair
    Fekri, Maryam
    Ibrahim, Rami
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 637 - 642
  • [10] Learning Markov Blanket Bayesian Network for Big Data in MapReduce
    Che, Yuxin
    Hong, Shaohui
    Zhang, Defu
    Zhang, Liming
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 896 - 900