A Hadoop-Based Packet Trace Processing Tool

被引:0
|
作者
Lee, Yeonhee [1 ]
Kang, Wonchul [1 ]
Lee, Youngseok [1 ]
机构
[1] Chungnam Natl Univ, Taejon 305764, South Korea
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Internet traffic measurement and analysis has become a significantly challenging job because large packet trace files captured on fast links could not be easily handled on a single server with limited computing and memory resources. Hadoop is a popular open-source cloud computing platform that provides a software programming framework called MapReduce and the distributed filesystem, HDFS, which are useful for analyzing a large data set. Therefore, in this paper, we present a Hadoop-based packet processing tool that provides scalability for a large data set by harnessing Map Reduce and HDFS. To tackle large packet trace files in Hadoop efficiently, we devised a new binary input format, called PcapInputFormat, hiding the complexity of processing binary-formatted packet data and parsing each packet record. We also designed efficient traffic analysis MapReduce job models consisting of map and reduce functions. To evaluate our tool, we compared its computation time with a well-known packet-processing tool, CoralReef, and showed that our approach is more affordable to process a large set of packet data.
引用
收藏
页码:51 / 63
页数:13
相关论文
共 50 条
  • [1] Data processing of hadoop-based wide area measurement system
    Zhu, L. (Zhulizb@sina.com), 1600, Automation of Electric Power Systems Press (37):
  • [2] Performance Analysis of Hadoop-Based SQL and NoSQL for Processing Log Data
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    Won, Hee-Sun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, 2015, 9052 : 293 - 299
  • [3] Hadoop-based Distributed Computing Algorithms for Healthcare and Clinic Data Processing
    Ni, Jun
    Chen, Ying
    Sha, Jie
    Zhang, Minghuan
    2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 188 - 193
  • [4] Hadoop-based Genome Comparisons
    Heinzlreiter, Paul
    Krieger, Michael T.
    Leitner, Iris
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 695 - 701
  • [5] A HADOOP-BASED DATA PROCESSING PLATFORM FOR FRESH AGRO-PRODUCTS TRACEABILITY
    Xu, Mark
    Siraj, Sajid
    Qi, Lin
    PROCEEDINGS OF THE EUROPEAN CONFERENCE ON DATA MINING 2015 AND INTERNATIONAL CONFERENCES ON INTELLIGENT SYSTEMS AND AGENTS 2015 AND THEORY AND PRACTICE IN MODERN COMPUTING 2015, 2015, : 37 - 44
  • [6] A Hadoop-based Molecular Docking System
    Dong, Yueli
    Guo, Quan
    Sun, Bin
    2017 INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGY AND COMMUNICATION ENGINEERING (CTCE2017), 2017, 910
  • [7] Investigation on Hadoop-based distributed search engine
    Chen, Ning
    Xiangyang, Chai
    Journal of Software Engineering, 2014, 8 (03): : 127 - 131
  • [8] Hadoop-based Implementation of Processing Medical Diagnostic Records for Visual Patient System
    Yang, Yuanyuan
    Shi, Liehang
    Xie, Zhe
    Zhang, Jianguo
    MEDICAL IMAGING 2018: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2018, 10579
  • [9] Hadoop-based Model of Mass Data Storage
    Yang, Li
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 632 - 634
  • [10] A Hadoop-Based Online Teaching Model of "VisibleBody"
    Deng, Haiyan
    Li, Chunyan
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2021, 16 (11) : 46 - 58