A Hadoop-Based Packet Trace Processing Tool

Authors
Lee, Yeonhee [1]
Kang, Wonchul [1]
Lee, Youngseok [1]
Affiliation
[1] Chungnam Natl Univ, Taejon 305764, South Korea
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Internet traffic measurement and analysis has become a challenging task because the large packet trace files captured on fast links cannot be easily handled on a single server with limited computing and memory resources. Hadoop is a popular open-source cloud computing platform that provides a software programming framework, MapReduce, and a distributed filesystem, HDFS, both of which are useful for analyzing large data sets. In this paper, we therefore present a Hadoop-based packet processing tool that scales to large data sets by harnessing MapReduce and HDFS. To handle large packet trace files in Hadoop efficiently, we devised a new binary input format, PcapInputFormat, which hides the complexity of processing binary-formatted packet data and parsing each packet record. We also designed efficient traffic analysis MapReduce job models consisting of map and reduce functions. To evaluate our tool, we compared its computation time with that of a well-known packet-processing tool, CoralReef, and showed that our approach is more affordable for processing a large set of packet data.
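The two ideas named in the abstract, parsing binary packet records and expressing an analysis as map and reduce functions, can be illustrated with a minimal single-process sketch. This is not the authors' PcapInputFormat (which is a Hadoop input format) and says nothing about how their tool splits files across HDFS blocks. It assumes a classic microsecond-resolution pcap file with Ethernet link type, and every function name below is hypothetical.

```python
import struct
from collections import defaultdict

PCAP_GLOBAL_HEADER_LEN = 24   # fixed-size file header of the classic pcap format
PCAP_RECORD_HEADER_LEN = 16   # ts_sec, ts_usec, incl_len, orig_len (4 bytes each)


def read_packets(data):
    """Yield (ts_sec, orig_len, captured_bytes) for each record in a pcap byte string."""
    magic = data[:4]
    if magic == b"\xd4\xc3\xb2\xa1":
        endian = "<"          # file written on a little-endian host
    elif magic == b"\xa1\xb2\xc3\xd4":
        endian = ">"          # file written on a big-endian host
    else:
        raise ValueError("not a classic microsecond pcap file")
    offset = PCAP_GLOBAL_HEADER_LEN
    while offset + PCAP_RECORD_HEADER_LEN <= len(data):
        ts_sec, _ts_usec, incl_len, orig_len = struct.unpack_from(
            endian + "IIII", data, offset
        )
        offset += PCAP_RECORD_HEADER_LEN
        yield ts_sec, orig_len, data[offset:offset + incl_len]
        offset += incl_len


def map_packet(orig_len, frame):
    """Map step: emit (source IPv4 address, bytes on the wire) for Ethernet/IPv4 frames."""
    # Assumption: 14-byte Ethernet header, EtherType 0x0800 means IPv4.
    if len(frame) >= 34 and frame[12:14] == b"\x08\x00":
        src = ".".join(str(b) for b in frame[26:30])
        yield src, orig_len


def reduce_bytes(pairs):
    """Reduce step: sum the byte counts emitted for each key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def bytes_per_source(data):
    """Run the map and reduce steps over every packet record in the trace."""
    pairs = (
        kv
        for _ts, orig_len, frame in read_packets(data)
        for kv in map_packet(orig_len, frame)
    )
    return reduce_bytes(pairs)
```

In a real Hadoop job the record reader, the map function, and the reduce function would run as separate distributed tasks; here they are chained in one process only to show how the per-packet key/value pairs flow from parsing to aggregation.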
Pages: 51-63
Page count: 13