Toward Scalable Internet Traffic Measurement and Analysis with Hadoop

Cited by: 1
Authors
Lee, Yeonhee [1]
Lee, Youngseok [1]
Affiliations
[1] Chungnam Natl Univ, Dept Comp Engn, Daejon, South Korea
Keywords
Hadoop; Hive; MapReduce; NetFlow; pcap; packet; traffic measurement; analysis
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Internet traffic measurement and analysis has long been used to characterize network usage and user behaviors, but faces the problem of scalability under the explosive growth of Internet traffic and high-speed access. Scalable Internet traffic measurement and analysis is difficult because a large data set requires matching computing and storage resources. Hadoop, an open-source computing platform of MapReduce and a distributed file system, has become a popular infrastructure for massive data analytics because it facilitates scalable data processing and storage services on a distributed computing system consisting of commodity hardware. In this paper, we present a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multiple terabytes of Internet traffic in a scalable manner. From experiments with a 200-node testbed, we achieved 14 Gbps throughput for 5 TB files with IP and HTTP-layer analysis MapReduce jobs. We also explain the performance issues related to traffic analysis MapReduce jobs.
Pages: 6 - 13
Page count: 8
Related papers
50 in total
  • [41] Toward dynamic phenotypes and the scalable measurement of human behavior
    Germine, Laura
    Strong, Roger W.
    Singh, Shifali
    Sliwinski, Martin J.
    NEUROPSYCHOPHARMACOLOGY, 2021, 46 (01) : 209 - 216
  • [42] Internet traffic measurement and analysis in a high speed network environment: Workload and flow characteristics
    Park, JS
    Lee, JY
    Lee, SB
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2000, 2 (03) : 287 - 296
  • [43] A Scalable Approach to Tomography-based Internet Measurement System
    Tagami, Atsushi
    Hasegawa, Teruyuki
    Ano, Shigehiro
    Hasegawa, Toru
    2006 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-12, 2006, : 489 - 494
  • [44] FairNet: A Measurement Framework for Traffic Discrimination Detection on the Internet
    Khandkar, Vinod S.
    Hanawal, Manjesh K.
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (04) : 4097 - 4109
  • [45] On the Duration and Spatial Characteristics of Internet Traffic Measurement Experiments
    Garcia-Dorado, Jose Luis
    Hernandez, Jose Alberto
    Aracil, Javier
    de Vergara, Jorge E. Lopez
    Monserrat, Francisco J.
    Robles, Esther
    de Miguel, Tomas P.
    IEEE COMMUNICATIONS MAGAZINE, 2008, 46 (11) : 148 - 155
  • [46] Traffic measurement mechanisms for high precision Internet applications
    Wang, Zhenqi
    Liu, Jin
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 1, PROCEEDINGS, 2007, : 66 - +
  • [47] Internet Traffic Characterization based on Active Network Measurement
    Liu, Jun
    Chen, Bochuan
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [48] An Adaptive Sampling Methodology for Internet Traffic Data Measurement
    Zeng, Bin
    Zhang, Dafang
    Li, Wenwei
    Zhang, Mei
    Hong, Qiao
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS, 2009, : 215 - +
  • [49] Internet traffic tends toward Poisson and independent as the load increases
    Cao, J
    Cleveland, WS
    Lin, D
    Sun, DX
    NONLINEAR ESTIMATION AND CLASSIFICATION, 2003, 171 : 83 - 109
  • [50] Feature Selection Toward Optimizing Internet Traffic Behavior Identification
    Chen, Zhenxiang
    Peng, Lizhi
    Zhao, Shupeng
    Zhang, Lei
    Jing, Shan
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT II, 2014, 8631 : 631 - 644