Toward Scalable Internet Traffic Measurement and Analysis with Hadoop

Cited by: 1
Authors
Lee, Yeonhee [1]
Lee, Youngseok [1]
Affiliation
[1] Chungnam Natl Univ, Dept Comp Engn, Daejon, South Korea
Keywords
Hadoop; Hive; MapReduce; NetFlow; pcap; packet; traffic measurement; analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Internet traffic measurement and analysis has long been used to characterize network usage and user behavior, but it faces a scalability problem under the explosive growth of Internet traffic and high-speed access. Scalable Internet traffic measurement and analysis is difficult because a large data set requires matching computing and storage resources. Hadoop, an open-source computing platform that combines MapReduce with a distributed file system, has become a popular infrastructure for massive data analytics because it facilitates scalable data processing and storage services on a distributed computing system consisting of commodity hardware. In this paper, we present a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multiple terabytes of Internet traffic in a scalable manner. In experiments on a 200-node testbed, we achieved 14 Gbps throughput for 5 TB files with IP and HTTP-layer analysis MapReduce jobs. We also explain the performance issues related to traffic analysis MapReduce jobs.
Pages: 6 - 13
Page count: 8
Related papers
50 in total
  • [1] A study on improvement of Internet Traffic Measurement and Analysis Using Hadoop System
    Ibrahim, Lena T.
    Hassan, Rosilah
    Ahmad, Kamsuriah
    Asat, Asrul Nizam
    5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 462 - 466
  • [2] Hobbits: Hadoop and Hive Based Internet Traffic Analysis
    Hendawi, Abdellawab M.
    Allah, Fatemah
    Wang, Xiaoyu
    Guan, Yunfei
    Zhou, Tianshu
    Hu, Xiao
    Basit, Nada
    Stankovic, John A.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2590 - 2599
  • [3] Toward an efficient and scalable feature selection approach for internet traffic classification
    Fahad, Adil
    Tari, Zahir
    Khalil, Ibrahim
    Habib, Ibrahim
    Alnuweiri, Hussein
    COMPUTER NETWORKS, 2013, 57 (09) : 2040 - 2057
  • [4] Internet traffic measurement
    Williamson, C
    IEEE INTERNET COMPUTING, 2001, 5 (06) : 70 - 74
  • [5] Asymmetric characteristics of Internet based on traffic measurement and analysis
    Katsuno, S
    Sugauchi, K
    Tsunehiro, O
    Yamazaki, K
    Yoshida, K
    Esaki, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (11) : 2300 - 2309
  • [6] The Internet is For Porn: Measurement and Analysis of Online Adult Traffic
    Ahmed, Faraz
    Shafiq, M. Zubair
    Liu, Alex X.
    PROCEEDINGS 2016 IEEE 36TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2016, 2016, : 88 - 97
  • [7] Traffic Measurement and Analysis of a Broadband Wireless Internet Access
    Pries, Rastin
    Wamser, Florian
    Staehle, Dirk
    Heck, Klaus
    Tran-Gia, Phuoc
    2009 IEEE VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-5, 2009, : 2998 - +
  • [8] Internet traffic measurement: A critical study of wavelet analysis
    Benetazzo, Luigino
    Narduzzi, Claudio
    Pegoraro, Paolo Attilio
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2007, 56 (03) : 800 - 806
  • [9] Toward Reliable and Scalable Internet of Vehicles: Performance Analysis and Resource Management
    Ni, Yuanzhi
    Cai, Lin
    He, Jianping
    Vinel, Alexey
    Li, Yue
    Mosavat-Jahromi, Hamed
    Pan, Jianping
    PROCEEDINGS OF THE IEEE, 2020, 108 (02) : 324 - 340
  • [10] Flow Identification and Characteristics Mining from Internet Traffic with Hadoop
    Cai, Yuanjun
    Wu, Bin
    Zhang, Xinwei
    Luo, Min
    Su, Jinzhao
    2014 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2014,