Toward Scalable Internet Traffic Measurement and Analysis with Hadoop

被引:1
|
作者
Lee, Yeonhee [1 ]
Lee, Youngseok [1 ]
机构
[1] Chungnam Natl Univ, Dept Comp Engn, Daejon, South Korea
关键词
Hadoop; Hive; MapReduce; NetFlow; pcap; packet; traffic measurement; analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Internet traffic measurement and analysis has long been used to characterize network usage and user behaviors, but faces the problem of scalability under the explosive growth of Internet traffic and high-speed access. Scalable Internet traffic measurement and analysis is difficult because a large data set requires matching computing and storage resources. Hadoop, an open-source computing platform of MapReduce and a distributed file system, has become a popular infrastructure for massive data analytics because it facilitates scalable data processing and storage services on a distributed computing system consisting of commodity hardware. In this paper, we present a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multi-terabytes of Internet traffic in a scalable manner. From experiments with a 200-node testbed, we achieved 14 Gbps throughput for 5 TB files with IP and HTTP-layer analysis MapReduce jobs. We also explain the performance issues related with traffic analysis MapReduce jobs.
引用
收藏
页码:6 / 13
页数:8
相关论文
共 50 条
  • [31] Compressive Traffic Analysis: A New Paradigm for Scalable Traffic Analysis
    Nasr, Milad
    Houmansadr, Amir
    Mazumdar, Arya
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 2053 - 2069
  • [32] Toward Composable Network Traffic Measurement
    di Pietro, Andrea
    Huici, Felipe
    Bonelli, Nicola
    Trammell, Brian
    Kastovsky, Petr
    Groleat, Tristan
    Vaton, Sandrine
    Dusi, Maurizio
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 70 - 74
  • [33] Internet traffic characterization - An analysis of traffic oscillations
    Owezarski, P
    Larrieu, N
    HIGH SPEED NETWORKS AND MULTIMEDIA COMMUNICATIONS, PROCEEDINGS, 2004, 3079 : 96 - 107
  • [34] Internet traffic measurement and characteristic analysis on output link of metro area network
    College of Computer and Communication, Hunan University, Changsha 410080, China
    不详
    不详
    Tien Tzu Hsueh Pao, 2007, 11 (2092-2097):
  • [35] Capacity dimensioning based on traffic measurement in the Internet
    Matoba, K
    Ata, S
    Murata, M
    GLOBECOM '01: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6, 2001, : 2532 - 2536
  • [36] Toward Scalable Emulation of Future Internet Applications with Simulation Symbiosis
    Liu, Jason
    Marcondes, Cesar
    Ahmed, Musa
    Rong, Rong
    2015 IEEE/ACM 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2015, : 68 - 77
  • [37] Toward Secure and Scalable Computation in Internet of Things Data Applications
    Yuan, Xu
    Yuan, Xingliang
    Li, Baochun
    Wang, Cong
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) : 3753 - 3763
  • [38] Internet Traffic Analysis at Scale
    Feldmann, Anja
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (13): : 3415 - 3415
  • [39] Analysis of Internet Traffic in Ecuador
    Ponce, David
    Tipantuna, Christian
    Espinosa, Cristian
    IEEE ACCESS, 2023, 11 : 126365 - 126385
  • [40] Toward dynamic phenotypes and the scalable measurement of human behavior
    Laura Germine
    Roger W. Strong
    Shifali Singh
    Martin J. Sliwinski
    Neuropsychopharmacology, 2021, 46 : 209 - 216