Statistical Profiling of n-grams for Payload Based Anomaly Detection for HTTP Web Traffic

被引:0
|
作者
Pal, Rajarshi [1 ]
Chowdary, Naveen [2 ]
机构
[1] IDRBT, Hyderabad, India
[2] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad, India
关键词
SYSTEM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Anomalous HTTP traffic can be identified by analysing the content of HTTP packet as payload. n-gram analysis is a prominent technique for payload analysis. In this paper, a novel n-gram based anomaly detection method has been proposed for HTTP traffic. During the training phase, statistical profiling (the maximum, the minimum, the median and the average of number of occurrences in a packet) of n-grams for a data set of normal (not malicious) HTTP packets provides the basis for this work. In a test packet, the number of occurrences of an n-gram decides whether the n-gram is anomalous or not. Moreover, the deviation of number of occurrences of such an anomalous n-gram from the median (or the average) of number of occurrences of the n-gram in training packets is considered for estimating an anomaly score of the test packet. Consideration of this magnitude of the deviation from the statistical profile (median or average) of n-gram occurrences for a normal HTTP traffic is the highlight of the proposed method. Finally, an anomaly-to-normal ratio for the test packet determines whether it is malicious or normal. This technique yields better performance as compared to an existing n-gram based method of anomalous HTTP traffic detection.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Layered higher order n-grams for hardening payload based anomaly intrusion detection
    Hubballi, Neminath
    Biswas, Santosh
    Nandi, Sukumar
    [J]. FIFTH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY: ARES 2010, PROCEEDINGS, 2010, : 321 - 326
  • [2] Anomaly Detection for Automotive Diagnostic Applications based on N-grams
    Rumez, Marcel
    Lin, Jinghua
    FuchB, Thomas
    Kriesten, Reiner
    Sax, Eric
    [J]. 2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1423 - 1429
  • [3] Automatic statistical translation based on n-grams
    Oliver, Antonio
    Badia, Toni
    Boleda, Gemma
    Melero, Maite
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 77 - 84
  • [4] Utilizing statistical characteristics of N-grams for intrusion detection
    Li, ZW
    Das, A
    Nandi, S
    [J]. 2003 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2003, : 486 - 493
  • [5] Rangegram: A Novel Payload based Anomaly Detection Technique Against Web Traffic
    Swarnkar, Mayank
    Hubballi, Neminath
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNCATIONS SYSTEMS (ANTS), 2015,
  • [6] On Automatic Plagiarism Detection Based on n-Grams Comparison
    Barron-Cedeno, Alberto
    Rosso, Paolo
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 696 - 700
  • [7] Malware Detection and Classification Based on n-grams Attribute Similarity
    Zhang Fuyong
    Zhao Tiezhou
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 793 - 796
  • [8] Micro-signatures: The Effectiveness of Known Bad N-Grams for Network Anomaly Detection
    Harang, Richard
    Mell, Peter
    [J]. FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2016, 2017, 10128 : 36 - 47
  • [9] Effective Dimensionality Reduction of Payload-Based Anomaly Detection in TMAD Model for HTTP Payload
    Kakavand, Mohsen
    Mustapha, Norwati
    Mustapha, Aida
    Abdullah, Mohd Taufik
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (08): : 3884 - 3910
  • [10] Learning Visual N-Grams from Web Data
    Li, Ang
    Jabri, Allan
    Joulin, Armand
    van der Maaten, Laurens
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4193 - 4202