Utilizing statistical characteristics of N-grams for intrusion detection

Cited by: 4
Authors
Li, ZW [1]
Das, A [1]
Nandi, S [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
DOI
10.1109/CYBER.2003.1253494
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Information and infrastructure security is a serious issue of global concern. As the last line of defense for security infrastructure, intrusion detection techniques are receiving increasing attention. In this paper, an anomaly-based intrusion detection technique (ScanAID: Statistical ChAracteristics of N-grams for Anomaly-based Intrusion Detection) is proposed to detect intrusive behaviors in a computer system. The statistical properties of system call sequences are abstracted to model the normal behavior of a privileged process; the model is characterized by a vector of anomaly values of N-grams. With a reasonable definition of efficiency, two parameters, the length of an N-gram and the size of the training dataset, are optimized to obtain an efficient and compact model. Then, with the optimal modeling parameters, the flexibility and efficiency of the model are evaluated using ROC curves. Our experimental results show that the proposed statistical anomaly detection technique is promising and deserves further research (such as applying it to network environments).
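The general idea of scoring system call sequences by N-gram statistics can be illustrated with a minimal Python sketch. This is not the ScanAID algorithm: the abstract does not state how anomaly values are computed, so the formula used here (one minus the relative frequency of an N-gram in normal traces, with unseen N-grams scored 1.0) is an assumption, as are the function names `ngrams`, `train`, and `score` and the example system call names.

```python
from collections import Counter


def ngrams(trace, n):
    """Return all contiguous n-grams (as tuples) of a system call trace."""
    return [tuple(trace[i:i + n]) for i in range(len(trace) - n + 1)]


def train(normal_traces, n):
    """Map each n-gram seen in normal traces to an anomaly value in [0, 1).

    ASSUMPTION: anomaly value = 1 - count / (count of the most frequent
    n-gram). The paper's actual definition is not given in the abstract.
    """
    counts = Counter(g for t in normal_traces for g in ngrams(t, n))
    if not counts:
        return {}
    top = max(counts.values())
    return {g: 1.0 - c / top for g, c in counts.items()}


def score(trace, model, n):
    """Mean anomaly value of a trace; n-grams unseen in training count as 1.0."""
    grams = ngrams(trace, n)
    if not grams:
        return 0.0
    return sum(model.get(g, 1.0) for g in grams) / len(grams)


if __name__ == "__main__":
    # Hypothetical traces of a privileged process (illustrative only).
    normal = [["open", "read", "write", "close"]] * 3
    suspect = ["open", "exec", "chmod", "close"]

    model = train(normal, n=2)
    print(score(normal[0], model, n=2))
    print(score(suspect, model, n=2))
```

Flagging a trace as intrusive when its score exceeds a threshold, and sweeping that threshold, would produce the kind of ROC curve the abstract mentions; the N-gram length `n` and the number of training traces correspond to the two parameters the authors report optimizing.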
Pages: 486-493
Page count: 8