A Novel Support Vector Machine Approach to High Entropy Data Fragment Classification

被引:0
|
作者
Li, Q. [1 ]
Ong, A. [2 ]
Suganthan, P. [2 ]
Thing, V. [1 ]
机构
[1] Inst Infocomm Res, Cryptog & Secur Dept, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Elect Elect Engn, Singapore, Singapore
关键词
Data classification; support vector machine; digital forensics;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A major challenge in digital forensics is the efficient and accurate file type classification of a fragment of evidence data, in the absence of header and file system information. A typical approach to this problem is to classify the fragment based on simple statistics, such as the entropy and the statistical distance of byte histograms. This approach is ineffective when dealing with high entropy data, such as multimedia and compressed files, all of which often appear to be random. We propose a method incorporating a support vector machine (SVM). In particular, we extract feature vectors from the byte frequencies of a given fragment, and use an SVM to predict the type of the fragment under supervised learning. Our method is efficient and achieves high accuracy for high entropy data fragments.
引用
收藏
页码:236 / 247
页数:12
相关论文
共 50 条
  • [1] Data Classification with Support Vector Machine and Generalized Support Vector Machine
    Qi, Xiaomin
    Silvestrov, Sergei
    Nazir, Talat
    ICNPAA 2016 WORLD CONGRESS: 11TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES, 2017, 1798
  • [2] Support vector machine classification trees based on fuzzy entropy of classification
    Harrington, Peter de Boves
    ANALYTICA CHIMICA ACTA, 2017, 954 : 14 - 21
  • [3] Support vector machine approach for fast classification
    Kianmehr, Keivan
    Alhajj, Reda
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 534 - 543
  • [4] Weighted support vector machine for data classification
    Yang, XL
    Song, Q
    Cao, AZ
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 859 - 864
  • [5] Support vector machine for functional data classification
    Rossi, F
    Villa, N
    NEUROCOMPUTING, 2006, 69 (7-9) : 730 - 742
  • [6] A weighted support vector machine for data classification
    Yang, Xulei
    Song, Qing
    Wang, Yue
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 961 - 976
  • [7] A Novel Approach for the Brain tumor detection and Classification using Support Vector Machine
    Shankaragowda, B. B.
    Siddappa, M.
    Suresha, M.
    PROCEEDINGS OF THE 2017 3RD INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2017, : 90 - 93
  • [8] A novel twin-support vector machine for binary classification to imbalanced data
    Li, Jingyi
    Chao, Shiwei
    DATA TECHNOLOGIES AND APPLICATIONS, 2023, 57 (03) : 385 - 396
  • [9] A novel support vector machine with generalized pinball loss for uncertain data classification
    Damminsed, Vipavee
    Panup, Wanida
    Makmuang, Dawrawee
    Suppalap, Siwakon
    Wangkeeree, Rabian
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2023, 46 (18) : 18729 - 18748
  • [10] A novel robust twin support vector machine for classification
    Cheng, Haoxiang
    Wang, Jian
    Journal of Computational Information Systems, 2015, 11 (12): : 4421 - 4427