Applying Random Projection to the Classification of Malicious Applications using Data Mining Algorithms

被引:0
|
作者
Durand, Jan [1 ]
Atkison, Travis [1 ]
机构
[1] Louisiana Tech Univ, Dept Comp Sci, Ruston, LA 71270 USA
关键词
Malicious software detection; information retrieval; n-gram analysis; random projection; data mining;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This research is part of a continuing effort to show the viability of using random projection as a feature extraction and reduction technique in the classification of malware to produce more accurate classifiers. In this paper, we use a vector space model with n-gram analysis to produce weighted feature vectors from binary executables, which we then reduce to a smaller feature set using the random projection method proposed by Achlioptas, and the feature selection method of mutual information to produce two separate data sets. We then apply several popular machine learning algorithms including J48 decision tree, naive Bayes, support vector machines, and an instance-based learner to the data sets to produce classifiers for the detection of malicious executables. We evaluate the performance of the different classifiers and discover that using a data set reduced by random projection can improve the performance of support vector machine and instance-based learner classifiers.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Diagnosis of Diabetes by Applying Data Mining Classification Techniques Comparison of Three Data Mining Algorithms
    Daghistani, Tahani
    Alshammari, Riyad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 329 - 332
  • [2] Data mining classification algorithms
    Saouabi, Mohamed
    Ezzati, Abdellah
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2020, 15 (01): : 389 - 394
  • [3] Sequential projection pursuit using genetic algorithms for data mining of analytical data
    Guo, Q
    Wu, W
    Questier, F
    Massart, DL
    Boucon, C
    de Jong, S
    [J]. ANALYTICAL CHEMISTRY, 2000, 72 (13) : 2846 - 2855
  • [4] Applying Combinatorial Testing to Data Mining Algorithms
    Chandrasekaran, Jaganmohan
    Feng, Huadong
    Lei, Yu
    Kuhn, D. Richard
    Kacker, Raghu
    [J]. 10TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS - ICSTW 2017, 2017, : 253 - 261
  • [5] Image Classification using Hybrid Data Mining Algorithms - A Review
    Thamilselvan, P.
    Sathiaseelan, J. G. R.
    [J]. 2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [6] Review of classification algorithms for data mining
    Liu, Hongyan
    Chen, Jian
    Chen, Guoqing
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2002, 42 (06): : 727 - 730
  • [7] A Review of Classification Algorithms for Data Mining
    Li Mindong
    Chen Qingwei
    Huang Panling
    Zhou Jun
    Gong Weike
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 364 - 370
  • [8] Data mining classification algorithms: An overview
    Bardab, Saeed Ngmaldin
    Ahmed, Tarig Mohamed
    Mohammed, Tarig Abdalkarim Abdalfadil
    [J]. INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2021, 8 (02): : 1 - 5
  • [9] DATA MINING CLASSIFICATION ALGORITHMS: A SURVEY
    Mohamed, Saouabi
    Abdellah, Ezzati
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2021, 15 (01): : 45 - 50
  • [10] A Survey on Data Mining Classification Algorithms
    Umadevi, S.
    Marseline, K. S. Jeen
    [J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17), 2017, : 264 - 268