Windows malware detection based on static analysis with multiple features

被引:2
|
作者
Yousuf, Muhammad Irfan [1 ]
Anwer, Izza [2 ]
Riasat, Ayesha [3 ]
Zia, Khawaja Tahir [1 ]
Kim, Suhyun [4 ]
机构
[1] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore, Pakistan
[2] Univ Engn & Technol Lahore, Dept Transportat Engn & Management, Lahore, Pakistan
[3] Univ Engn & Technol Lahore, Dept Basic Sci & Humanities, Lahore, Pakistan
[4] Korea Inst Sci & Technol, Ctr Artificial Intelligence, Seoul, South Korea
关键词
Static malware analysis; Windows PE; Machine learning; Multiple features;
D O I
10.7717/peerj-cs.1319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Malware or malicious software is an intrusive software that infects or performs harmful activities on a computer under attack. Malware has been a threat to individuals and organizations since the dawn of computers and the research community has been struggling to develop efficient methods to detect malware. In this work, we present a static malware detection system to detect Portable Executable (PE) malware in Windows environment and classify them as benign or malware with high accuracy. First, we collect a total of 27,920 Windows PE malware samples divided into six categories and create a new dataset by extracting four types of information including the list of imported DLLs and API functions called by these samples, values of 52 attributes from PE Header and 100 attributes of PE Section. We also amalgamate this information to create two integrated feature sets. Second, we apply seven machine learning models; gradient boosting, decision tree, random forest, support vector machine, K-nearest neighbor, naive Bayes, and nearest centroid, and three ensemble learning techniques including Majority Voting, Stack Generalization, and AdaBoost to classify the malware. Third, to further improve the performance of our malware detection system, we also deploy two dimensionality reduction techniques: Information Gain and Principal Component Analysis. We perform a number of experiments to test the performance and robustness of our system on both raw and selected features and show its supremacy over previous studies. By combining machine learning, ensemble learning and dimensionality reduction techniques, we construct a static malware detection system which achieves a detection rate of 99.5% and error rate of only 0.47%.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Windows malware detection based on static analysis with multiple features
    Yousuf M.I.
    Anwer I.
    Riasat A.
    Zia K.T.
    Kim S.
    [J]. PeerJ Computer Science, 2023, 9
  • [2] Sensitivity Analysis of Static Features for Android Malware Detection
    Moghaddam, Samaneh Hosseini
    Abbaspour, Maghsood
    [J]. 2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 920 - 924
  • [3] Android malware detection based on overlapping of static features
    Nezhadkamali, Maryam
    Soltani, Somayeh
    Seno, Seyed Amin Hosseini
    [J]. PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2017, : 319 - 325
  • [4] A survey of IoT malware and detection methods based on static features
    Quoc-Dung Ngo
    Huy-Trung Nguyen
    Van-Hoang Le
    Doan-Hieu Nguyen
    [J]. ICT EXPRESS, 2020, 6 (04): : 280 - 286
  • [5] Malware Classification Using Static Analysis Based Features
    Hassen, Mehadi
    Carvalho, Marco M.
    Chan, Philip K.
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 734 - 740
  • [6] Windows malware detection system based on LSVC recommended hybrid features
    Darshan, S. L. Shiva
    Jaidhar, C. D.
    [J]. JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2019, 15 (02) : 127 - 146
  • [7] Windows malware detection system based on LSVC recommended hybrid features
    S. L. Shiva Darshan
    C. D. Jaidhar
    [J]. Journal of Computer Virology and Hacking Techniques, 2019, 15 : 127 - 146
  • [8] Limits of static analysis for malware detection
    Moser, Andreas
    Kruegel, Christopher
    Kirda, Engin
    [J]. TWENTY-THIRD ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 2007, : 421 - 430
  • [9] Android malware detection based on static behavior feature analysis
    Chen C.
    Liu Y.
    Shen B.
    Cheng J.-J.
    [J]. Journal of Computers (Taiwan), 2018, 29 (06) : 243 - 253
  • [10] Discovering optimal features using static analysis and a genetic search based method for Android malware detection
    Ahmad Firdaus
    Nor Badrul Anuar
    Ahmad Karim
    Mohd Faizal Ab Razak
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 712 - 736