An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms

被引:3
|
作者
Dabas, Namita [1 ]
Ahlawat, Prachi [1 ]
Sharma, Prabha [1 ]
机构
[1] NorthCap Univ, Sch Engn & Technol, Gurugram, India
关键词
Malware detection; API calls; API sequences; Frequent patterns; Feature selection; Machine learning; BEHAVIORAL-ANALYSIS; CLASSIFICATION; FUSION; CALLS;
D O I
10.1007/s13369-022-07309-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the advent of internet-based technology, there has been a surge in internet-enabled devices. These devices generate massive volumes of meaningful information to accomplish several tasks. Conversely, cyber-criminals leverage this information to perform cyber-attacks. Malware is one of the most prevalent attacks in the cyber threat landscape to fulfil malicious intents of cyber-criminals. Thus, it becomes imperative to detect and prevent these malware attacks precisely to minimize the damage. A number of researchers have proved that API calls can comprehend malware behaviour accurately and can be utilized with machine learning algorithms to effectively detect malware. Therefore, this paper proposes a novel malware detection method for Windows platform based on API calls, feature selection, and machine learning algorithms. It extracts API calls information in three forms: API calls usage, API calls frequency, and API calls sequences to create three feature sets. These feature sets are enriched using TF-IDF technique and combined to create a more extensive and robust feature set, API integrated feature set. A series of experiments were conducted and results showed that API integrated feature set outperformed other feature sets by attaining 99.6% and higher accuracy for all machine learning algorithms. To address the high-dimensionality concern of API integrated feature set, this work applied several feature selection techniques and results showed that we are able to achieve 99.6-99.9% accuracy with only 9% features of API integrated feature set using hybrid feature selection and machine learning algorithms.
引用
收藏
页码:9749 / 9767
页数:19
相关论文
共 50 条
  • [1] An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms
    Namita Dabas
    Prachi Ahlawat
    Prabha Sharma
    [J]. Arabian Journal for Science and Engineering, 2023, 48 : 9749 - 9767
  • [2] An Exploratory Analysis of Feature Selection for Malware Detection with Simple Machine Learning Algorithms
    Rahman, Md Ashikur
    Islam, Syful
    Nugroho, Yusuf Sulistyo
    Al Irsyadi, Fatah Yasin
    Hossain, Md Javed
    [J]. JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2023, 19 (03) : 207 - 219
  • [3] Improving Machine Learning Models for Malware Detection Using Embedded Feature Selection Method
    Chemmakha, Mohammed
    Habibi, Omar
    Lazaar, Mohamed
    [J]. IFAC PAPERSONLINE, 2022, 55 (12): : 771 - 776
  • [4] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    [J]. JURNAL TEKNOLOGI, 2015, 77 (01):
  • [5] Malware Analysis and Detection Using Machine Learning Algorithms
    Akhtar, Muhammad Shoaib
    Feng, Tao
    [J]. SYMMETRY-BASEL, 2022, 14 (11):
  • [6] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    [J]. MATHEMATICS, 2021, 9 (21)
  • [7] A New Feature Selection Method Based on Dragonfly Algorithm for Android Malware Detection Using Machine Learning Techniques
    Guendouz, Mohamed
    Amine, Abdelmalek
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2023, 17 (01)
  • [8] Malware Detection Method using Tree-based Machine Learning Algorithms
    Okada, Satoshi
    Matsuda, Wataru
    Fujimoto, Mariko
    Mitsunaga, Takuho
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING (ICOCO), 2021, : 103 - 108
  • [9] Android malware detection applying feature selection techniques and machine learning
    Mohammad Reza Keyvanpour
    Mehrnoush Barani Shirzad
    Farideh Heydarian
    [J]. Multimedia Tools and Applications, 2023, 82 : 9517 - 9531
  • [10] Android malware detection applying feature selection techniques and machine learning
    Keyvanpour, Mohammad Reza
    Shirzad, Mehrnoush Barani
    Heydarian, Farideh
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 9517 - 9531