Automatic malware classification and new malware detection using machine learning

被引:63
|
作者
Liu, Liu [1 ]
Wang, Bao-sheng [1 ]
Yu, Bo [1 ]
Zhong, Qiu-xi [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Malware classification; Machine learning; n-gram; Gray-scale image; Feature extraction; Malware detection;
D O I
10.1631/FITEE.1601325
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The explosive growth of malware variants poses a major threat to information security. Traditional anti-virus systems based on signatures fail to classify unknown malware into their corresponding families and to detect new kinds of malware programs. Therefore, we propose a machine learning based malware analysis system, which is composed of three modules: data processing, decision making, and new malware detection. The data processing module deals with gray-scale images, Opcode n-gram, and import functions, which are employed to extract the features of the malware. The decision-making module uses the features to classify the malware and to identify suspicious malware. Finally, the detection module uses the shared nearest neighbor (SNN) clustering algorithm to discover new malware families. Our approach is evaluated on more than 20 000 malware instances, which were collected by Kingsoft, ESET NOD32, and Anubis. The results show that our system can effectively classify the unknown malware with a best accuracy of 98.9%, and successfully detects 86.7% of the new malware.
引用
收藏
页码:1336 / 1347
页数:12
相关论文
共 50 条
  • [11] Android Malware Detection Using Machine Learning
    Droos, Ayat
    Al-Mahadeen, Awss
    Al-Harasis, Tasnim
    Al-Attar, Rama
    Ababneh, Mohammad
    [J]. 2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 36 - 41
  • [12] Malware Classification Using Probability Scoring and Machine Learning
    Xue, Di
    Li, Jingmei
    Lv, Tu
    Wu, Weifei
    Wang, Jiaxiang
    [J]. IEEE ACCESS, 2019, 7 : 91641 - 91656
  • [13] Macro Malware Detection using Machine Learning Techniques A New Approach
    De los Santos, Sergio
    Torres, Jose
    [J]. ICISSP: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2017, : 295 - 302
  • [14] A Comparative Analysis of Machine Learning Techniques for Classification and Detection of Malware
    Al-Janabi, Maryam
    Altamimi, Ahmad Mousa
    [J]. 2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [15] Android Malware Detection Using Machine Learning Technique
    Sabri, Nor ‘Afifah
    Khamis, Shakiroh
    Zainudin, Zanariah
    [J]. Lecture Notes on Data Engineering and Communications Technologies, 2024, 211 : 153 - 164
  • [16] Detection of Malware in the Network Using Machine Learning Techniques
    Yogesh, B.
    Reddy, G.Suresh
    [J]. Proceedings - 2022 International Conference on Recent Trends in Microelectronics, Automation, Computing and Communications Systems, ICMACC 2022, 2022, : 204 - 211
  • [17] PDF Malware Detection Using Visualization and Machine Learning
    Liu, Ching-Yuan
    Chiu, Min-Yi
    Huang, Qi-Xian
    Sun, Hung-Min
    [J]. DATA AND APPLICATIONS SECURITY AND PRIVACY XXXV, 2021, 12840 : 209 - 220
  • [18] Malware Analysis and Detection Using Machine Learning Algorithms
    Akhtar, Muhammad Shoaib
    Feng, Tao
    [J]. SYMMETRY-BASEL, 2022, 14 (11):
  • [19] Malware Detection using Malware Image and Deep Learning
    Choi, Sunoh
    Jang, Sungwook
    Kim, Youngsoo
    Kim, Jonghyun
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1193 - 1195
  • [20] Android Malware Detection Using Machine Learning: A Review
    Chowdhury, Naseef-Ur-Rahman
    Haque, Ahshanul
    Soliman, Hamdy
    Hossen, Mohammad Sahinur
    Fatima, Tanjim
    Ahmed, Imtiaz
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023, 2024, 824 : 507 - 522