Malware Classification Using Probability Scoring and Machine Learning

被引:25
|
作者
Xue, Di [1 ]
Li, Jingmei [1 ]
Lv, Tu [1 ]
Wu, Weifei [1 ]
Wang, Jiaxiang [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Tc & Mol, Harbin 150001, Heilongjiang, Peoples R China
关键词
Grayscale image; native API call; malware; machine learning; probability scoring; static and dynamic analysis; NETWORKS;
D O I
10.1109/ACCESS.2019.2927552
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malware classification plays an important role in tracing the attack sources of computer security. However, existing static analysis methods are fast in classification, but they are inefficient in some malware using packing and obfuscation techniques; the dynamic analysis methods have better universality for packing and obfuscation, but they will cause excessive classification cost. To overcome these shortcomings, in this paper, we propose a classification system Malscore based on the probability scoring and machine learning, which sets the probability threshold to concatenate static analysis (called Phase 1) and dynamic analysis (called Phase 2). The convolutional neural networks with spatial pyramid pooling were used to analyze the grayscale images (static features) in Phase 1, and the variable n-grams and machine learning were used to analyze the native API call sequences (dynamic features) in Phase 2. Malscore combined static analysis with dynamic analysis not only accelerated the static analysis process by taking advantage of the CNN in image recognition but also appeared to be more resilient to obfuscation by the dynamic analysis. Different from other static and dynamic analysis techniques, when malware is detected, due to the fact that malware will most likely be labeled only by static analysis, we could reduce the overheads by dynamically analyzing a few malware that has less obvious features or greater confusion in static analysis. We performed experiments on 174 607 malware samples from 63 malware families. The result showed that Malscore achieved 98.82% accuracy for malware classification. Furthermore, Malscore was compared with the method of using static and dynamic analysis. The preprocessing and test time represented a reduction of 59.58% and 61.70%, respectively.
引用
收藏
页码:91641 / 91656
页数:16
相关论文
共 50 条
  • [1] Malware Classification Using Machine Learning
    Savard, Nolan
    Feinauer, David M.
    Alghazo, Jaafar M.
    Abdelhamid, Sherif E.
    [J]. SOUTHEASTCON 2024, 2024, : 843 - 847
  • [2] Automatic malware classification and new malware detection using machine learning
    Liu, Liu
    Wang, Bao-sheng
    Yu, Bo
    Zhong, Qiu-xi
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (09) : 1336 - 1347
  • [3] Automatic malware classification and new malware detection using machine learning
    Liu Liu
    Bao-sheng Wang
    Bo Yu
    Qiu-xi Zhong
    [J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 1336 - 1347
  • [4] A Novel Malware Analysis for Malware Detection and Classification using Machine Learning Algorithms
    Sethi, Kamalakanta
    Chaudhary, Shankar Kumar
    Tripathy, Bata Krishan
    Bera, Padmalochan
    [J]. SIN'17: PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS, 2017, : 107 - 113
  • [5] Analysis and Classification of Android Malware using Machine Learning Algorithms
    Tarar, Neha
    Sharma, Shweta
    Krishna, C. Rama
    [J]. PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 738 - 743
  • [6] Malware Classification Approaches Using Machine Learning Techniques: A Review
    Naik, Shivarti
    Dessai, Amita
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 111 - 117
  • [7] Analysis of Malware Behavior: Type Classification using Machine Learning
    Pirscoveanu, Radu S.
    Hansen, Steven S.
    Larsen, Thor M. T.
    Stevanovic, Matija
    Pedersen, Jens Myrup
    Czech, Alexandre
    [J]. 2015 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA), 2015,
  • [8] Analysis of Malware Behavior: Type Classification using Machine Learning
    Pirscoveanu, Radu S.
    Hansen, Steven S.
    Larsen, Thor M. T.
    Stevanovic, Matija
    Pedersen, Jens Myrup
    Czech, Alexandre
    [J]. 2015 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA), 2015,
  • [9] A Novel Malware Analysis Framework for Malware Detection and Classification using Machine Learning Approach
    Sethi, Kamalakanta
    Chaudhary, Shankar Kumar
    Tripathy, Bata Krishan
    Bera, Padmalochan
    [J]. ICDCN'18: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2018,
  • [10] Machine learning aided Android malware classification
    Milosevic, Nikola
    Dehghantanha, Ali
    Choo, Kitn-Kwang Raymond
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 : 266 - 274