Deep Learning and Regularization Algorithms for Malicious Code Classification

被引:4
|
作者
Wang, Haojun [1 ]
Long, Haixia [1 ]
Wang, Ailan [2 ]
Liu, Tianyue [1 ]
Fu, Haiyan [1 ]
机构
[1] Hainan Normal Univ, Sch Informat Sci & Technol, Haikou 571158, Hainan, Peoples R China
[2] Geneis Beijing Co Ltd, Beijing 100102, Peoples R China
基金
中国国家自然科学基金; 海南省自然科学基金;
关键词
Malware; Feature extraction; Deep learning; Convolutional neural networks; Classification algorithms; Machine learning algorithms; Support vector machines; Malicious code classification; deep learning; convolutional neural networks; N-gram; regularization algorithm; MALWARE DETECTION; FEATURE-EXTRACTION; N-GRAM; FRAMEWORK;
D O I
10.1109/ACCESS.2021.3090464
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network security has become a growing concern within the popularity and development of the Internet. Malicious code is one of the main threats to network security. Different types of malicious code have different functions and cause different harms. Therefore, improving the detection efficiency and recognition accuracy of malicious code is becoming an urgent problem to be solved. While traditional machine learning methods for malicious code detection largely depend on hand-designed features with experts' knowledge of the domain or focus on the images which come from malicious code binary files. These methods spend too much time on feature extraction. With the emergence of a large amount of malicious code data, the efficiency of traditional machine learning algorithms is getting worse and worse. In this paper, a workflow based on deep learning is proposed to detect and classify malicious codes. This workflow adopts a convolutional neural network (CNN) and the regularization algorithms to classify malicious code with N_gram semantic feature as input of the model. The convolutional neural network can automatically extract the features of malicious code while avoiding the need for manual feature selection. Regularization algorithms not only speed up the training process of the deep model but also improve the generalization ability in the case of effective prevention of over-fitting of the model. The proposed method is compared with the state-of-the-art methods and other deep learning models. Experimental results show that our workflow can improve the accuracy and efficiency of malicious code classification.
引用
收藏
页码:91512 / 91523
页数:12
相关论文
共 50 条
  • [1] Malicious Code Classification Method Based on Deep Forest
    Lu, Xi-Dong
    Duan, Zhe-Min
    Qian, Ye-Kui
    Zhou, Wei
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): : 1454 - 1464
  • [2] Detection of Malicious Code Variants Based on Deep Learning
    Cui, Zhihua
    Xue, Fei
    Cai, Xingjuan
    Cao, Yang
    Wang, Gai-ge
    Chen, Jinjun
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3187 - 3196
  • [3] Malicious Classification Based on Deep Learning and Visualization
    Wang Jun-ling
    Wang Shuo-hao
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 223 - 228
  • [4] Android malicious code Classification using Deep Belief Network
    Luo Shiqi
    Tian Shengwei
    Yu Long
    Yu Jiong
    Sun Hua
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (01): : 454 - 475
  • [5] Malicious Website Detection Through Deep Learning Algorithms
    Gutierrez, Norma
    Otero, Beatriz
    Rodriguez, Eva
    Canal, Ramon
    [J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT I, 2022, 13163 : 512 - 526
  • [6] Improved Malicious Code Classification Considering Sequence by Machine Learning
    Paik, Incheon
    [J]. 18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,
  • [7] Malicious code clone detection technology based on deep learning
    Shen, Yuan
    Yan, Hanbing
    Xia, Chunhe
    Han, Zhihui
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (02): : 282 - 290
  • [8] A deep learning approach for detecting malicious Java']JavaScript code
    Wang, Yao
    Cai, Wan-dong
    Wei, Peng-cheng
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (11) : 1520 - 1534
  • [9] A Hybrid Malicious Code Detection Method based on Deep Learning
    Li, Yuancheng
    Ma, Rong
    Jiao, Runhai
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (05): : 205 - 215
  • [10] Detection Approach of Malicious JavaScript Code Based on deep learning
    Zheng, Liyuan
    Zhang, Dongcheng
    Xie, Xin
    Wang, Chen
    Hou, Boyuan
    [J]. Proceedings of 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence, ICIBA 2023, 2023, : 1075 - 1079