Deep Learning and Regularization Algorithms for Malicious Code Classification

被引：4

作者：

Wang, Haojun ^{[1
]}

Long, Haixia ^{[1
]}

Wang, Ailan ^{[2
]}

Liu, Tianyue ^{[1
]}

Fu, Haiyan ^{[1
]}

机构：

[1] Hainan Normal Univ, Sch Informat Sci & Technol, Haikou 571158, Hainan, Peoples R China

[2] Geneis Beijing Co Ltd, Beijing 100102, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金; 海南省自然科学基金;

关键词：

Malware; Feature extraction; Deep learning; Convolutional neural networks; Classification algorithms; Machine learning algorithms; Support vector machines; Malicious code classification; deep learning; convolutional neural networks; N-gram; regularization algorithm; MALWARE DETECTION; FEATURE-EXTRACTION; N-GRAM; FRAMEWORK;

D O I：

10.1109/ACCESS.2021.3090464

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Network security has become a growing concern within the popularity and development of the Internet. Malicious code is one of the main threats to network security. Different types of malicious code have different functions and cause different harms. Therefore, improving the detection efficiency and recognition accuracy of malicious code is becoming an urgent problem to be solved. While traditional machine learning methods for malicious code detection largely depend on hand-designed features with experts' knowledge of the domain or focus on the images which come from malicious code binary files. These methods spend too much time on feature extraction. With the emergence of a large amount of malicious code data, the efficiency of traditional machine learning algorithms is getting worse and worse. In this paper, a workflow based on deep learning is proposed to detect and classify malicious codes. This workflow adopts a convolutional neural network (CNN) and the regularization algorithms to classify malicious code with N_gram semantic feature as input of the model. The convolutional neural network can automatically extract the features of malicious code while avoiding the need for manual feature selection. Regularization algorithms not only speed up the training process of the deep model but also improve the generalization ability in the case of effective prevention of over-fitting of the model. The proposed method is compared with the state-of-the-art methods and other deep learning models. Experimental results show that our workflow can improve the accuracy and efficiency of malicious code classification.

引用

页码：91512 / 91523

页数：12

共 50 条

[1] Malicious Code Classification Method Based on Deep Forest
Lu, Xi-Dong
Duan, Zhe-Min
Qian, Ye-Kui
Zhou, Wei
[J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): : 1454 - 1464
[2] Detection of Malicious Code Variants Based on Deep Learning
Cui, Zhihua
Xue, Fei
Cai, Xingjuan
Cao, Yang
Wang, Gai-ge
Chen, Jinjun
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3187 - 3196
[3] Malicious Classification Based on Deep Learning and Visualization
Wang Jun-ling
Wang Shuo-hao
[J]. 2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 223 - 228
[4] Android malicious code Classification using Deep Belief Network
Luo Shiqi
Tian Shengwei
Yu Long
Yu Jiong
Sun Hua
[J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (01): : 454 - 475
[5] Malicious Website Detection Through Deep Learning Algorithms
Gutierrez, Norma
Otero, Beatriz
Rodriguez, Eva
Canal, Ramon
[J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT I, 2022, 13163 : 512 - 526
[6] Improved Malicious Code Classification Considering Sequence by Machine Learning
Paik, Incheon
[J]. 18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,
[7] Malicious code clone detection technology based on deep learning
Shen, Yuan
Yan, Hanbing
Xia, Chunhe
Han, Zhihui
[J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (02): : 282 - 290
[8] A deep learning approach for detecting malicious Java']JavaScript code
Wang, Yao
Cai, Wan-dong
Wei, Peng-cheng
[J]. SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (11) : 1520 - 1534
[9] A Hybrid Malicious Code Detection Method based on Deep Learning
Li, Yuancheng
Ma, Rong
Jiao, Runhai
[J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (05): : 205 - 215
[10] Detection Approach of Malicious JavaScript Code Based on deep learning
Zheng, Liyuan
Zhang, Dongcheng
Xie, Xin
Wang, Chen
Hou, Boyuan
[J]. Proceedings of 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence, ICIBA 2023, 2023, : 1075 - 1079

← 1 2 3 4 5 →