A Novel Solutions for Malicious Code Detection and Family Clustering Based on Machine Learning

被引:19
|
作者
Yang, Hangfeng [1 ]
Li, Shudong [1 ]
Wu, Xiaobo [2 ]
Lu, Hui [1 ]
Han, Weihong [1 ]
机构
[1] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Sch Comp Sci & Cyber Engn, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Malware; ensemble model; malware classification; family clustering; t-SNE; KEY MANAGEMENT SCHEME; INTERNET;
D O I
10.1109/ACCESS.2019.2946482
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malware has become a major threat to cyberspace security, not only because of the increasing complexity of malware itself, but also because of the continuously created and produced malicious code. In this paper, we propose two novel methods to solve the malware identification problem. One is to solve to malware classification. Different from traditional machine learning, our method introduces the ensemble models to solve the malware classification problem. The other is to solve malware family clustering. Different from the classic malware family clustering algorithm, our method introduces the t-SNE algorithm to visualize the feature data and then determines the number of malware families. The two proposed novel methods have been extensively tested on a large number of real-world malware samples. The results show that the first one is far superior to the existed individual models and the second one has a good adaptation ability. Our methods can be used for malicious code classification and family clustering, also with higher accuracy.
引用
收藏
页码:148853 / 148860
页数:8
相关论文
共 50 条
  • [1] Research and Implementation of Kernel Malicious Code Detection Based on Machine Learning
    Tian D.-H.
    Wei H.
    Zhang B.
    Yu Y.-L.
    Li J.-S.
    Ma R.
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2020, 40 (12): : 1295 - 1301
  • [2] Malicious URL Detection based on Machine Learning
    Cho Do Xuan
    Hoa Dinh Nguyen
    Nikolaevich, Tisenko Victor
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 148 - 153
  • [3] Detection of Malicious Code Variants Based on Deep Learning
    Cui, Zhihua
    Xue, Fei
    Cai, Xingjuan
    Cao, Yang
    Wang, Gai-ge
    Chen, Jinjun
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3187 - 3196
  • [4] Applying machine learning techniques for detection of malicious code in network traffic
    Elovici, Yuval
    Shabtai, Asaf
    Moskovitch, Robert
    Tahan, Gil
    Glezer, Chanan
    [J]. KI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4667 : 44 - +
  • [5] Malicious code clone detection technology based on deep learning
    Shen Y.
    Yan H.
    Xia C.
    Han Z.
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (02): : 282 - 290
  • [6] A Hybrid Malicious Code Detection Method based on Deep Learning
    Li, Yuancheng
    Ma, Rong
    Jiao, Runhai
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (05): : 205 - 215
  • [7] A Malicious Code Detection Method Based on Ensemble Learning of Behavior
    Xu X.-B.
    Zhang W.-B.
    He C.
    Luo Y.
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (04): : 89 - 95
  • [8] Detection Approach of Malicious JavaScript Code Based on deep learning
    Zheng, Liyuan
    Zhang, Dongcheng
    Xie, Xin
    Wang, Chen
    Hou, Boyuan
    [J]. Proceedings of 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence, ICIBA 2023, 2023, : 1075 - 1079
  • [9] Android malicious code detection and recognition based on depth learning
    Jing, Yang
    [J]. PROCEEDINGS OF THE 2017 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTER (MACMC 2017), 2017, 150 : 179 - 183
  • [10] Machine Learning for Implanted Malicious Code Detection with Incompletely Specified System Implementations
    Hsu, Yating
    Lee, David
    [J]. 2011 19TH IEEE INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2011,