HYDRA: A multimodal deep learning framework for malware classification

被引:78
|
作者
Gibert, Daniel [1 ]
Mateu, Carles [1 ]
Planes, Jordi [1 ]
机构
[1] Univ Lleida, Jaume II 69, Lleida, Spain
关键词
Malware classification; Machine learning; Deep learning; Feature fusion; Multimodal learning; ENTROPY;
D O I
10.1016/j.cose.2020.101873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While traditional machine learning methods for malware detection largely depend on hand-designed features, which are based on experts' knowledge of the domain, end-to-end learning approaches take the raw executable as input, and try to learn a set of descriptive features from it. Although the latter might behave badly in problems where there are not many data available or where the dataset is imbalanced. In this paper we present HYDRA, a novel framework to address the task of malware detection and classification by combining various types of features to discover the relationships between distinct modalities. Our approach learns from various sources to maximize the benefits of multiple feature types to reflect the characteristics of malware executables. We propose a baseline system that consists of both hand-engineered and end-to-end components to combine the benefits of feature engineering and deep learning so that malware characteristics are effectively represented. An extensive analysis of state-of-the-art methods on the Microsoft Malware Classification Challenge benchmark shows that the proposed solution achieves comparable results to gradient boosting methods in the literature and higher yield in comparison with deep learning approaches. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Speech Intention Classification with Multimodal Deep Learning
    Gu, Yue
    Li, Xinyu
    Chen, Shuhong
    Zhang, Jianyu
    Marsic, Ivan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2017, 2017, 10233 : 260 - 271
  • [32] A deep semantic framework for multimodal representation learning
    Cheng Wang
    Haojin Yang
    Christoph Meinel
    Multimedia Tools and Applications, 2016, 75 : 9255 - 9276
  • [33] Multimodal deep representation learning for video classification
    Haiman Tian
    Yudong Tao
    Samira Pouyanfar
    Shu-Ching Chen
    Mei-Ling Shyu
    World Wide Web, 2019, 22 : 1325 - 1341
  • [34] Multimodal deep representation learning for video classification
    Tian, Haiman
    Tao, Yudong
    Pouyanfar, Samira
    Chen, Shu-Ching
    Shyu, Mei-Ling
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (03): : 1325 - 1341
  • [35] Malware Classification by Deep Learning Using Characteristics of Hash Functions
    Baba, Takahiro
    Baba, Kensuke
    Yamauchi, Toshihiro
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 2, 2022, 450 : 480 - 491
  • [36] DeepSign: Deep Learning for Automatic Malware Signature Generation and Classification
    David, Omid E.
    Netanyahu, Nathan S.
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [37] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [38] Explainable Deep Learning Models for Dynamic and Online Malware Classification
    Card, Quincy
    Simpson, Daniel
    Aryal, Kshitiz
    Gupta, Maanak
    Islam, Sheikh Rabiul
    2024 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP 2024, 2024, : 182 - 189
  • [39] DeepAM: a heterogeneous deep learning framework for intelligent malware detection
    Yanfang Ye
    Lingwei Chen
    Shifu Hou
    William Hardy
    Xin Li
    Knowledge and Information Systems, 2018, 54 : 265 - 285
  • [40] DeepAM: a heterogeneous deep learning framework for intelligent malware detection
    Ye, Yanfang
    Chen, Lingwei
    Hou, Shifu
    Hardy, William
    Li, Xin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 54 (02) : 265 - 285