Similarity-Based Malware Classification Using Graph Neural Networks

被引:1
|
作者
Chen, Yu-Hung [1 ]
Chen, Jiann-Liang [1 ]
Deng, Ren-Feng [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106335, Taiwan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 21期
关键词
malware families; classification; similarity; graph neural networks; Siamese network; Malware Bazaar dataset;
D O I
10.3390/app122110837
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This work proposes a novel malware identification model that is based on a graph neural network (GNN). The function call relationship and function assembly content obtained by analyzing the malware are used to generate a graph that represents the functional structure of a malware sample. In addition to establishing a multi-classification model for predicting malware family, this work implements a similarity model that is based on Siamese networks, measuring the distance between two samples in the feature space to determine whether they belong to the same malware family. The distance between the samples is gradually adjusted during the training of the model to improve the performance. A Malware Bazaar dataset analysis reveals that the proposed classification model has an accuracy and area under the curve (AUC) of 0.934 and 0.997, respectively. The proposed similarity model has an accuracy and AUC of 0.92 and 0.92, respectively. Further, the proposed similarity model identifies the unseen malware family with approximately 70% accuracy. Hence, the proposed similarity model exhibits better performance and scalability than the pure classification model and previous studies.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Centrality-based and similarity-based neighborhood extension in graph neural networks
    Zohrabi, Mohammadjavad
    Saravani, Saeed
    Chehreghani, Mostafa Haghir
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (16): : 24638 - 24663
  • [2] A comparison of graph neural networks for malware classification
    Malhotra, Vrinda
    Potika, Katerina
    Stamp, Mark
    [J]. JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2024, 20 (01) : 53 - 69
  • [3] A comparison of graph neural networks for malware classification
    Vrinda Malhotra
    Katerina Potika
    Mark Stamp
    [J]. Journal of Computer Virology and Hacking Techniques, 2024, 20 : 53 - 69
  • [4] Similarity-based Heterogeneous Neural Networks
    Belanche Munoz, Lluis A.
    Valdes Ramos, Julio Jose
    [J]. ENGINEERING LETTERS, 2007, 14 (02)
  • [5] Malware Classification Based on Graph Convolutional Neural Networks and Static Call Graph Features
    Mester, Attila
    Bodo, Zalan
    [J]. ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 528 - 539
  • [6] SIMILARITY-BASED CLASSIFICATION IN PARTIALLY LABELED NETWORKS
    Zhang, Qian-Ming
    Shang, Ming-Sheng
    Lue, Linyuan
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2010, 21 (06): : 813 - 824
  • [7] Similarity-based adversarial knowledge distillation using graph convolutional neural network
    Lee, Sungjun
    Kim, Sejun
    Kim, Seong Soo
    Seo, Kisung
    [J]. ELECTRONICS LETTERS, 2022, 58 (16) : 606 - 608
  • [8] Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning
    Hoffmann, Maximilian
    Malburg, Lukas
    Klein, Patrick
    Bergmann, Ralph
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2020, 2020, 12311 : 229 - 244
  • [9] Odor classification using similarity-based representation
    Bicego, M
    [J]. SENSORS AND ACTUATORS B-CHEMICAL, 2005, 110 (02): : 225 - 230
  • [10] Malware Classification using Fusion of Neural Networks
    Lutz, Adam
    Sansing, Victor F., III
    Farag, Waleed
    Ezekiel, Soundararajan
    [J]. DISRUPTIVE TECHNOLOGIES IN INFORMATION SCIENCES II, 2019, 11013