Investigating Graph Embedding Methods for Cross-Platform Binary Code Similarity Detection

被引:1
|
作者
Cochard, Victor [1 ]
Pfammatter, Damian [2 ]
Duong, Chi Thang [3 ]
Humbert, Mathias [4 ]
机构
[1] Armasuisse ST, Cyber Def Campus, Lausanne, Switzerland
[2] Armasuisse ST, Cyber Def Campus, Zurich, Switzerland
[3] Ecole Polytech Fed Lausanne, DISL, Lausanne, Switzerland
[4] Univ Lausanne, DESI, Lausanne, Switzerland
关键词
D O I
10.1109/EuroSP53844.2022.00012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
IoT devices are increasingly present, both in the industry and in consumer markets, but their security remains weak, which leads to an unprecedented number of attacks against them. In order to reduce the attack surface, one approach is to analyze the binary code of these devices to early detect whether they contain potential security vulnerabilities. More specifically, knowing some vulnerable function, we can determine whether the firmware of an IoT device contains some security flaw by searching for this function. However, searching for similar vulnerable functions is in general challenging due to the fact that the source code is often not openly available and that it can be compiled for different architectures, using different compilers and compilation settings. In order to handle these varying settings, we can compare the similarity between the graph embeddings derived from the binary functions. In this paper, inspired by the recent advances in deep learning, we propose a new method - GESS (graph embeddings for similarity search) to derive graph embeddings, and we compare it with various state-of-the-art methods. Our empirical evaluation shows that GESS reaches an AUC of 0.979, thereby outperforming the best known approach. Furthermore, for a fixed low false positive rate, GESS provides a true positive rate (or recall) about 36% higher than the best previous approach. Finally, for a large search space, GESS provides a recall between 50% and 60% higher than the best previous approach.
引用
收藏
页码:60 / 73
页数:14
相关论文
共 50 条
  • [1] Cross-platform binary code similarity detection based on NMT and graph embedding
    Zhu, Xiaodong
    Jiang, Liehui
    Chen, Zeng
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 4528 - 4551
  • [2] Neural Network-based Graph Embedding for Cross-Platform Binary Code Similarity Detection
    Xu, Xiaojun
    Liu, Chang
    Feng, Qian
    Yin, Heng
    Song, Le
    Song, Dawn
    [J]. CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 363 - 376
  • [3] Cross-Platform Binary Code Homology Analysis Based on GRU Graph Embedding
    Wang, Shen
    Jiang, Xunzhi
    Yu, Xiangzhan
    Su, Xiaohui
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [4] Double-Layer Positional Encoding Embedding Method for Cross-Platform Binary Function Similarity Detection
    JIANG Xunzhi
    WANG Shen
    YU Xiangzhan
    GONG Yuxin
    [J]. Chinese Journal of Electronics, 2022, 31 (04) : 604 - 611
  • [5] Double-Layer Positional Encoding Embedding Method for Cross-Platform Binary Function Similarity Detection
    Jiang, Xunzhi
    Wang, Shen
    Yu, Xiangzhan
    Gong, Yuxin
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (04) : 604 - 611
  • [6] Asteria: Deep Learning-based AST-Encoding for Cross-platform Binary Code Similarity Detection
    Yang, Shouguo
    Cheng, Long
    Zeng, Yicheng
    Lang, Zhe
    Zhu, Hongsong
    Shi, Zhiqiang
    [J]. 51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2021), 2021, : 224 - 236
  • [7] Fast Cross-Platform Binary Code Similarity Detection Framework Based on CFGs Taking Advantage of NLP and Inductive GNN
    Peng, Jinxue
    Wang, Yong
    Xue, Jingfeng
    Liu, Zhenyan
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (01) : 128 - 138
  • [8] Fast Cross-Platform Binary Code Similarity Detection Framework Based on CFGs Taking Advantage of NLP and Inductive GNN
    Jinxue PENG
    Yong WANG
    Jingfeng XUE
    Zhenyan LIU
    [J]. Chinese Journal of Electronics, 2024, 33 (01) : 128 - 138
  • [9] FuncNet: A Euclidean Embedding Approach for Lightweight Cross-platform Binary Recognition
    Luo, Mengxia
    Yang, Can
    Gong, Xiaorui
    Yu, Lei
    [J]. SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM, PT I, 2019, 304 : 319 - 337
  • [10] A CROSS-PLATFORM BINARY DIFF
    COPPIETERS, K
    [J]. DR DOBBS JOURNAL, 1995, 20 (05): : 32 - &