Cross-architecture Binary Function Similarity Detection based on Composite Feature Model

被引:0
|
作者
Li, Xiaonan [1 ,2 ]
Zhang, Guimin [1 ]
Li, Qingbao [1 ]
Zhang, Ping [1 ]
Chen, Zhifeng [1 ]
Liu, Jinjin [2 ]
Yue, Shudan [1 ]
机构
[1] Informat Engn Univ, Zhengzhou 450001, Henan, Peoples R China
[2] Zhongyuan Univ Technol, Sch Comp Sci, Zhengzhou 450007, Henan, Peoples R China
关键词
Binary Similarity; Composite Feature Model; Cross-Architecture; Graph Embedding Network; Vulnerability Detection;
D O I
10.3837/tiis.2023.08.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent studies have shown that the neural network-based binary code similarity detection technology performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on composite feature model (SDCFM). Firstly, the binary function is converted into vector representation according to the proposed composite feature model, which is composed of instruction statistical features, control flow graph structural features, and application program interface calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network based on a graph neural network. In which, the block-level features and the function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, our method utilizes the embeddings of predecessor nodes to modify the node embedding in the iterative updating process of the graph neural network. To assess the effectiveness of composite feature model, we contrast SDCFM with the state of art method on benchmark datasets. The experimental results show that SDCFM has good performance both on the area under the curve in the binary function similarity detection task and the vulnerable candidate function ranking in vulnerability search task.
引用
收藏
页码:2101 / 2123
页数:23
相关论文
共 50 条
  • [41] Double-Layer Positional Encoding Embedding Method for Cross-Platform Binary Function Similarity Detection
    Jiang, Xunzhi
    Wang, Shen
    Yu, Xiangzhan
    Gong, Yuxin
    CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (04) : 604 - 611
  • [42] Similarity Based Feature Transformation for Network Anomaly Detection
    Nagaraja, Arun
    Boregowda, Uma
    Khatatneh, Khalaf
    Vangipuram, Radhakrishna
    Rajasekhar, Nuvvusetty
    Kiran, V. Sravan
    IEEE ACCESS, 2020, 8 (08): : 39184 - 39196
  • [43] αDiff: Cross-Version Binary Code Similarity Detection with DNN
    Liu, Bingchang
    Huo, Wei
    Zhang, Chao
    Li, Wenchao
    Li, Feng
    Piao, Aihua
    Zou, Wei
    PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, : 667 - 678
  • [44] A binary ABC algorithm based on advanced similarity scheme for feature selection
    Hancer, Emrah
    Xue, Bing
    Karaboga, Dervis
    Zhang, Mengjie
    APPLIED SOFT COMPUTING, 2015, 36 : 334 - 348
  • [45] Neural Network-based Graph Embedding for Cross-Platform Binary Code Similarity Detection
    Xu, Xiaojun
    Liu, Chang
    Feng, Qian
    Yin, Heng
    Song, Le
    Song, Dawn
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 363 - 376
  • [46] Attention-Based Multidimensional Deep Learning Approach for Cross-Architecture IoMT Malware Detection and Classification in Healthcare Cyber-Physical Systems
    Ravi, Vinayakumar
    Pham, Tuan D.
    Alazab, Mamoun
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (04) : 1597 - 1606
  • [47] Multi-semantic feature fusion attention network for binary code similarity detection
    Bangling Li
    Yuting Zhang
    Huaxi Peng
    Qiguang Fan
    Shen He
    Yan Zhang
    Songquan Shi
    Yang Zhang
    Ailiang Ma
    Scientific Reports, 13
  • [48] Multi-semantic feature fusion attention network for binary code similarity detection
    Li, Bangling
    Zhang, Yuting
    Peng, Huaxi
    Fan, Qiguang
    He, Shen
    Zhang, Yan
    Shi, Songquan
    Zhang, Yang
    Ma, Ailiang
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] Detection of electrocardiographic changes in partial epileptic patients using local binary pattern based composite feature
    T. Sunil Kumar
    Vivek Kanhangad
    Australasian Physical & Engineering Sciences in Medicine, 2018, 41 : 209 - 216
  • [50] Detection of electrocardiographic changes in partial epileptic patients using local binary pattern based composite feature
    Kumar, T. Sunil
    Kanhangad, Vivek
    AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2018, 41 (01) : 209 - 216