Cross-architecture Binary Function Similarity Detection based on Composite Feature Model

Cited by: 0
Authors
Li, Xiaonan [1, 2]
Zhang, Guimin [1]
Li, Qingbao [1]
Zhang, Ping [1]
Chen, Zhifeng [1]
Liu, Jinjin [2]
Yue, Shudan [1]
Affiliations
[1] Information Engineering University, Zhengzhou 450001, Henan, People's Republic of China
[2] Zhongyuan University of Technology, School of Computer Science, Zhengzhou 450007, Henan, People's Republic of China
Keywords
Binary Similarity; Composite Feature Model; Cross-Architecture; Graph Embedding Network; Vulnerability Detection
DOI
10.3837/tiis.2023.08.008
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Recent studies have shown that neural network-based binary code similarity detection performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on a composite feature model (SDCFM). First, the binary function is converted into a vector representation according to the proposed composite feature model, which combines instruction statistical features, control flow graph structural features, and application program interface (API) calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network, which is based on a graph neural network; in this network, block-level features and function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, the method uses the embeddings of predecessor nodes to modify each node embedding during the iterative update process of the graph neural network. To assess the effectiveness of the composite feature model, SDCFM is compared with the state-of-the-art method on benchmark datasets. The experimental results show that SDCFM performs well both in terms of area under the curve (AUC) in the binary function similarity detection task and in the ranking of vulnerable candidate functions in the vulnerability search task.
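The embedding step described in the abstract can be illustrated with a minimal sketch. This is not the authors' SDCFM implementation: the feature layout, the weight matrices `W1`, `W2`, `W3`, the `tanh` update, the sum pooling, and the fusion by concatenation are all assumptions chosen for illustration. It only shows the general idea of updating each basic-block embedding from its predecessors' embeddings, pooling to a graph vector, fusing with function-level features, and comparing two functions by cosine similarity.

```python
import math
import random

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def embed_function(block_feats, preds, func_feats, W1, W2, W3, rounds=3):
    """Hypothetical hierarchical embedding (illustrative, not SDCFM itself).

    block_feats: one feature vector per basic block (assumed instruction statistics)
    preds:       preds[v] lists the indices of the predecessor blocks of block v
    func_feats:  function-level feature vector (assumed API-call features)
    """
    dim = len(W1)
    mu = [[0.0] * dim for _ in block_feats]
    for _ in range(rounds):
        new_mu = []
        for v, x in enumerate(block_feats):
            # Aggregate the current embeddings of the predecessor blocks.
            agg = [0.0] * dim
            for u in preds[v]:
                agg = [a + m for a, m in zip(agg, mu[u])]
            px, pa = matvec(W1, x), matvec(W2, agg)
            new_mu.append([math.tanh(a + b) for a, b in zip(px, pa)])
        mu = new_mu
    # Sum-pool block embeddings, then fuse with function-level features.
    graph_vec = [sum(col) for col in zip(*mu)]
    return graph_vec + matvec(W3, func_feats)

def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

rng = random.Random(0)

def rand_matrix(rows, cols):
    """Random weights standing in for trained parameters."""
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

W1, W2, W3 = rand_matrix(4, 3), rand_matrix(4, 4), rand_matrix(2, 2)

# Toy control flow graph: block 0 -> block 1 -> block 2, and block 0 -> block 2.
blocks = [[2.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 3.0, 1.0]]
preds = [[], [0], [0, 1]]
func = [1.0, 2.0]

emb = embed_function(blocks, preds, func, W1, W2, W3)
print(len(emb), cosine(emb, emb))
```

In practice the weights would be learned, for example with a Siamese objective over pairs of functions compiled for different architectures, rather than sampled at random as here.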
Pages: 2101-2123
Page count: 23