GraphBinMatch: Graph-based Similarity Learning for Cross-Language Binary and Source Code Matching

被引:0
|
作者
TehraniJamsaz, Ali [1 ]
Chen, Hanze [1 ]
Jannesari, Ali [1 ]
机构
[1] Iowa State Univ, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
cross-language; code similarity; binary-source matching;
D O I
10.1109/IPDPSW63119.2024.00103
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Matching binary to source code and vice versa has various applications in different fields, such as computer security, software engineering, and reverse engineering. Even though there exist methods that try to match source code with binary code to accelerate the reverse engineering process, most of them arc designed to focus on one programming language. However, in real life, programs are developed using different programming languages depending on their requirements. Thus, cross-language binary-to-source code matching has recently gained more attention. Nonetheless, the existing approaches still stniggle to have precise predictions due to the inherent difficulties when the problem of matching binary code and source code needs to be addressed across programming languages. In this paper, we address the problem of cross-language binary source code matching. We propose GraphBinMatch, an approach based on a graph neural network that learns the similarity between binary and source codes. We evaluate GraphBinMatch on several tasks, such as cross-language binary-to-source code matching and cross-language source-to-source matching We also evaluate the performance of our approach on single-language binary-to-source code matching. Experimental results show that GraphBinMatch significantly outperforms stale-of-the-art, with improvements as high as 15% over the Fl score.
引用
下载
收藏
页码:506 / 515
页数:10
相关论文
共 50 条
  • [21] BCD: Decomposing Binary Code Into Components Using Graph-Based Clustering
    Karande, Vishal
    Chandra, Swarup
    Lin, Zhiqiang
    Caballero, Juan
    Khan, Latifur
    Hamlen, Kevin W.
    PROCEEDINGS OF THE 2018 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS'18), 2018, : 393 - 398
  • [22] Contrastive Learning for Graph-Based Vessel Trajectory Similarity Computation
    Luo, Sizhe
    Zeng, Weiming
    Sun, Bowen
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (09)
  • [23] Graph-based Deep Learning in Natural Language Processing
    Vashishth, Shikhar
    Yadati, Naganand
    Talukdar, Partha
    PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 371 - 372
  • [24] Graph-based code semantics learning for efficient semantic code clone detection
    Yu, Dongjin
    Yang, Quanxin
    Chen, Xin
    Chen, Jie
    Xu, Yihang
    INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 156
  • [25] Neural Network-based Graph Embedding for Cross-Platform Binary Code Similarity Detection
    Xu, Xiaojun
    Liu, Chang
    Feng, Qian
    Yin, Heng
    Song, Le
    Song, Dawn
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 363 - 376
  • [26] Hierarchical Learning of Cross-Language Mappings through Distributed Vector Representations for Code
    Bui, Nghi D. Q.
    Jiang, Lingxiao
    2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING TECHNOLOGIES RESULTS (ICSE-NIER), 2018, : 33 - 36
  • [27] Cross-language Citation Recommendation via Hierarchical Representation Learning on Heterogeneous Graph
    Jiang, Zhuoren
    Yin, Yue
    Gao, Liangcai
    Lu, Yao
    Liu, Xiaozhong
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 635 - 644
  • [28] TCCCD: Triplet-Based Cross-Language Code Clone Detection
    Fang, Yong
    Zhou, Fangzheng
    Xu, Yijia
    Liu, Zhonglin
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [29] Detection of Software Security Weaknesses Using Cross-Language Source Code Representation (CLaSCoRe)
    Zaharia, Sergiu
    Rebedea, Traian
    Trausan-Matu, Stefan
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [30] Position Distribution Matters: A Graph-Based Binary Function Similarity Analysis Method
    Pan, Zulie
    Wang, Taiyan
    Yu, Lu
    Yan, Yintong
    ELECTRONICS, 2022, 11 (15)