SROBR: Semantic Representation of Obfuscation-Resilient Binary Code

Cited by: 1
Authors
Tang K. [1]
Shan Z. [1]
Liu F. [1]
Huang Y. [1]
Sun R. [1]
Qiao M. [1]
Zhang C. [1]
Wang J. [1]
Gui H. [1]
Affiliation
[1] State Key Laboratory of Mathematical Engineering and Advanced Computing, China
DOI
10.1155/2022/4095481
Abstract
With the rapid development of information technology, the scale of software has grown exponentially. Binary code similarity detection plays an important role in many fields, such as software plagiarism detection, vulnerability discovery, and the resolution of copyright issues. A variety of approaches to the semantic representation of binary code have been introduced recently, but few keep pace with existing code obfuscation techniques, which are mature and widely developed. To address this problem, we propose a new neural network model, named SROBR, which deeply integrates a natural language processing model with a graph neural network. In SROBR, BERT is first applied to capture the sequence information of the binary code, and GAT is then used to capture its structural information. This combination captures the semantic information of binary programs while resisting obfuscation options more efficiently. On a binary code similarity detection task and an obfuscation option classification task, the experimental results demonstrate that SROBR outperforms existing binary similarity detection methods in resisting obfuscation techniques. © 2022 Ke Tang et al.
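
The two-stage idea described in the abstract (a sequence model followed by graph attention) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes, hypothetically, that a sequence encoder such as BERT has already produced one embedding vector per code block and that the blocks are connected by a graph; the abstract does not specify either detail. The function names (`gat_layer`, `mean_pool`, `cosine`), the attention vectors, and the toy values are illustrative only, and the learned weight matrix of a real GAT layer is omitted for brevity.

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def gat_layer(h, edges, a_src, a_dst, slope=0.2):
    """One single-head graph-attention step (simplified: no weight matrix).

    h      : list of node feature vectors (stand-ins for encoder outputs)
    edges  : set of (i, j) pairs meaning node j is a neighbor of node i
    a_src, a_dst : the two halves of the attention vector (assumed values)
    """
    out = []
    for i in range(len(h)):
        # Each node attends to itself and to its neighbors.
        nbrs = [i] + [j for j in range(len(h)) if j != i and (i, j) in edges]
        # Attention logit: LeakyReLU(a_src . h_i + a_dst . h_j)
        scores = []
        for j in nbrs:
            s = dot(a_src, h[i]) + dot(a_dst, h[j])
            scores.append(s if s > 0 else slope * s)
        # Softmax over the neighborhood (shifted by the max for stability).
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        alphas = [e / z for e in exps]
        # New feature: attention-weighted sum of neighbor features.
        out.append([sum(a * h[j][k] for a, j in zip(alphas, nbrs))
                    for k in range(len(h[i]))])
    return out

def mean_pool(h):
    """Average node features into one graph-level embedding."""
    return [sum(v[k] for v in h) / len(h) for k in range(len(h[0]))]

def cosine(u, v):
    """Cosine similarity between two embeddings."""
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))

if __name__ == "__main__":
    # Toy "block embeddings" for two small graphs (made-up numbers).
    blocks_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    blocks_b = [[0.9, 0.1], [0.1, 0.9], [1.0, 1.0]]
    edges = {(0, 1), (1, 2)}
    a_src, a_dst = [0.5, -0.5], [0.25, 0.75]
    emb_a = mean_pool(gat_layer(blocks_a, edges, a_src, a_dst))
    emb_b = mean_pool(gat_layer(blocks_b, edges, a_src, a_dst))
    print("similarity:", cosine(emb_a, emb_b))
```

In a trained model the attention vectors and a projection matrix would be learned, and the similarity score would be compared against a threshold or used in a ranking; here they are fixed by hand only to show the data flow.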