Large Language Models for Code Obfuscation: Evaluation of the Obfuscation Capabilities of OpenAI's GPT-3.5 on C Source Code

Authors
Kochberger, Patrick [1, 2]
Gramberger, Maximilian [1]
Schrittwieser, Sebastian [2]
Lawitschka, Caroline [2]
Weippl, Edgar R. [3]
Affiliations
[1] St Polten Univ Appl Sci, Inst IT Secur Res, St Polten, Austria
[2] Univ Vienna, Res Grp Secur & Privacy, Vienna, Austria
[3] SBA Res, Vienna, Austria
Funding
Austrian Science Fund
Keywords
Software Protections; Code Obfuscation; Large Language Model; GPT
DOI
10.5220/0012167000003555
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This study explores the efficacy of large language models, specifically GPT-3.5, in obfuscating C source code for software protection. We utilized eight distinct obfuscation techniques in tandem with seven representative C code samples to conduct a comprehensive analysis. The evaluation was performed using a Python-based tool we developed, which interfaces with the OpenAI API to access GPT-3.5. Our metrics of evaluation included the correctness and diversity of the obfuscated code, along with the robustness of the resultant protection. While the diversity of the resulting code was found to be commendable, our findings indicate a prevalent issue with the correctness of the obfuscated code and the overall level of protection provided. Consequently, we assert that while promising, the feasibility of deploying large language models for automatic code obfuscation is not yet sufficiently established. This study signifies an important step towards understanding the limitations and potential of AI-based code obfuscation, thereby informing future research in this area.
Pages: 7-19
Page count: 13