Large Language Models for Code Obfuscation Evaluation of the Obfuscation Capabilities of OpenAI's GPT-3.5 on C Source Code

被引:0
|
作者
Kochberger, Patrick [1 ,2 ]
Gramberger, Maximilian [1 ]
Schrittwieser, Sebastian [2 ]
Lawitschka, Caroline [2 ]
Weippl, Edgar R. [3 ]
机构
[1] St Polten Univ Appl Sci, Inst IT Secur Res, St Polten, Austria
[2] Univ Vienna, Res Grp Secur & Privacy, Vienna, Austria
[3] SBA Res, Vienna, Austria
基金
奥地利科学基金会;
关键词
Software Protections; Code Obfuscation; Large Language Model; GPT;
D O I
10.5220/0012167000003555
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explores the efficacy of large language models, specifically GPT-3.5, in obfuscating C source code for software protection. We utilized eight distinct obfuscation techniques in tandem with seven representative C code samples to conduct a comprehensive analysis. The evaluation was performed using a Python-based tool we developed, which interfaces with the OpenAI API to access GPT-3.5. Our metrics of evaluation included the correctness and diversity of the obfuscated code, along with the robustness of the resultant protection. While the diversity of the resulting code was found to be commendable, our findings indicate a prevalent issue with the correctness of the obfuscated code and the overall level of protection provided. Consequently, we assert that while promising, the feasibility of deploying large language models for automatic code obfuscation is not yet sufficiently established. This study signifies an important step towards understanding the limitations and potential of AI-based code obfuscation, thereby informing future research in this area.
引用
收藏
页码:7 / 19
页数:13
相关论文
共 39 条
  • [31] Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation
    Jin, Kailun
    Wang, Chung-Yu
    Hung Viet Pham
    Hemmati, Hadi
    2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2024, : 167 - 171
  • [32] PromSec: Prompt Optimization for Secure Generation of Functional Source Code with Large Language Models (LLMs)
    Nazzal, Mahmoud
    Khalil, Issa
    Khreishah, Abdallah
    Phan, NhatHai
    CCS 2024 - Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security, : 2266 - 2279
  • [33] Evaluating large language models for surgical chart review of second stage implant-based breast reconstruction: a comparative analysis of manual review, GPT-3.5 Turbo, and GPT-4 Turbo
    Lakhlani, Devi
    Dadhania, Dhruv
    Nazerali, Rahim
    EUROPEAN JOURNAL OF PLASTIC SURGERY, 2025, 48 (01)
  • [34] Do Natural Language Processing models understand simulations? Application of GPT-3 to translate simulation source code to English
    Jackson, Ilya
    Rolf, Benjamin
    IFAC PAPERSONLINE, 2023, 56 (02): : 221 - 226
  • [35] Towards Predicting Source Code Changes Based on Natural Language Processing Models: An Empirical Evaluation
    Kaibe, Yuto
    Okamura, Hiroyuki
    Dohi, Tadashi
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW, 2023, : 108 - 111
  • [36] Human-Comparable Sensitivity of Large Language Models inIdenti fying Eligible Studies Through Title and Abstract Screening:3-Layer Strategy Using GPT-3.5 and GPT-4 for Systematic Reviews
    Matsui, Kentaro
    Utsumi, Tomohiro
    Aoki, Yumi
    Maruki, Taku
    Takeshima, Masahiro
    Takaesu, Yoshikazu
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [37] Finding Equivalent OpenMP Fortran and C/C plus plus Code Snippets Using Large Language Models
    Sekender, Naveed
    Lin, Pei-Hung
    Liao, Chunhua
    ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 143 - 160
  • [38] Evaluating Large Language Models for Real-World Vulnerability Repair in C/C plus plus Code
    Zhang, Lan
    Zou, Qingtian
    Singhal, Anoop
    Sun, Xiaoyan
    Liu, Peng
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS, IWSPA 2024, 2024, : 49 - 58
  • [39] Benchmarking Open-Source Large Language Models on Code-Switched Tagalog-English Retrieval Augmented Generation
    Adoptante, Aunhel John M.
    Castro, Jasper Adrian Dwight, V
    Medrana, Micholo Lanz B.
    Ocampo, Alyssa Patricia B.
    Peramo, Elmer C.
    Miranda, Melissa Ruth M.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2025, 16 (02) : 233 - 242