Large Language Models for Code Obfuscation: Evaluation of the Obfuscation Capabilities of OpenAI's GPT-3.5 on C Source Code

Cited by: 0

Authors:

Kochberger, Patrick [1,2]

Gramberger, Maximilian [1]

Schrittwieser, Sebastian [2]

Lawitschka, Caroline [2]

Weippl, Edgar R. [3]

Affiliations:

[1] St Polten Univ Appl Sci, Inst IT Secur Res, St Polten, Austria

[2] Univ Vienna, Res Grp Secur & Privacy, Vienna, Austria

[3] SBA Res, Vienna, Austria

Source:

PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, SECRYPT 2023 | 2023

Funding:

Austrian Science Fund (FWF);

Keywords:

Software Protections; Code Obfuscation; Large Language Model; GPT;

DOI:

10.5220/0012167000003555

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This study explores the efficacy of large language models, specifically GPT-3.5, in obfuscating C source code for software protection. We utilized eight distinct obfuscation techniques in tandem with seven representative C code samples to conduct a comprehensive analysis. The evaluation was performed using a Python-based tool we developed, which interfaces with the OpenAI API to access GPT-3.5. Our metrics of evaluation included the correctness and diversity of the obfuscated code, along with the robustness of the resultant protection. While the diversity of the resulting code was found to be commendable, our findings indicate a prevalent issue with the correctness of the obfuscated code and the overall level of protection provided. Consequently, we assert that while promising, the feasibility of deploying large language models for automatic code obfuscation is not yet sufficiently established. This study signifies an important step towards understanding the limitations and potential of AI-based code obfuscation, thereby informing future research in this area.
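The abstract names correctness and diversity as evaluation criteria for the obfuscated code. The following is a minimal Python sketch of how such scores might be computed; it is not the authors' tool. The function names, the prompt wording, the use of `difflib` similarity as a diversity measure, and the output-comparison notion of correctness are all assumptions made for illustration. No API call is made here; the variants would come from whatever model interface is used.

```python
from __future__ import annotations

from difflib import SequenceMatcher
from itertools import combinations


def build_prompt(technique: str, c_source: str) -> str:
    """Compose an obfuscation request (hypothetical wording, not the paper's prompt)."""
    return (
        f"Obfuscate the following C code using {technique}. "
        "Keep the observable behaviour identical and return only C code.\n\n"
        f"{c_source}"
    )


def diversity(variants: list[str]) -> float:
    """Mean pairwise dissimilarity in [0, 1]; 0.0 means all variants are identical.

    Assumption: textual dissimilarity stands in for whatever diversity
    metric the study actually used.
    """
    pairs = list(combinations(variants, 2))
    if not pairs:
        return 0.0
    total = sum(1.0 - SequenceMatcher(None, a, b).ratio() for a, b in pairs)
    return total / len(pairs)


def correctness_rate(expected_output: str, observed_outputs: list[str]) -> float:
    """Share of obfuscated variants whose program output equals the original's.

    Assumption: each variant was compiled and run elsewhere, and only the
    captured outputs are passed in.
    """
    if not observed_outputs:
        return 0.0
    matches = sum(out == expected_output for out in observed_outputs)
    return matches / len(observed_outputs)
```

In use, one would call `build_prompt` once per technique and code sample, collect several model responses for the same prompt, and pass them to `diversity`; `correctness_rate` would be fed the outputs of the compiled variants alongside the output of the original program.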


Pages: 7 / 19

Page count: 13
