Towards Minimal Edits in Automated Program Repair: A Hybrid Framework Integrating Graph Neural Networks and Large Language Models

被引:0
|
作者
Xu, Zhenyu [1 ]
Sheng, Victor S. [1 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
关键词
Automated Program Repair; Graph Neural Networks; Large Language Models;
D O I
10.1007/978-3-031-72344-5_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models trained on Code (LLMCs) have been shown to be effective in Automated Program Repair (APR) tasks, introducing new innovations to the field. Typically, LLMCs do not engage in error localization for APR tasks, instead treating APR more as a code refinement task. This approach often results in larger edit distances, altering the program's original structure. The principle of making minimal edits is crucial in certain scenarios, such as when correcting student programming assignments or software group development, where it's better to preserve the original intent of the code with as few changes as possible. To address these challenges, we introduce a hybrid framework for automated program repair that combines graph neural networks and large language models, which we refer to as HFRepair. HFRepair leverages the precise error localization capability of DrRepair for C programs, combining it with LLMCs to perform the line-level APR task based on the code context, aiming for minimal edits. Our experimental results demonstrate that HFRepair significantly outperforms previous state-of-the-art methods in benchmark tests. For instance, on the DeepFix dataset, HFRepair improves the full repair rate from 67.9% and 71.4% (achieved by DrRepair and BIFI, respectively) to 82.2%, while reducing average edit distance from 33.4 and 27.7 to 11.6.
引用
收藏
页码:402 / 416
页数:15
相关论文
共 13 条
  • [1] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
  • [2] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    SPLASH Companion 2023 - Companion Proceedings of the 2023 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, 2023, : 7 - 9
  • [3] Hybrid-LLM-GNN: integrating large language models and graph neural networks for enhanced materials property prediction
    Li, Youjia
    Gupta, Vishu
    Kilic, Muhammed Nur Talha
    Choudhary, Kamal
    Wines, Daniel
    Liao, Wei-keng
    Choudhary, Alok
    Agrawal, Ankit
    DIGITAL DISCOVERY, 2025, 4 (02): : 376 - 383
  • [4] Leveraging Large Language Models for Automated Program Repair in Programming Education
    Murali, Pavithra Sripathanallur
    XRDS: Crossroads, 2025, 31 (02): : 58 - 60
  • [5] Automated Program Repair in the Era of Large Pre-trained Language Models
    Xia, Chunqiu Steven
    Wei, Yuxiang
    Zhang, Lingming
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 1482 - 1494
  • [6] Accelerating Neural Networks for Large Language Models and Graph Processing with Silicon Photonics
    Afifi, Salma
    Sunny, Febin
    Nikdast, Mandi
    Pasricha, Sudeep
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [7] Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair
    Wei, Yuxiang
    Xia, Chunqiu Steven
    Zhang, Lingming
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 172 - 184
  • [8] An Empirical Study on Fine-tuning Large Language Models of Code for Automated Program Repair
    Huang, Kai
    Meng, Xiangxin
    Zhang, Jian
    Liu, Yang
    Wang, Wenjie
    Li, Shuhao
    Zhang, Yuqing
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1162 - 1174
  • [9] Multimodal prediction of student performance: A fusion of signed graph neural networks and large language models
    Wang, Sijie
    Ni, Lin
    Zhang, Zeyu
    Li, Xiaoxuan
    Zheng, Xianda
    Liu, Jiamou
    PATTERN RECOGNITION LETTERS, 2024, 181 : 1 - 8
  • [10] A Novel Approach for Automated Program Repair using Round-Trip Translation with Large Language Models
    Ruiz, Fernando Vallecillos
    Grishina, Anastasiia
    Hort, Max
    Moonen, Leon
    arXiv,