Codeflaws: A Programming Competition Benchmark for Evaluating Automated Program Repair Tools

Cited by: 92
Authors
Tan, Shin Hwei [1]
Yi, Jooyong [2]
Yulis [1]
Mechtaev, Sergey [1]
Roychoudhury, Abhik [1]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Innopolis Univ, Innopolis, Russia
Keywords
automated program repair; defect classes; empirical evaluation; benchmark
DOI
10.1109/ICSE-C.2017.76
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Several automated program repair techniques have been proposed to reduce the time and effort spent in bug-fixing. While these repair tools are designed to be generic such that they could address many software faults, different repair tools may fix certain types of faults more effectively than other tools. Therefore, it is important to compare more objectively the effectiveness of different repair tools on various fault types. However, existing benchmarks for automated program repair do not allow thorough investigation of the relationship between fault types and the effectiveness of repair tools. We present Codeflaws, a set of 3902 defects from 7436 programs automatically classified across 39 defect classes (we refer to different types of faults as defect classes derived from the syntactic differences between a buggy program and a patched program).
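The idea of deriving a defect class from the syntactic difference between a buggy program and its patched version can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the tokenizer, the `classify` function, and the class names are hypothetical and are not the Codeflaws classifier or its 39 defect classes.

```python
import difflib
import re

# Hypothetical operator groups, chosen for illustration only.
RELATIONAL = {"<", "<=", ">", ">=", "==", "!="}
ARITHMETIC = {"+", "-", "*", "/", "%"}

# Very rough C-like tokenizer: identifiers, integers, a few
# two-character operators, then any single non-space character.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|==|!=|<=|>=|&&|\|\||\S")


def tokenize(source: str) -> list[str]:
    return TOKEN_RE.findall(source)


def classify(buggy: str, patched: str) -> str:
    """Return a hypothetical defect-class label from the token-level diff."""
    a, b = tokenize(buggy), tokenize(patched)
    matcher = difflib.SequenceMatcher(None, a, b, autojunk=False)
    edits = [op for op in matcher.get_opcodes() if op[0] != "equal"]
    if not edits:
        return "no-change"
    if len(edits) > 1:
        return "multi-edit"
    tag, i1, i2, j1, j2 = edits[0]
    if tag == "insert":
        return "insertion"
    if tag == "delete":
        return "deletion"
    old, new = a[i1:i2], b[j1:j2]
    if len(old) == 1 and len(new) == 1:
        if old[0] in RELATIONAL and new[0] in RELATIONAL:
            return "relational-operator-replaced"
        if old[0] in ARITHMETIC and new[0] in ARITHMETIC:
            return "arithmetic-operator-replaced"
        if old[0].isdigit() and new[0].isdigit():
            return "constant-replaced"
    return "other-replacement"


if __name__ == "__main__":
    print(classify("if (i < n) x++;", "if (i <= n) x++;"))
    print(classify("a = b + 1;", "a = b - 1;"))
```

A token-level diff is used here only because it is easy to show in a few lines; a classifier working on real programs would more plausibly compare syntax trees.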
Pages: 180-182
Number of pages: 3