Using regression trees to classify fault-prone software modules

被引:77
|
作者
Khoshgoftaar, TM [1 ]
Allen, EB
Deng, JY
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Empir Software Engn Lab, Boca Raton, FL 33431 USA
[2] Mississippi State Univ, Dept Comp Sci, Mississippi State, MS 39762 USA
[3] Motorola Metrowerks Corp, Austin, TX 78758 USA
关键词
classification; fault-prone modules; regression trees; software metrics; software reliability; S-Plus;
D O I
10.1109/TR.2002.804488
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Software faults are defects in software modules that might cause failures. Software developers tend to focus on faults, because they are closely related to the amount of rework necessary to prevent future operational software failures. The goal of this paper is to predict which modules are fault-prone and to do it early enough in the life cycle to be useful to developers. A regression tree is an algorithm represented by an abstract tree, where the response variable is a real quantity. Software modules are classified as fault-prone or not, by comparing the predicted value to a threshold. A classification rule is proposed that allows one to choose a preferred balance between the two types of misclassification rates. A case study of a very large telecommunications systems considered software modules to be fault-prone if any faults were discovered by customers. Our research shows that classifying fault-prone modules with regression trees and the using the classification rule in this paper, resulted in predictions with satisfactory accuracy and robustness.
引用
收藏
页码:455 / 462
页数:8
相关论文
共 50 条
  • [41] The application of fuzzy enhanced case-based reasoning for identifying fault-prone modules
    Schenker, DF
    Khoshgoftaar, TM
    [J]. THIRD IEEE INTERNATIONAL HIGH-ASSURANCE SYSTEMS ENGINEERING SYMPOSIUM, PROCEEDINGS, 1998, : 90 - 97
  • [42] Prediction of software fault-prone classes using ensemble random forest with adaptive synthetic sampling algorithm
    Balaram, A.
    Vasundra, S.
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2022, 29 (01)
  • [43] Prediction of software fault-prone classes using ensemble random forest with adaptive synthetic sampling algorithm
    A. Balaram
    S. Vasundra
    [J]. Automated Software Engineering, 2022, 29
  • [44] Result Evaluation of Acceptance Testing using Predicting Fault-Prone Module
    Shrivastava, Ananya
    Rajavat, Anand
    [J]. 2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [45] Indentifying Fault-Prone Object in the Web Service
    Kim, Kapsu
    Hong, Myunghui
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
  • [46] Mining Co-location Relationships among Bug Reports to Localize Fault-Prone Modules
    Chen, Ing-Xiang
    Li, Chien-Hung
    Yang, Cheng-Zen
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (05): : 1154 - 1161
  • [47] Fault-Prone Module Prediction Approaches Using Identifiers in Source Code
    Mizuno, Osamu
    Kawashima, Naoki
    Kawamoto, Kimiaki
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (01) : 36 - 49
  • [48] Trusted Computing for Fault-Prone Wireless Networks
    Gilbert, Seth
    Kowalski, Dariusz R.
    [J]. DISTRIBUTED COMPUTING, 2010, 6343 : 359 - +
  • [49] Helping Testers by Fault-Prone Functionality Prediction
    Tabata, Keiichi
    Tanno, Haruto
    Oinuma, Morihide
    [J]. 2015 10TH ASIA-PACIFIC SYMPOSIUM ON INFORMATION AND TELECOMMUNICATION TECHNOLOGIES (APSITT), 2015,
  • [50] Iterative Identification of Fault-Prone Binaries Using In-Process Metrics
    Layman, Lucas
    Kudrjavets, Gunnar
    Nagappan, Nachiappan
    [J]. ESEM'08: PROCEEDINGS OF THE 2008 ACM-IEEE INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT, 2008, : 206 - +