Using regression trees to classify fault-prone software modules

被引:77
|
作者
Khoshgoftaar, TM [1 ]
Allen, EB
Deng, JY
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Empir Software Engn Lab, Boca Raton, FL 33431 USA
[2] Mississippi State Univ, Dept Comp Sci, Mississippi State, MS 39762 USA
[3] Motorola Metrowerks Corp, Austin, TX 78758 USA
关键词
classification; fault-prone modules; regression trees; software metrics; software reliability; S-Plus;
D O I
10.1109/TR.2002.804488
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Software faults are defects in software modules that might cause failures. Software developers tend to focus on faults, because they are closely related to the amount of rework necessary to prevent future operational software failures. The goal of this paper is to predict which modules are fault-prone and to do it early enough in the life cycle to be useful to developers. A regression tree is an algorithm represented by an abstract tree, where the response variable is a real quantity. Software modules are classified as fault-prone or not, by comparing the predicted value to a threshold. A classification rule is proposed that allows one to choose a preferred balance between the two types of misclassification rates. A case study of a very large telecommunications systems considered software modules to be fault-prone if any faults were discovered by customers. Our research shows that classifying fault-prone modules with regression trees and the using the classification rule in this paper, resulted in predictions with satisfactory accuracy and robustness.
引用
收藏
页码:455 / 462
页数:8
相关论文
共 50 条
  • [1] Ordering Fault-Prone Software Modules
    Taghi M. Khoshgoftaar
    Edward B. Allen
    [J]. Software Quality Journal, 2003, 11 : 19 - 37
  • [2] Ordering fault-prone software modules
    Khoshgoftaar, TM
    Allen, EB
    [J]. SOFTWARE QUALITY JOURNAL, 2003, 11 (01) : 19 - 37
  • [3] Uncertain classification of fault-prone software modules
    Khoshgoftaar T.M.
    Yuan X.
    Allen E.B.
    Jones W.D.
    Hudepohl J.P.
    [J]. Empirical Software Engineering, 2002, 7 (4) : 297 - 318
  • [4] Benefits of principal components analysis with classification trees of fault-prone software modules
    Khoshgoftaar, TM
    Allen, EB
    Shan, RQ
    [J]. 6TH ISSAT INTERNATIONAL CONFERENCE ON RELIABILITY AND QUALITY IN DESIGN, PROCEEDINGS, 2000, : 60 - 64
  • [5] Using product, process, and execution metrics to predict fault-prone software modules with classification trees
    Khoshgoftaar, TM
    Shan, RQ
    Allen, EB
    [J]. FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING, PROCEEDINGS, 2000, : 301 - 310
  • [6] Predicting the order of fault-prone modules in legacy software
    Khosgoftaar, TM
    Allen, EB
    [J]. NINTH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 1998, : 344 - 353
  • [7] Prediction of fault-prone software modules using a generic text discriminator
    Mizuno, Osamu
    Kikuno, Tohru
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (04): : 888 - 896
  • [8] Experience in Predicting Fault-Prone Software Modules Using Complexity Metrics
    Yu, Liguo
    Mishra, Alok
    [J]. QUALITY TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2012, 9 (04): : 421 - 433
  • [9] Predicting fault-prone software modules in telephone switches
    Ohlsson, N
    Alberg, H
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (12) : 886 - 894
  • [10] Predicting Fault-Prone Software Modules with Rank Sum Classification
    Cahill, Jaspar
    Hogan, James M.
    Thomas, Richard
    [J]. 2013 22ND AUSTRALASIAN CONFERENCE ON SOFTWARE ENGINEERING (ASWEC), 2013, : 211 - 219