Improving Software Quality Prediction by Noise Filtering Techniques

Cited by: 2
Authors
Taghi M. Khoshgoftaar
Pierre Rebours
Affiliation
Empirical Software Engineering Laboratory, Department of Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL, U.S.A.
Keywords
noise filtering; data quality; software quality classification; expected cost of misclassification; voting expert
DOI
Not available
Chinese Library Classification (CLC) number
TP311.52; TN911.4 [Noise and interference]
Discipline classification codes
081002; 081202; 0835
Abstract
The accuracy of machine learners is affected by the quality of the data on which the learners are induced. In this paper, the quality of the training dataset is improved by removing instances detected as noisy by the Partitioning Filter. The fit dataset is first split into subsets, and different base learners are induced on each of these splits. The predictions are combined in such a way that an instance is identified as noisy if it is misclassified by a certain number of base learners. Two versions of the Partitioning Filter are used: the Multiple-Partitioning Filter and the Iterative-Partitioning Filter. The number of instances removed by the filters is tuned by the voting scheme of the filter and the number of iterations. The primary aim of this study is to compare the predictive performance of the final models built on the filtered and the unfiltered training datasets. A case study of software measurement data from a high-assurance software project is performed. It is shown that the predictive performance of models built on the filtered fit datasets and evaluated on a noisy test dataset is generally better than that of models built on the noisy (unfiltered) fit dataset. However, predictive performance based on certain aggressive filters is affected by the presence of noise in the evaluation dataset.
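The filtering idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the nearest-centroid base learner, the stride-based partitioning, the function names, the toy data, and the stopping rule of the iterative variant are all assumptions made for illustration. The abstract also mentions different base learners per split, whereas this sketch uses a single learner type for brevity.

```python
from collections import Counter


def fit_centroids(rows):
    """Stand-in base learner (assumption): one mean feature vector per class."""
    sums, counts = {}, Counter()
    for features, label in rows:
        counts[label] += 1
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def predict(model, features):
    """Return the class whose centroid is closest (squared Euclidean distance)."""
    def dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))
    return min(model, key=lambda lab: dist(model[lab]))


def partitioning_filter(data, n_partitions=3, min_votes=None):
    """Return indices of instances misclassified by at least min_votes learners.

    min_votes == n_partitions is a consensus scheme (the default here);
    a smaller value gives a more aggressive filter.
    """
    if min_votes is None:
        min_votes = n_partitions
    # Hypothetical partitioning: every n-th instance goes to the same split.
    parts = [data[i::n_partitions] for i in range(n_partitions)]
    models = [fit_centroids(part) for part in parts]
    noisy = []
    for idx, (features, label) in enumerate(data):
        wrong = sum(predict(m, features) != label for m in models)
        if wrong >= min_votes:
            noisy.append(idx)
    return noisy


def iterative_partitioning_filter(data, n_partitions=3, min_votes=None,
                                  max_iterations=5):
    """Repeat the filter on the surviving instances (assumed stopping rule:
    stop when nothing is flagged or max_iterations is reached)."""
    kept = list(data)
    for _ in range(max_iterations):
        noisy = set(partitioning_filter(kept, n_partitions, min_votes))
        if not noisy:
            break
        kept = [row for i, row in enumerate(kept) if i not in noisy]
    return kept


# Toy data: "nfp" (not fault-prone) near the origin, "fp" (fault-prone)
# near (10, 10), plus one deliberately mislabeled instance at the end.
data = []
for k in range(6):
    data.append(((float(k % 3), float(k % 2)), "nfp"))
    data.append(((10.0 + k % 3, 10.0 + k % 2), "fp"))
data.append(((1.0, 1.0), "fp"))  # mislabeled on purpose

print(partitioning_filter(data))                 # [12]
print(len(iterative_partitioning_filter(data)))  # 12
```

Lowering `min_votes` makes the filter remove more instances, which corresponds to the tuning by voting scheme that the abstract mentions; the iteration count plays the same role in the iterative variant.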
Pages: 387-396
Page count: 10
Related papers
50 in total
  • [1] Improving software quality prediction by noise filtering techniques
    Khoshgoftaar, Taghi M.
    Rebours, Pierre
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (03) : 387 - 396
  • [2] Improving Software Quality Prediction by Noise Filtering Techniques
    Taghi M. Khoshgoftaar
    Pierre Rebours
    [J]. Journal of Computer Science and Technology, 2007, 22 : 387 - 396
  • [3] Simulated annealing for improving software quality prediction
    Bouktif, Salah
    Sahraoui, Houari
    Antoniol, Giuliano
    [J]. GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1893 - +
  • [4] Software Quality Prediction Techniques: A Comparative Analysis
    Shafi, Sana
    Hassan, Syed Muhammad
    Arshaq, Afsah
    Khan, Malik Jahan
    Shamail, Shafay
    [J]. 2008 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2008, : 242 - 246
  • [5] Evaluating noise elimination techniques for software quality estimation
    Khoshgoftaar, Taghi M.
    Rebours, Pierre
    [J]. INTELLIGENT DATA ANALYSIS, 2005, 9 (05) : 487 - 508
  • [6] Refactoring Techniques for Improving Software Quality: Practitioners' Perspectives
    Almogahed, Abdullah
    Omar, Mazni
    [J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2021, 20 (04) : 511 - 539
  • [7] Application of Computational Linguistics Techniques for Improving Software Quality
    Boudeffa, Amin
    Abherve, Antonin
    Bagnato, Alessandra
    Thomas, Cedric
    Hamant, Martin
    Montasser, Assad
    [J]. PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2019, 2019, 11915 : 577 - 582
  • [8] Improving software quality using statistical testing techniques
    Kelly, DP
    Oshana, RS
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2000, 42 (12) : 801 - 807
  • [9] Improving Query Suggestion through Noise Filtering and Query Length Prediction
    Wu, Liang
    Cao, Bin
    Zhou, Yuanchun
    Li, Jianhui
    [J]. WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 399 - 400
  • [10] Software quality prediction using data mining techniques
    Merzah, Bayadaa M.
    [J]. 2019 International Conference on Information and Communications Technology, ICOIACT 2019, 2019, : 394 - 397