Software Defect Prediction using Propositionalization based Data Preprocessing: An Empirical Study

被引:0
|
作者
Pak, CholMyong [1 ,2 ]
Wang, Tian Tian [3 ]
Su, Xiao Hong [3 ]
机构
[1] Harbin Inst Technol, Harbin, Heilongjiang, Peoples R China
[2] Kim Il Sung Univ, Pyongyang, North Korea
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
software defect prediction; data preprocessing; propostionalization; classifier; METRICS;
D O I
10.1109/ICDSBA.2018.00021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data preprocessing can be used to improve classifier performance in classification problems. Software defect prediction is also one of classification problems, so it is needed to use data preprocessing for improving the performance of model. In this paper, we study about the software defect prediction using propositonalization based data preprocessing method. We proposed propositionalization using decision tree as data preprocessing method and made experiments by using common classifiers over 17 datasets from the PROMISE repository. We also used paired t-test to compare propositionalization using decision tree with attribute subset selection and principal component analysis. Results showed that Propostionalization using decision tree improved the performance of software defect prediction significantly and it was more effective than attribute subset selection and principal component analysis. There were no statistically significant differences between top 5 classifiers.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 50 条
  • [21] An Empirical Study on Data Sampling Methods in Addressing Class Imbalance Problem in Software Defect Prediction
    Odejide, Babajide J.
    Bajeh, Amos O.
    Balogun, Abdullateef O.
    Alanamu, Zubair O.
    Adewole, Kayode S.
    Akintola, Abimbola G.
    Salihu, Shakirat A.
    Usman-Hamza, Fatima E.
    Mojeed, Hammed A.
    [J]. SOFTWARE ENGINEERING PERSPECTIVES IN SYSTEMS, VOL. 1, 2022, 501 : 594 - 610
  • [22] Towards graph-anonymization of software analytics data: empirical study on JIT defect prediction
    Malik, Akshat
    Adams, Bram
    Hassan, Ahmed
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (04)
  • [23] A novel preprocessing approach for imbalanced learning in software defect prediction
    Bashir, Kamal
    Li, Tianrui
    Yohannese, Chubato Wondaferaw
    Yahaya, Mahama
    Ali, Tayseer
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 500 - 508
  • [24] An Improved Software Defect Prediction Algorithm Using Self-organizing Maps Combined with Hierarchical Clustering and Data Preprocessing
    Shakhovska, Natalya
    Yakovyna, Vitaliy
    Kryvinska, Natalia
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT I, 2020, 12391 : 414 - 424
  • [25] Software Defect Prediction Based on Stability Test Data
    Okumoto, Kazu
    [J]. 2011 INTERNATIONAL CONFERENCE ON QUALITY, RELIABILITY, RISK, MAINTENANCE, AND SAFETY ENGINEERING (ICQR2MSE), 2011, : 385 - 387
  • [26] Research on Software Defect Prediction Based on Data Mining
    Chen, Yuan
    Shen, Xiang-heng
    Du, Peng
    Ge, Bing
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 563 - 567
  • [27] Empirical assessment of machine learning based software defect prediction techniques
    Challagulla, VUB
    Bastani, FB
    Yen, IL
    Paul, RA
    [J]. WORDS 2005: 10TH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE, PROCEEDINGS, 2005, : 263 - 270
  • [28] Empirical assessment of machine learning based software defect prediction techniques
    Challagulla, Venkata Udaya B.
    Bastani, Farokh B.
    Yen, I-Ling
    Paul, Raymond A.
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (02) : 389 - 400
  • [29] Empirical Study: How Issue Classification Influences Software Defect Prediction
    Afric, Petar
    Vukadin, Davor
    Silic, Marin
    Delac, Goran
    [J]. IEEE ACCESS, 2023, 11 : 11732 - 11748
  • [30] An empirical study on pareto based multi-objective feature selection for software defect prediction
    Ni, Chao
    Chen, Xiang
    Wu, Fangfang
    Shen, Yuxiang
    Gu, Qing
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 215 - 238