A novel under sampling strategy for efficient software defect analysis of skewed distributed data

被引:0
|
作者
K. Nitalaksheswara Rao
Ch. Satyananda Reddy
机构
[1] Andhra University,Department of Computer Science and Systems Engineering
来源
Evolving Systems | 2020年 / 11卷
关键词
Software defects analysis; Classification; Decision tree; Class imbalance learning; Under sampling;
D O I
暂无
中图分类号
学科分类号
摘要
The software quality development process is a continuous process which starts by identifying a reliable fault detection technique. The implementation of the effective fault detection technique depends on the properties of the dataset in terms of domain information, characteristics of input data, complexity, etc. The early detection of defective modules provide more time for the developers to allocate resources effectively to deliver the quality software in time. The class imbalance nature of the software defect datasets indicates that the existing techniques are unsuccessful for identifying all the defective modules. Misclassification of the defective modules in the software engineering datasets invites unexpected loses to the software developers. To classify the class imbalance software datasets in an efficient way, we have proposed a novel approach called as under sampling strategy. This proposed approach uses under sampling strategy to reduce the less prominent instances from majority subset. The experimental results confirm that the proposed approach can deliver more accuracy in predicting the modules which are error prone with less and simple rules.
引用
收藏
页码:119 / 131
页数:12
相关论文
共 50 条
  • [11] Research on Data Extraction and Analysis of Software Defect in IoT Communication Software
    Bi, Wenbin
    Yu, Fang
    Cao, Ning
    Huo, Wei
    Cao, Guangsheng
    Han, Xiuli
    Sun, Lili
    Higgs, Russell
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (02): : 1837 - 1854
  • [12] Optimistic Sampling Strategy for Data-Efficient Reinforcement Learning
    Zhao, Dongfang
    Liu, Jiafeng
    Wu, Rui
    Cheng, Dansong
    Tang, Xianglong
    [J]. IEEE ACCESS, 2019, 7 : 55763 - 55769
  • [13] A Novel Method for Software Defect Prediction in the Context of Big Data
    Chang, Ruihua
    Shen, Xiaowei
    Wang, Binghe
    Xu, Qiuping
    [J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 105 - 109
  • [14] A novel access control strategy for distributed data systems
    Zhang, Jianying
    Wang, Xiukun
    Liu, Hongbo
    Meng, Jun
    [J]. Journal of Digital Information Management, 2010, 8 (05): : 291 - 297
  • [15] Security analysis and defense strategy of distributed filtering under false data injection attacks
    Zhou, Jiayu
    Yang, Wen
    Zhang, Heng
    Zheng, Wei Xing
    Xu, Yong
    Tang, Yang
    [J]. AUTOMATICA, 2022, 138
  • [16] HSDD: a Hybrid Sampling Strategy for Class Imbalance in Defect Prediction Data Sets
    Ozturk, M. Maruf
    Zengin, Ahmet
    [J]. 2016 FIFTH INTERNATIONAL CONFERENCE ON FUTURE COMMUNICATION TECHNOLOGIES (FGCT), 2016, : 60 - 69
  • [17] HSDD: a Hybrid Sampling Strategy for Class Imbalance in Defect Prediction Data Sets
    Ozturk, M. Maruf
    Zengin, Ahmet
    [J]. 2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016), 2016, : 225 - 234
  • [18] An Efficient Data Distribution Strategy for Distributed Graph Processing System
    Mukherjee, Aradhita
    Chaki, Rituparna
    Chaki, Nabendu
    [J]. COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT (CISIM 2022), 2022, 13293 : 360 - 373
  • [19] An empirical study of data sampling techniques for just-in-time software defect prediction
    Li, Zhiqiang
    Du, Qiannan
    Zhang, Hongyu
    Jing, Xiao-Yuan
    Wu, Fei
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2024, 31 (02)
  • [20] A method to find an efficient and robust sampling strategy under model uncertainty
    Bueno, Edgar
    Hedlin, Dan
    [J]. SURVEY METHODOLOGY, 2021, 47 (01) : 75 - 89