Cost-sensitive Dictionary Learning for Software Defect Prediction

Cited by: 12
Authors
Niu, Liang [1]
Wan, Jianwu [1, 2]
Wang, Hongyuan [1]
Zhou, Kaiwei [1]
Affiliations
[1] Changzhou Univ, Sch Informat Sci & Engn, Changzhou 213164, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
Funding
National Natural Science Foundation of China;
Keywords
Software defect prediction; Cost-sensitive; Dictionary learning; Discrimination; LABEL PROPAGATION; NEURAL-NETWORKS; RECOGNITION; INFORMATION; MACHINE; QUALITY;
DOI
10.1007/s11063-020-10355-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, software defect prediction has been recognized as a cost-sensitive learning problem. To deal with the unequal misclassification losses resulting from different classification errors, several cost-sensitive dictionary learning methods have been proposed. These methods typically define misclassification costs to measure the unequal losses and then minimize a cost-sensitive reconstruction loss by embedding the cost information into the reconstruction function of dictionary learning. Although promising performance has been achieved, their cost-sensitive reconstruction functions are not well designed. In addition, insufficient attention is paid to the coding coefficients, which can also help reduce the reconstruction loss. To address these issues, this paper proposes a new cost-sensitive reconstruction loss function and introduces an additional cost-sensitive discrimination regularization for the coding coefficients. The two terms are jointly optimized in a unified cost-sensitive dictionary learning framework. By doing so, we can achieve the minimum reconstruction loss and thus obtain a more cost-sensitive dictionary for feature encoding of test data. In the experimental part, we have conducted extensive experiments on twenty-five software projects from four benchmark datasets: NASA, AEEEM, ReLink and Jureczko. The results, in comparison with ten state-of-the-art software defect prediction methods, demonstrate the effectiveness of the learned cost-sensitive dictionary for software defect prediction.
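The general idea of letting unequal misclassification costs steer a reconstruction-based decision can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not give its loss function or its discrimination regularizer, so the ridge-regularized coding, the `exp(-residual)` plausibility score, the fixed per-class sub-dictionaries, and all names below (`class_residuals`, `cost_sensitive_predict`, `cost`) are assumptions chosen only for illustration.

```python
import numpy as np


def class_residuals(x, dictionaries, lam=0.1):
    """Squared reconstruction residual of x on each class sub-dictionary.

    Coding uses closed-form ridge regression (an assumption for brevity;
    dictionary learning methods usually use sparse coding instead).
    """
    residuals = []
    for D in dictionaries:
        gram = D.T @ D + lam * np.eye(D.shape[1])
        a = np.linalg.solve(gram, D.T @ x)  # coding coefficients
        residuals.append(np.linalg.norm(x - D @ a) ** 2)
    return np.array(residuals)


def cost_sensitive_predict(x, dictionaries, cost):
    """Pick the class with the lowest expected misclassification cost.

    cost[i, j] is the (hypothetical) cost of predicting class j when the
    true class is i; the diagonal is zero.
    """
    r = class_residuals(x, dictionaries)
    # Smaller residual -> higher plausibility; shift by r.min() for stability.
    p = np.exp(-(r - r.min()))
    p /= p.sum()
    expected_cost = p @ cost  # entry j: expected cost of predicting class j
    return int(np.argmin(expected_cost))


# Toy example: class 0 = non-defective, class 1 = defective.
D_clean = np.array([[1.0], [0.0]])
D_defect = np.array([[0.0], [1.0]])
x = np.array([0.6, 0.5])  # slightly closer to the non-defective atom

equal_cost = np.array([[0.0, 1.0], [1.0, 0.0]])
# Missing a defect (true 1, predicted 0) is assumed five times as costly.
unequal_cost = np.array([[0.0, 1.0], [5.0, 0.0]])

pred_equal = cost_sensitive_predict(x, [D_clean, D_defect], equal_cost)
pred_unequal = cost_sensitive_predict(x, [D_clean, D_defect], unequal_cost)
```

With equal costs the borderline sample is assigned to the class that reconstructs it best; once a missed defect is made more expensive, the same sample is flagged as defective. The paper goes further by embedding the costs into dictionary training itself rather than only into the decision rule.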
Pages: 2415-2449
Page count: 35
Related papers
50 in total
  • [31] Cost-sensitive and ensemble-based prediction model for outsourced software project risk prediction
    Hu, Yong
    Feng, Bin
    Mo, Xizhu
    Zhang, Xiangzhou
    Ngai, E. W. T.
    Fan, Ming
    Liu, Mei
    DECISION SUPPORT SYSTEMS, 2015, 72 : 11 - 23
  • [32] On the Effectiveness of Cost Sensitive Neural Networks for Software Defect Prediction
    Muthukumaran, K.
    Dasgupta, Amrita
    Abhidnya, Shirode
    Neti, Lalita Bhanu Murthy
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 557 - 570
  • [33] Cost Sensitive Decision Forest and Voting for Software Defect Prediction
    Siers, Michael J.
    Islam, Md Zahidul
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 929 - 936
  • [34] Instance-dependent misclassification cost-sensitive learning for default prediction
    Xing, Jin
    Chi, Guotai
    Pan, Ancheng
    RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2024, 69
  • [35] Cost-sensitive boosting in software quality modeling
    Khoshgoftaar, TM
    7TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING, PROCEEDINGS, 2002, : 51 - 60
  • [36] Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning
    Yang, Pei-Tse
    Wu, Wen-Shuo
    Wu, Chia-Chun
    Shih, Yi-Nuo
    Hsieh, Chung-Ho
    Hsu, Jia-Lien
    OPEN MEDICINE, 2021, 16 (01): : 754 - 768
  • [37] Predicting Software Defects: A Cost-Sensitive Approach
    Bezerra, Miguel E. R.
    Oliveira, Adriano L. I.
    Adeodato, Paulo J. L.
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2515 - 2522
  • [38] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [39] Cost-sensitive learning of SVM for ranking
    Xu, Jun
    Cao, Yunbo
    Li, Hang
    Huang, Yalou
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 833 - 840
  • [40] Cost-Sensitive Learning in Answer Extraction
    Wiegand, Michael
    Leidner, Jochen L.
    Klakow, Dietrich
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 711 - 714