Cost-sensitive Dictionary Learning for Software Defect Prediction

Cited by: 12
Authors
Niu, Liang [1]
Wan, Jianwu [1, 2]
Wang, Hongyuan [1]
Zhou, Kaiwei [1]
Affiliations
[1] Changzhou Univ, Sch Informat Sci & Engn, Changzhou 213164, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
Funding
National Natural Science Foundation of China;
Keywords
Software defect prediction; Cost-sensitive; Dictionary learning; Discrimination; LABEL PROPAGATION; NEURAL-NETWORKS; RECOGNITION; INFORMATION; MACHINE; QUALITY;
DOI
10.1007/s11063-020-10355-z
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, software defect prediction has been recognized as a cost-sensitive learning problem. To deal with the unequal misclassification losses caused by different classification errors, several cost-sensitive dictionary learning methods have recently been proposed. These methods typically define misclassification costs to measure the unequal losses and then minimize a cost-sensitive reconstruction loss by embedding the cost information into the reconstruction function of dictionary learning. Although promising performance has been achieved, their cost-sensitive reconstruction functions are not well designed. In addition, insufficient attention has been paid to the coding coefficients, which can also help reduce the reconstruction loss. To address these issues, this paper proposes a new cost-sensitive reconstruction loss function and introduces an additional cost-sensitive discrimination regularization for the coding coefficients. The two terms are jointly optimized in a unified cost-sensitive dictionary learning framework. By doing so, we can achieve the minimum reconstruction loss and thus obtain a more cost-sensitive dictionary for feature encoding of test data. In the experimental part, we have conducted extensive experiments on twenty-five software projects from four benchmark datasets: NASA, AEEEM, ReLink and Jureczko. The results, in comparison with ten state-of-the-art software defect prediction methods, demonstrate the effectiveness of the learned cost-sensitive dictionary for software defect prediction.
Pages: 2415-2449
Page count: 35
Related papers
50 in total
  • [41] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [42] Adversarial Learning With Cost-Sensitive Classes
    Shen, Haojing
    Chen, Sihong
    Wang, Ran
    Wang, Xizhao
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4855 - 4866
  • [43] Cost-Sensitive Decision Tree Learning
    Vadera, Sunil
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 4 - 5
  • [44] Cost-sensitive positive and unlabeled learning
    Chen, Xiuhua
    Gong, Chen
    Yang, Jian
    INFORMATION SCIENCES, 2021, 558 : 229 - 245
  • [45] Robust SVM for Cost-Sensitive Learning
    Jiangzhang Gan
    Jiaye Li
    Yangcai Xie
    Neural Processing Letters, 2022, 54 : 2737 - 2758
  • [46] Cost-Sensitive Action Model Learning
    Rao, Dongning
    Jiang, Zhihua
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2016, 24 (02) : 167 - 193
  • [47] Roulette sampling for cost-sensitive learning
    Sheng, Victor S.
    Ling, Charles X.
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 724 - +
  • [48] Robust SVM for Cost-Sensitive Learning
    Gan, Jiangzhang
    Li, Jiaye
    Xie, Yangcai
    NEURAL PROCESSING LETTERS, 2022, 54 (04) : 2737 - 2758
  • [49] Active learning for cost-sensitive classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daumé III, Hal
    Langford, John
    Journal of Machine Learning Research, 2019, 20
  • [50] Cost-Sensitive Learning with Noisy Labels
    Natarajan, Nagarajan
    Dhillon, Inderjit S.
    Ravikumar, Pradeep
    Tewari, Ambuj
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18 : 1 - 33