Predictive software maintenance utilizing cross-project data

被引:0
|
作者
Yogita Khatri
Sandeep Kumar Singh
机构
[1] Jaypee Institute of Information Technology,Department of Computer Science Engineering and Information Technology
关键词
Cross-project fault prediction; Machine learning-based evaluation metrics; Effort-based evaluation metrics; Software quality; Software maintenance;
D O I
暂无
中图分类号
学科分类号
摘要
To improve the software quality and reduce the maintenance cost, cross-project fault prediction (CPFP) identifies faulty software components in a particular project (aka target project) using the historical fault data of other projects (aka source/reference projects). Although several diverse approaches/models have been proposed in the past, there exists room for improvement in the prediction performance. Further, they did not consider effort-based evaluation metrics (EBEMs), which are important to ensure the model’s application in the industry, undertaking a realistic constraint of having a limited inspection effort. Besides, they validated their respective approaches using a limited number of datasets. Addressing these issues, we propose an improved CPFP model with its validation on a large corpus of data containing 62 datasets in terms of EBEMs (PIM@20%, Cost-effectiveness@20%, and IFA) and other machine learning-based evaluation metrics (MLBEMs) like PF, G-measure, and MCC. The reference data and the target data are first normalized to reduce the distribution divergence between them and then the relevant training data is selected from the reference data using the KNN algorithm. Seeing the experimental and statistical test results, we claim the efficacy of our proposed model over state-of-the-art CPFP models namely the Turhan-Filter and Cruz model comprehensively. Thus, the proposed CPFP model provides an effective solution for predicting faulty software components, enabling practitioners in developing quality software with lesser maintenance cost.
引用
收藏
页码:1503 / 1518
页数:15
相关论文
共 50 条
  • [1] Predictive software maintenance utilizing cross-project data
    Khatri, Yogita
    Singh, Sandeep Kumar
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (04) : 1503 - 1518
  • [2] Cross-Project Data for Software Engineering
    Menzies, Tim
    COMPUTER, 2015, 48 (12) : 6 - 6
  • [3] Automated topic namingSupporting cross-project analysis of software maintenance activities
    Abram Hindle
    Neil A. Ernst
    Michael W. Godfrey
    John Mylopoulos
    Empirical Software Engineering, 2013, 18 : 1125 - 1155
  • [4] Automated topic naming Supporting cross-project analysis of software maintenance activities
    Hindle, Abram
    Ernst, Neil A.
    Godfrey, Michael W.
    Mylopoulos, John
    EMPIRICAL SOFTWARE ENGINEERING, 2013, 18 (06) : 1125 - 1155
  • [5] Assessing Cross-Project Technique for Software Maintainability Prediction
    Gupta, Shikha
    Chug, Anuradha
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 656 - 665
  • [6] Manifold Learning for Cross-project Software Defect Prediction
    Sun, Jing
    Jing, Xiaoyuan
    Dong, Xiwei
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 567 - 571
  • [7] A Survey on Cross-Project Software Defect Prediction Methods
    Chen X.
    Wang L.-P.
    Gu Q.
    Wang Z.
    Ni C.
    Liu W.-S.
    Wang Q.-P.
    2018, Science Press (41): : 254 - 274
  • [8] An Extensive Study on Cross-project Predictive Mutation Testing
    Mao, Dongyu
    Chen, Lingchao
    Zhang, Lingming
    2019 IEEE 12TH CONFERENCE ON SOFTWARE TESTING, VALIDATION AND VERIFICATION (ICST 2019), 2019, : 160 - 171
  • [9] Impact Analysis of Cross-Project Bugs on Software Ecosystems
    Ma, Wanwangying
    Chen, Lin
    Zhang, Xiangyu
    Feng, Yang
    Xu, Zhaogui
    Chen, Zhifei
    Zhou, Yuming
    Xu, Baowen
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 100 - 111
  • [10] Cross-project software defect prediction based on multi-source data sets
    Huang Junfu
    Wang Yawen
    Gong Yunzhan
    Jin Dahai
    The Journal of China Universities of Posts and Telecommunications, 2021, 28 (04) : 75 - 87