Systematic literature review of machine learning based software development effort estimation models

Cited by: 300
Authors
Wen, Jianfeng [1]
Li, Shixian [1]
Lin, Zhiyong [2]
Hu, Yong [3]
Huang, Changqin [4]
Affiliations
[1] Sun Yat Sen Univ, Dept Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Guangdong Polytech Normal Univ, Dept Comp Sci, Guangzhou, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Inst Business Intelligence & Knowledge Discovery, Dept Commerce E, Guangdong Univ Foreign Studies, Guangzhou 510275, Guangdong, Peoples R China
[4] S China Normal Univ, Engn Res Ctr Comp Network & Informat Syst, Guangzhou, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Software effort estimation; Machine learning; Systematic literature review; DEVELOPMENT COST ESTIMATION; ARTIFICIAL NEURAL-NETWORKS; EFFORT PREDICTION; EMPIRICAL VALIDATION; GENETIC ALGORITHM; PROJECT EFFORT; ANALOGY; REGRESSION; INFORMATION; SELECTION;
DOI
10.1016/j.infsof.2011.09.002
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Context: Software development effort estimation (SDEE) is the process of predicting the effort required to develop a software system. To improve estimation accuracy, many researchers have proposed machine learning (ML) based SDEE models (ML models) since the 1990s. However, there has been no attempt to analyze the empirical evidence on ML models in a systematic way. Objective: This research aims to systematically analyze ML models from four aspects: type of ML technique, estimation accuracy, model comparison, and estimation context. Method: We performed a systematic literature review of empirical studies on ML models published in the last two decades (1991-2010). Results: We identified 84 primary studies relevant to the objective of this research. After investigating these studies, we found that eight types of ML techniques have been employed in SDEE models. Overall, the estimation accuracy of these ML models is close to the acceptable level and is better than that of non-ML models. Furthermore, different ML models have different strengths and weaknesses and thus favor different estimation contexts. Conclusion: ML models are promising in the field of SDEE. However, the application of ML models in industry is still limited, so more effort and incentives are needed to facilitate their adoption. To this end, based on the findings of this review, we provide recommendations for researchers as well as guidelines for practitioners. (C) 2011 Elsevier B.V. All rights reserved.
Pages: 41-59
Number of pages: 19
Related papers
50 in total
  • [41] Preliminary performance study of a brief review on machine learning techniques for analogy based software effort estimation
    Kumar, K. Harish
    Srinivas, K.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 2141 - 2165
  • [42] Preliminary performance study of a brief review on machine learning techniques for analogy based software effort estimation
    K. Harish Kumar
    K. Srinivas
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 2141 - 2165
  • [43] Systematic literature review of ensemble effort estimation
    Idri, Ali
    Hosni, Mohamed
    Abran, Alain
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 118 : 151 - 175
  • [44] Software Effort Estimation with Use Case Points using Ensemble Machine Learning Models
    Marapelli, Bhaskar
    Carie, Anil
    Islam, Sardar M. N.
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 333 - 338
  • [45] Machine learning based software effort estimation using development-centric features for crowdsourcing platform
    Ying, Zuobin
    Ling, Min
    Zhang, Yiwen
    INTELLIGENT DATA ANALYSIS, 2024, 28 (02) : 451 - 465
  • [46] Machine learning based software effort estimation using development-centric features for crowdsourcing platform
    Yasmin, Anum
    Haider, Wasi
    Daud, Ali
    Banjar, Ameen
    INTELLIGENT DATA ANALYSIS, 2024, 28 (01) : 299 - 329
  • [47] Machine Learning Models for Automatic Labeling: A Systematic Literature Review
    Fredriksson, Teodor
    Bosch, Jan
    Olsson, Helena
    ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 552 - 561
  • [48] Software Risk Prediction: Systematic Literature Review on Machine Learning Techniques
    Mahmud, Mahmudul Hoque
    Nayan, Md Tanzirul Haque
    Ashir, Dewan Md Nur Anjum
    Kabir, Md Alamgir
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [49] A systematic literature review of machine learning techniques for software maintainability prediction
    Alsolai, Hadeel
    Roper, Marc
    INFORMATION AND SOFTWARE TECHNOLOGY, 2020, 119
  • [50] A systematic review of studies on use case points and expert-based estimation of software development effort
    Mahmood, Yasir
    Kama, Nazri
    Azmi, Azri
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2020, 32 (07)