Systematic literature review of machine learning based software development effort estimation models

Cited by: 300

Authors:

Wen, Jianfeng [1]

Li, Shixian [1]

Lin, Zhiyong [2]

Hu, Yong [3]

Huang, Changqin [4]

Affiliations:

[1] Sun Yat Sen Univ, Dept Comp Sci, Guangzhou 510275, Guangdong, Peoples R China

[2] Guangdong Polytech Normal Univ, Dept Comp Sci, Guangzhou, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Inst Business Intelligence & Knowledge Discovery, Dept Commerce E, Guangdong Univ Foreign Studies, Guangzhou 510275, Guangdong, Peoples R China

[4] S China Normal Univ, Engn Res Ctr Comp Network & Informat Syst, Guangzhou, Guangdong, Peoples R China

Source:

INFORMATION AND SOFTWARE TECHNOLOGY | 2012 / Volume 54 / Issue 01

Funding:

National Natural Science Foundation of China;

Keywords:

Software effort estimation; Machine learning; Systematic literature review; DEVELOPMENT COST ESTIMATION; ARTIFICIAL NEURAL-NETWORKS; EFFORT PREDICTION; EMPIRICAL VALIDATION; GENETIC ALGORITHM; PROJECT EFFORT; ANALOGY; REGRESSION; INFORMATION; SELECTION;

DOI:

10.1016/j.infsof.2011.09.002

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Context: Software development effort estimation (SDEE) is the process of predicting the effort required to develop a software system. To improve estimation accuracy, many researchers have proposed machine learning (ML) based SDEE models (ML models) since the 1990s. However, there has been no attempt to analyze the empirical evidence on ML models in a systematic way. Objective: This research aims to systematically analyze ML models from four aspects: type of ML technique, estimation accuracy, model comparison, and estimation context. Method: We performed a systematic literature review of empirical studies on ML models published in the last two decades (1991-2010). Results: We have identified 84 primary studies relevant to the objective of this research. After investigating these studies, we found that eight types of ML techniques have been employed in SDEE models. Overall, the estimation accuracy of these ML models is close to the acceptable level and is better than that of non-ML models. Furthermore, different ML models have different strengths and weaknesses and thus favor different estimation contexts. Conclusion: ML models are promising in the field of SDEE. However, the application of ML models in industry is still limited, so more effort and incentives are needed to facilitate their application. To this end, based on the findings of this review, we provide recommendations for researchers as well as guidelines for practitioners. (C) 2011 Elsevier B.V. All rights reserved.


Pages: 41-59

Page count: 19
