Analyzing the Stationarity Process in Software Effort Estimation Datasets

被引:2
|
作者
Bosu, Michael Franklin [1 ]
MacDonell, Stephen G. [2 ]
Whigham, Peter A. [2 ]
机构
[1] Waikato Inst Technol, Ctr Informat Technol, Hamilton 3240, New Zealand
[2] Univ Otago, Dept Informat Sci, Dunedin 9054, New Zealand
关键词
Software effort estimation; software processes; stationarity; kernel estimators; weighted linear regression; MOVING WINDOWS; DEVELOPMENT COST;
D O I
10.1142/S0218194020400239
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software effort estimation models are typically developed based on an underlying assumption that all data points are equally relevant to the prediction of effort for future projects. The dynamic nature of several aspects of the software engineering process could mean that this assumption does not hold in at least some cases. This study employs three kernel estimator functions to test the stationarity assumption in five software engineering datasets that have been used in the construction of software effort estimation models. The kernel estimators are used in the generation of nonuniform weights which are subsequently employed in weighted linear regression modeling. In each model, older projects are assigned smaller weights while the more recently completed projects are assigned larger weights, to reflect their potentially greater relevance to present or future projects that need to be estimated. Prediction errors are compared to those obtained from uniform models. Our results indicate that, for the datasets that exhibit underlying nonstationary processes, uniform models are more accurate than the nonuniform models; that is, models based on kernel estimator functions are worse than the models where no weighting was applied. In contrast, the accuracies of uniform and nonuniform models for datasets that exhibited stationary processes were essentially equivalent. Our analysis indicates that as the heterogeneity of a dataset increases, the effect of stationarity is overridden. The results of our study also confirm prior findings that the accuracy of effort estimation models is independent of the type of kernel estimator function used in model development.
引用
收藏
页码:1607 / 1640
页数:34
相关论文
共 50 条
  • [1] Review of Existing Datasets Used for Software Effort Estimation
    Rahman, Mizanur
    Goncalves, Teresa
    Sarwar, Hasan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 921 - 931
  • [2] Analyzing the Effect of Variables in the Software Development Effort Estimation
    Velarde, H.
    Santiesteban, C.
    Garcia, A.
    Casillas, J.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (08) : 3797 - 3803
  • [3] Experience: Quality Benchmarking of Datasets Used in Software Effort Estimation
    Bosu, Michael F.
    Macdonell, Stephen G.
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2019, 11 (04):
  • [4] Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets
    Phannachitta, Passakorn
    Keung, Jacky
    Bennin, Kwabena Ebo
    Monden, Akito
    Matsumoto, Kenichi
    [J]. 2016 23RD ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2016), 2016, : 185 - 192
  • [5] A Virtual Study of Moving Windows for Software Effort Estimation Using Finnish Datasets
    Amasaki, Sousuke
    Lokan, Chris
    [J]. PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT (PROFES 2017), 2017, 10611 : 71 - 79
  • [6] Appropriate number of analogues in analogy based software effort estimation using quality datasets
    Pal, Nisha
    Yadav, Mahendra Pratap
    Yadav, Dharmendra Kumar
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (01): : 531 - 546
  • [7] Appropriate number of analogues in analogy based software effort estimation using quality datasets
    Nisha Pal
    Mahendra Pratap Yadav
    Dharmendra Kumar Yadav
    [J]. Cluster Computing, 2024, 27 : 531 - 546
  • [8] Software renewal process comprehension using dynamic effort estimation
    Caivano, D
    Lanubile, F
    Visaggio, G
    [J]. IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, PROCEEDINGS: SYSTEMS AND SOFTWARE EVOLUTION IN THE ERA OF THE INTERNET, 2001, : 209 - 218
  • [9] Analyzing the Effectiveness of the Gaussian Mixture Model Clustering Algorithm in Software Enhancement Effort Estimation
    Hai, Vo Van
    Nhung, Ho Le Thi Kim
    Prokopova, Zdenka
    Silhavy, Radek
    Silhavy, Petr
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 255 - 268
  • [10] SOFTWARE EFFORT MODELS FOR EARLY ESTIMATION OF PROCESS-CONTROL APPLICATIONS
    MUKHOPADHYAY, T
    KEKRE, S
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (10) : 915 - 924