Statistical Issues on Optimization for Software Metric Models with Missing Data

被引:0
|
作者
Xie, Tianfa [1 ]
Ding, Wenxing [2 ]
机构
[1] Beijing Univ Technol, Coll Appl Sci, Beijing, Peoples R China
[2] China Natl Inst Standardizat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
software metrics; model optimization; k-NN imputation; kernel function; OBJECT-ORIENTED METRICS; NEURAL-NETWORKS; EMPIRICAL VALIDATION; PREDICTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When developing new software, software metric models are often applied in predicting certain important elements such as total work effort or error rate and so on. The procedures during the regression model construction using certain historical data, such as determination of the independent metrics, imputation of missing values and combining levels for independent categorical metrics, have been discussed already. In this paper, how to choose some important parameters for the proposed procedures during the model construction is considered in depth. The selection of critical parameters in the k-nearest neighbors (k-NN) multiple imputation is specifically focused. An example is given for illustration with data from widely used database.
引用
收藏
页码:1155 / 1159
页数:5
相关论文
共 50 条
  • [1] A statistical methodology to simplify software metric models constructed using incomplete data
    Chan, Victor K. Y.
    Wong, W. Eric
    Xie, T. F.
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2007, 17 (06) : 689 - 707
  • [2] Advanced statistical models for software data
    Succi, G
    Stefanovic, M
    Pedrycz, W
    WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVII, PROCEEDINGS: CYBERNETICS AND INFORMATICS: CONCEPTS AND APPLICATIONS (PT II), 2001, : 531 - 536
  • [3] Application of a statistical methodology to simplify software quality metric models constructed using incomplete data samples
    Chan, Victor K. Y.
    Wong, W. Eric
    Xie, T. F.
    QSIC 2006: SIXTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, PROCEEDINGS, 2006, : 15 - 21
  • [4] Statistical inference for imperfect maintenance models with missing data
    Dijoux, Yann
    Fouladirad, Mitra
    Dinh Tuan Nguyen
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2016, 154 : 84 - 96
  • [5] BAYESIAN SENSITIVITY ANALYSIS OF STATISTICAL MODELS WITH MISSING DATA
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Tang, Niansheng
    STATISTICA SINICA, 2014, 24 (02) : 871 - 896
  • [6] Bayesian Case Influence Measures for Statistical Models With Missing Data
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Cho, Hyunsoon
    Tang, Niansheng
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2012, 21 (01) : 253 - 271
  • [7] Practical and statistical issues in missing data for longitudinal patient-reported outcomes
    Bell, Melanie L.
    Fairclough, Diane L.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2014, 23 (05) : 440 - 459
  • [8] Statistical estimation in partial linear models with covariate data missing at random
    Wang, Qi-Hua
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2009, 61 (01) : 47 - 84
  • [9] Statistical estimation in partial linear models with covariate data missing at random
    Qi-Hua Wang
    Annals of the Institute of Statistical Mathematics, 2009, 61 : 47 - 84
  • [10] Missing Data Imputation Techniques for Software Effort Estimation: A Study of Recent Issues and Challenges
    Almutlaq, Ayman Jalal Hassan
    Jawawi, Dayang N. A.
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 1144 - 1158