Feature selection for software effort estimation with localized neighborhood mutual information

Cited by: 10
Authors
Liu, Qin [1]
Xiao, Jiakai [2]
Zhu, Hongming [1]
Affiliations
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
Keywords
Feature selection; Case based reasoning; Neighborhood mutual information; Software effort estimation
DOI
10.1007/s10586-018-1884-x
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Feature selection is usually applied before case-based reasoning (CBR) is used for software effort estimation (SEE). Unfortunately, most feature selection methods treat CBR as a black box, so there is no guarantee that CBR is appropriate for the selected feature subset. The key to solving this problem is to measure how well the CBR assumption holds for a given feature set. In this paper, a measure called localized neighborhood mutual information (LNI) is proposed for this purpose, and a greedy method called LNI-based feature selection (LFS) is designed around it. Experiments with leave-one-out cross-validation (LOOCV) on 6 benchmark datasets demonstrate that (1) CBR makes effective estimates with the LFS-selected subset compared with a randomized baseline method; and that, compared with three representative feature selection methods, (2) LFS achieves the best MAR value on 3 out of 6 datasets with a 14% average improvement and (3) LFS achieves the best MMRE on 5 out of 6 datasets with a 24% average improvement.
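The evaluation protocol named in the abstract can be illustrated with a minimal sketch. It assumes CBR is realized as a k-nearest-neighbour analogy with Euclidean distance, that MAR is the mean absolute residual, and that MMRE is the mean magnitude of relative error. The greedy loop is a generic forward selection driven by a caller-supplied score. LNI is not defined in this record, so the score used here (LOOCV MAR) is only a stand-in and is not the authors' criterion. All function names and the toy data are hypothetical.

```python
import math


def cbr_predict(train_X, train_y, query, k=1):
    """Estimate effort as the mean effort of the k most similar past projects."""
    ranked = sorted((math.dist(x, query), y) for x, y in zip(train_X, train_y))
    nearest = ranked[:k]
    return sum(y for _, y in nearest) / len(nearest)


def loocv_errors(X, y, features, k=1):
    """Leave-one-out CV of CBR restricted to `features`; returns (MAR, MMRE)."""
    def project(row):
        return [row[j] for j in features]

    abs_res, rel_err = [], []
    for i in range(len(X)):
        # Hold out project i and predict it from all remaining projects.
        train_X = [project(X[j]) for j in range(len(X)) if j != i]
        train_y = [y[j] for j in range(len(X)) if j != i]
        pred = cbr_predict(train_X, train_y, project(X[i]), k)
        abs_res.append(abs(y[i] - pred))
        rel_err.append(abs(y[i] - pred) / y[i])
    n = len(X)
    return sum(abs_res) / n, sum(rel_err) / n


def greedy_select(n_features, score):
    """Forward selection: keep adding the feature that lowers `score` the most."""
    selected, best = [], math.inf
    while len(selected) < n_features:
        candidates = [f for f in range(n_features) if f not in selected]
        new_score, new_feature = min((score(selected + [f]), f) for f in candidates)
        if new_score >= best:
            break  # no remaining feature improves the score
        selected.append(new_feature)
        best = new_score
    return selected


# Made-up toy data: feature 0 drives effort, feature 1 is noise.
X = [[1, 9], [2, 1], [3, 7], [4, 2], [5, 8], [6, 3]]
y = [10, 20, 30, 40, 50, 60]

# Stand-in score (NOT LNI): LOOCV MAR of CBR on the candidate subset.
selected = greedy_select(2, lambda fs: loocv_errors(X, y, fs)[0])
mar, mmre = loocv_errors(X, y, selected)
print(selected, mar, mmre)
```

Scoring by LOOCV error makes this a wrapper method that treats CBR as a black box, which is the very approach the abstract argues against. Replacing the `score` argument with a measure of how well the CBR assumption holds on a subset is where a criterion such as LNI would plug in.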
Pages: S6953-S6961
Page count: 9