Feature selection for software effort estimation with localized neighborhood mutual information

被引:10
|
作者
Liu, Qin [1 ]
Xiao, Jiakai [2 ]
Zhu, Hongming [1 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
关键词
Feature selection; Case based reasoning; Neighborhood mutual information; Software effort estimation;
D O I
10.1007/s10586-018-1884-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is usually employed before applying case based reasoning (CBR) for Software Effort Estimation (SEE). Unfortunately, most feature selection methods treat CBR as a black box method so there is no guarantee on the appropriateness of CBR on selected feature subset. The key to solve the problem is to measure the appropriateness of CBR assumption for a given feature set. In this paper, a measure called localized neighborhood mutual information (LNI) is proposed for this purpose and a greedy method called LNI based feature selection (LFS) is designed for feature selection. Experiment with leave-one-out cross validation (LOOCV) on 6 benchmark datasets demonstrates that: (1) CBR makes effective estimation with the LFS selected subset compared with a randomized baseline method. Compared with three representative feature selection methods, (2) LFS achieves optimal MAR value on 3 out of 6 datasets with a 14% average improvement and (3) LFS achieves optimal MMRE on 5 out of 6 datasets with a 24% average improvement.
引用
收藏
页码:S6953 / S6961
页数:9
相关论文
共 50 条
  • [1] Feature selection for software effort estimation with localized neighborhood mutual information
    Qin Liu
    Jiakai Xiao
    Hongming Zhu
    Cluster Computing, 2019, 22 : 6953 - 6961
  • [2] Mutual information for feature selection: estimation or counting?
    Nguyen H.B.
    Xue B.
    Andreae P.
    Evolutionary Intelligence, 2016, 9 (3) : 95 - 110
  • [3] FEATURE SELECTION BASED ON STATISTICAL ESTIMATION OF MUTUAL INFORMATION
    Kozhevin, A. A.
    SIBERIAN ELECTRONIC MATHEMATICAL REPORTS-SIBIRSKIE ELEKTRONNYE MATEMATICHESKIE IZVESTIYA, 2021, 18 : 720 - 728
  • [4] Software Development Effort Estimation Using Feature Selection Techniques
    Hosni, Mohamed
    Idri, Ali
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 439 - 452
  • [5] Multi-label feature selection based on neighborhood mutual information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Chen, Jinkun
    Duan, Jie
    APPLIED SOFT COMPUTING, 2016, 38 : 244 - 256
  • [6] A novel feature gene selection method based on neighborhood mutual information
    Chen, Tao
    Hong, Zenglin
    Zhao, Hui
    Yang, Xiao
    Wei, Jun
    International Journal of Hybrid Information Technology, 2015, 8 (07): : 277 - 292
  • [7] A Mutual Information-Based Hybrid Feature Selection Method for Software Cost Estimation Using Feature Clustering
    Shi, Shihai
    Liu, Qin
    INTERNATIONAL JOINT CONFERENCE ON APPLIED MATHEMATICS, STATISTICS AND PUBLIC ADMINISTRATION (AMSPA 2014), 2014, : 481 - 490
  • [8] A Mutual Information-Based Hybrid Feature Selection Method for Software Cost Estimation Using Feature Clustering
    Liu, Qin
    Shi, Shihai
    Zhu, Hongming
    Xiao, Jiakai
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 27 - 32
  • [9] A study of mutual information based feature selection for case based reasoning in software cost estimation
    Li, Y. F.
    Xie, M.
    Go, T. N.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 5921 - 5931
  • [10] An Effective Feature Selection Method via Mutual Information Estimation
    Yang, Jian-Bo
    Ong, Chong-Jin
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (06): : 1550 - 1559