Feature Selection for High Dimensional Data Using Weighted K-Nearest Neighbors and Genetic Algorithm

Cited by: 20
Authors
Li, Shuangjie [1]
Zhang, Kaixiang [1]
Chen, Qianru [1]
Wang, Shuqin [1]
Zhang, Shaoqiang [1]
Affiliations
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature selection; weighted K-nearest neighbors; genetic algorithm; real coding; INFORMATION; FRAMEWORK; RELEVANCE;
DOI
10.1109/ACCESS.2020.3012768
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Too many input features may lead to over-fitting and degrade the performance of a learning algorithm. Moreover, in most cases each feature carries a different amount of information and therefore affects the prediction target differently. This paper therefore proposes WKNNGAFS, a feature selection method that calculates the importance of each feature. In this method, a genetic algorithm (GA) searches for the optimal weight vector, whose ith component corresponds to the contribution of the ith feature to the classification from a global perspective. In addition, the weighted K-nearest neighbors algorithm (WKNN), which accounts for both the different contributions of the nearest neighbors and the different classification ability of each feature, is used to determine the target label. To evaluate the effectiveness of the proposed method, it is compared with nine existing feature selection methods on 13 real datasets, including 6 high-dimensional microarray datasets. Experimental results demonstrate that the proposed method is more effective and can improve classification performance.
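The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' WKNNGAFS implementation: the abstract does not specify the distance form, the neighbor weighting, or the GA operators, so the weighted Euclidean distance, inverse-distance voting, tournament selection, arithmetic crossover, uniform-reset mutation, elitism, and seeding the population with an all-ones vector are all assumptions made here for illustration. The function names (`wknn_predict`, `fitness`, `ga_feature_weights`) are hypothetical.

```python
import numpy as np

def wknn_predict(X_train, y_train, X_query, w, k=5):
    """Feature-weighted, distance-weighted KNN vote (illustrative variant)."""
    # Assumed distance: sqrt(sum_i w_i * (x_i - z_i)^2)
    diff = X_query[:, None, :] - X_train[None, :, :]
    dist = np.sqrt((w * diff ** 2).sum(axis=2))
    nn = np.argsort(dist, axis=1)[:, :k]          # indices of the k nearest
    classes = np.unique(y_train)
    preds = np.empty(len(X_query), dtype=y_train.dtype)
    for q in range(len(X_query)):
        votes = 1.0 / (dist[q, nn[q]] + 1e-9)     # closer neighbors vote more
        labels = y_train[nn[q]]
        scores = [votes[labels == c].sum() for c in classes]
        preds[q] = classes[int(np.argmax(scores))]
    return preds

def fitness(w, X_tr, y_tr, X_val, y_val, k=5):
    """Validation accuracy of the weighted KNN under weight vector w."""
    return float((wknn_predict(X_tr, y_tr, X_val, w, k) == y_val).mean())

def ga_feature_weights(X_tr, y_tr, X_val, y_val, pop_size=20,
                       generations=30, mutation_rate=0.1, k=5, seed=0):
    """Real-coded GA over weight vectors in [0, 1]^n (assumed operators)."""
    rng = np.random.default_rng(seed)
    n = X_tr.shape[1]
    pop = rng.random((pop_size, n))
    pop[0] = 1.0                                  # assumption: include unweighted KNN
    for _ in range(generations):
        fit = np.array([fitness(w, X_tr, y_tr, X_val, y_val, k) for w in pop])
        elite = pop[np.argsort(fit)[::-1][:2]].copy()   # keep the two best
        children = []
        while len(children) < pop_size - 2:
            # Binary tournament selection for each parent
            a = rng.integers(pop_size, size=2)
            b = rng.integers(pop_size, size=2)
            p1, p2 = pop[a[np.argmax(fit[a])]], pop[b[np.argmax(fit[b])]]
            alpha = rng.random(n)
            child = alpha * p1 + (1 - alpha) * p2       # arithmetic crossover
            mask = rng.random(n) < mutation_rate
            child[mask] = rng.random(int(mask.sum()))   # uniform-reset mutation
            children.append(child)
        pop = np.vstack([elite, np.array(children)])
    fit = np.array([fitness(w, X_tr, y_tr, X_val, y_val, k) for w in pop])
    return pop[int(np.argmax(fit))]
```

Under these assumptions, features could then be ranked by the returned weights, for example with `np.argsort(w)[::-1][:m]` to keep the `m` highest-weighted features. Because the two best individuals are carried over unchanged each generation, the returned vector scores at least as well on the validation split as the all-ones vector it was seeded with.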
Pages: 139512 - 139528
Page count: 17
Related papers
50 in total
  • [1] Weighted k-nearest neighbors feature selection for high-dimensional multi-class data
    Bugata, Peter
    Drotar, Peter
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3066 - 3073
  • [2] Estimation of Missing Values Using a Weighted K-Nearest Neighbors Algorithm
    Ling, Wang
    Mei, Fu Dong
    [J]. 2009 INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY, VOL III, PROCEEDINGS, 2009, : 660 - 663
  • [3] EDITING FOR THE K-NEAREST NEIGHBORS RULE BY A GENETIC ALGORITHM
    KUNCHEVA, LI
    [J]. PATTERN RECOGNITION LETTERS, 1995, 16 (08) : 809 - 814
  • [4] An Improved Weighted K-Nearest Neighbors Algorithm for High Accuracy in Indoor Localization
    Hoang-Anh Pham
    Quang-Thien-Tri Nguyen
    Thanh-Van Le
    [J]. PROCEEDINGS OF 2019 25TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC), 2019, : 24 - 27
  • [5] Enhancing the Irish NFI using k-nearest neighbors and a genetic algorithm
    McInerney, Daniel
    Barrett, Frank
    McRoberts, Ronald E.
    Tomppo, Erkki
    [J]. CANADIAN JOURNAL OF FOREST RESEARCH, 2018, 48 (12) : 1482 - 1494
  • [6] Sequential random k-nearest neighbor feature selection for high-dimensional data
    Park, Chan Hee
    Kim, Seoung Bum
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (05) : 2336 - 2342
  • [7] Weighted nearest neighbors feature selection
    Bugata, Peter
    Drotar, Peter
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 749 - 761
  • [8] K-nearest neighbors clustering algorithm
    Gauza, Dariusz
    Zukowska, Anna
    Nowak, Robert
    [J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2014, 2014, 9290
  • [9] Density peaks clustering algorithm with K-nearest neighbors and weighted similarity
    Zhao J.
    Chen L.
    Wu R.-X.
    Zhang B.
    Han L.-Z.
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (12) : 2349 - 2357
  • [10] Weighted K-nearest neighbors classification based on Whale optimization algorithm
    Anvari, S.
    Azgomi, M. Abdollahi
    Dishabi, M. R. Ebrahimi
    Maheri, M.
    [J]. IRANIAN JOURNAL OF FUZZY SYSTEMS, 2023, 20 (03) : 61 - 74