A Novel Classifier-independent Feature Selection Algorithm for Imbalanced Datasets

被引:0
|
作者
Zhu, Quanyin [1 ]
Cao, Suqun [2 ]
机构
[1] Huaiyin Inst Technol, Dept Comp Engn, Huaian, Peoples R China
[2] Huaiyin Inst Technol, Dept Mech Engn, Huaian, Peoples R China
关键词
imbalanced datasets; feature selection; posterior probability;
D O I
10.1109/SNPD.2009.47
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A novel classifier-independent feature selection algorithm based on the posterior probability is proposed for imbalanced datasets. First, an imbalanced factor is introduced and computed by Parzen-window estimation. The middle point of Tomek links is chosen as the initial point. Accordingly, this algorithm is iterated to find out the boundary points which have the equality of posterior probability. Through the project computation on the normal vectors of these points, the weight of each feature can be obtained, which actually indicates the importance degree of each feature. The experimental results on 3 real-word datasets demonstrate that this proposed algorithm can not only reduce the computational cost but also overcome the shortcoming that lite majority class may be defected well but the minority class may be ignored in the conventional feature selection algorithm.
引用
收藏
页码:77 / +
页数:2
相关论文
共 50 条
  • [1] Entropy criterion for classifier-independent feature selection
    Abe, N
    Kudo, M
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 689 - 695
  • [2] A divergence criterion for Classifier-Independent Feature Selection
    Abe, N
    Kudo, M
    Toyama, J
    Shimbo, M
    [J]. ADVANCES IN PATTERN RECOGNITION, 2000, 1876 : 668 - 676
  • [3] A Multi-Schematic Classifier-Independent Oversampling Approach for Imbalanced Datasets
    Bej, Saptarshi
    Schulz, Kristian
    Srivastava, Prashant
    Wolfien, Markus
    Wolkenhauer, Olaf
    [J]. IEEE ACCESS, 2021, 9 : 123358 - 123374
  • [4] Non-parametric classifier-independent feature selection
    Abe, N
    Kudo, M
    [J]. PATTERN RECOGNITION, 2006, 39 (05) : 737 - 746
  • [5] Classifier-independent feature selection on the basis of divergence criterion
    Abe, Naoto
    Kudo, Mineichi
    Toyama, Jun
    Shimbo, Masaru
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2006, 9 (2-3) : 127 - 137
  • [6] Classifier-independent feature selection on the basis of divergence criterion
    Naoto Abe
    Mineichi Kudo
    Jun Toyama
    Masaru Shimbo
    [J]. Pattern Analysis and Applications, 2006, 9 : 127 - 137
  • [7] FEATURE SELECTION FOR IMBALANCED DATASETS BASED ON IMPROVED GENETIC ALGORITHM
    Du, Limin
    Xu, Yang
    Jin, Liuqian
    [J]. DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 119 - 124
  • [8] Multi-class classifier-independent feature analysis
    Holz, HJ
    Loew, MH
    [J]. PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1219 - 1224
  • [9] FEATURE SELECTION FOR DATASETS WITH IMBALANCED CLASS DISTRIBUTIONS
    Kamal, Abu H. M.
    Zhu, Xingquan
    Pandya, Abhijit
    Hsu, Sam
    Narayanan, Ramaswamy
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2010, 20 (02) : 113 - 137
  • [10] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267