Imbalanced Data Classification Based on Feature Selection Techniques

被引:10
|
作者
Ksieniewicz, Pawel [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Dept Syst & Comp Networks, Wroclaw, Poland
关键词
Machine learning; Classification; Imbalanced data; Feature selection; Random search;
D O I
10.1007/978-3-030-03496-2_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The difficulty of the many classification tasks lies in the analyzed data nature, as disproportionate number of examples from different class in a learning set. Ignoring this characteristics causes that canonical classifiers display strongly biased performance on imbalanced datasets. In this work a novel classifier ensemble forming technique for imbalanced datasets is presented. On the one hand it takes into consideration selected features used for training individual classifiers, on the other hand it ensures an appropriate diversity of a classifier ensemble. The proposed method was tested on the basis of the computer experiments carried out on the several benchmark datasets. Their results seem to confirm the usefulness of the proposed concept.
引用
收藏
页码:296 / 303
页数:8
相关论文
共 50 条
  • [21] An effective distance based feature selection approach for imbalanced data
    Shahee, Shaukat Ali
    Ananthakumar, Usha
    [J]. APPLIED INTELLIGENCE, 2020, 50 (03) : 717 - 745
  • [22] An effective distance based feature selection approach for imbalanced data
    Shaukat Ali Shahee
    Usha Ananthakumar
    [J]. Applied Intelligence, 2020, 50 : 717 - 745
  • [23] Integration of feature vector selection and support vector machine for classification of imbalanced data
    Liu, Jie
    Zio, Enrico
    [J]. APPLIED SOFT COMPUTING, 2019, 75 : 702 - 711
  • [24] A weighted pattern matching approach for classification of imbalanced data with a fireworks-based algorithm for feature selection
    Sreeja, N. K.
    [J]. CONNECTION SCIENCE, 2019, 31 (02) : 143 - 168
  • [25] A Tabular Variational Auto Encoder-Based Hybrid Model for Imbalanced Data Classification With Feature Selection
    Abraham, Asha
    Mohideen, Habeeb Shaik
    Kayalvizhi, R.
    [J]. IEEE ACCESS, 2023, 11 : 122760 - 122771
  • [26] Assessments of Feature Selection Techniques with Respect to Data Sampling for Highly Imbalanced Software Measurement Data
    Gao, Kehan
    Khoshgoftaar, Taghi M.
    [J]. INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2015, 22 (02)
  • [27] Cluster-Based Instance Selection for the Imbalanced Data Classification
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2018, PT II, 2018, 11056 : 191 - 200
  • [28] Joint imbalanced classification and feature selection for hospital readmissions
    Du, Guodong
    Zhang, Jia
    Luo, Zhiming
    Ma, Fenglong
    Ma, Lei
    Li, Shaozi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 200
  • [29] Discriminative feature generation for classification of imbalanced data
    Suh, Sungho
    Lukowicz, Paul
    Lee, Yong Oh
    [J]. PATTERN RECOGNITION, 2022, 122
  • [30] Comparison of metrics for feature selection in imbalanced text classification
    Ogura, Hiroshi
    Amano, Hiromi
    Kondo, Masato
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 4978 - 4989