Imbalanced Data Classification Based on Feature Selection Techniques

被引:10
|
作者
Ksieniewicz, Pawel [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Dept Syst & Comp Networks, Wroclaw, Poland
关键词
Machine learning; Classification; Imbalanced data; Feature selection; Random search;
D O I
10.1007/978-3-030-03496-2_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The difficulty of the many classification tasks lies in the analyzed data nature, as disproportionate number of examples from different class in a learning set. Ignoring this characteristics causes that canonical classifiers display strongly biased performance on imbalanced datasets. In this work a novel classifier ensemble forming technique for imbalanced datasets is presented. On the one hand it takes into consideration selected features used for training individual classifiers, on the other hand it ensures an appropriate diversity of a classifier ensemble. The proposed method was tested on the basis of the computer experiments carried out on the several benchmark datasets. Their results seem to confirm the usefulness of the proposed concept.
引用
收藏
页码:296 / 303
页数:8
相关论文
共 50 条
  • [41] Feature Selection and Imbalanced Data Handling for Depression Detection
    Mousavian, Marzieh
    Chen, Jianhua
    Greening, Steven
    [J]. BRAIN INFORMATICS, BI 2018, 2018, 11309 : 349 - 358
  • [42] Multiset Feature Learning for Highly Imbalanced Data Classification
    Wu, Fei
    Jing, Xiao-Yuan
    Shan, Shiguang
    Zuo, Wangmeng
    Yang, Jing-Yu
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1583 - 1589
  • [43] Multiset Feature Learning for Highly Imbalanced Data Classification
    Jing, Xiao-Yuan
    Zhang, Xinyu
    Zhu, Xiaoke
    Wu, Fei
    You, Xinge
    Gao, Yang
    Shan, Shiguang
    Yang, Jing-Yu
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 139 - 156
  • [44] Feature Selection with High-Dimensional Imbalanced Data
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    Wald, Randall
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
  • [45] Ensemble classification for imbalanced data based on feature space partitioning and hybrid metaheuristics
    Lopez-Garcia, Pedro
    Masegosa, Antonio D.
    Osaba, Eneko
    Onieva, Enrique
    Perallos, Asier
    [J]. APPLIED INTELLIGENCE, 2019, 49 (08) : 2807 - 2822
  • [46] Default forecasting based on a novel group feature selection method for imbalanced data
    Chi, Guotai
    Xing, Jin
    Pan, Ancheng
    [J]. JOURNAL OF CREDIT RISK, 2023, 19 (03): : 51 - 77
  • [47] Ensemble classification for imbalanced data based on feature space partitioning and hybrid metaheuristics
    Pedro Lopez-Garcia
    Antonio D. Masegosa
    Eneko Osaba
    Enrique Onieva
    Asier Perallos
    [J]. Applied Intelligence, 2019, 49 : 2807 - 2822
  • [48] An Approach to Imbalanced Data Classification Based on Instance Selection and Over-Sampling
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, 2019, 11683 : 601 - 610
  • [49] Exploring Data Sampling Techniques for Imbalanced Classification Problems
    Sui, Yu
    Zhang, Xiaohui
    Huan, Jiajia
    Hong, Haifeng
    [J]. FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [50] Using Evolutionary Multiobjective Techniques for Imbalanced Classification Data
    Garcia, Sandra
    Aler, Ricardo
    Maria Galvan, Ines
    [J]. ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 422 - 427