Observation points classifier ensemble for high-dimensional imbalanced classification

被引:2
|
作者
He, Yulin [1 ,2 ]
Li, Xu [1 ]
Fournier-Viger, Philippe [1 ]
Huang, Joshua Zhexue [1 ,2 ]
Li, Mianjie [3 ]
Salloum, Salman [4 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Nanhai Ave 3688, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen, Peoples R China
[3] Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Macao, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION; SMOTE;
D O I
10.1049/cit2.12100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an Observation Points Classifier Ensemble (OPCE) algorithm is proposed to deal with High-Dimensional Imbalanced Classification (HDIC) problems based on data processed using the Multi-Dimensional Scaling (MDS) feature extraction technique. First, dimensionality of the original imbalanced data is reduced using MDS so that distances between any two different samples are preserved as well as possible. Second, a novel OPCE algorithm is applied to classify imbalanced samples by placing optimised observation points in a low-dimensional data space. Third, optimization of the observation point mappings is carried out to obtain a reliable assessment of the unknown samples. Exhaustive experiments have been conducted to evaluate the feasibility, rationality, and effectiveness of the proposed OPCE algorithm using seven benchmark HDIC data sets. Experimental results show that (1) the OPCE algorithm can be trained faster on low-dimensional imbalanced data than on high-dimensional data; (2) the OPCE algorithm can correctly identify samples as the number of optimised observation points is increased; and (3) statistical analysis reveals that OPCE yields better HDIC performances on the selected data sets in comparison with eight other HDIC algorithms. This demonstrates that OPCE is a viable algorithm to deal with HDIC problems.
引用
收藏
页码:500 / 517
页数:18
相关论文
共 50 条
  • [41] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [42] Near-channel classifier: symbiotic communication and classification in high-dimensional space
    Hersche M.
    Lippuner S.
    Korb M.
    Benini L.
    Rahimi A.
    Brain Informatics, 2021, 8 (01)
  • [43] Clustering of imbalanced high-dimensional media data
    Šárka Brodinová
    Maia Zaharieva
    Peter Filzmoser
    Thomas Ortner
    Christian Breiteneder
    Advances in Data Analysis and Classification, 2018, 12 : 261 - 284
  • [44] Feature Selection with High-Dimensional Imbalanced Data
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    Wald, Randall
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
  • [45] Clustering of imbalanced high-dimensional media data
    Brodinova, Sarka
    Zaharieva, Maia
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (02) : 261 - 284
  • [46] Adaptive Semi-Supervised Classifier Ensemble for High Dimensional Data Classification
    Yu, Zhiwen
    Zhang, Yidong
    You, Jane
    Chen, C. L. Philip
    Wong, Hau-San
    Han, Guoqiang
    Zhang, Jun
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (02) : 366 - 379
  • [47] Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Wen, Xin
    Cao, Wenming
    Chen, C. L. Philip
    Wong, Hau-San
    You, Jane
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1387 - 1400
  • [48] Quasi-Linear SVM with Local Offsets for High-dimensional Imbalanced Data Classification
    Yanze, Li
    Harutoshi, Ogai
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 882 - 887
  • [49] Research on classification method of high-dimensional class-imbalanced datasets based on SVM
    Chunkai Zhang
    Ying Zhou
    Jianwei Guo
    Guoquan Wang
    Xuan Wang
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 1765 - 1778
  • [50] Research on classification method of high-dimensional class-imbalanced datasets based on SVM
    Zhang, Chunkai
    Zhou, Ying
    Guo, Jianwei
    Wang, Guoquan
    Wang, Xuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (07) : 1765 - 1778