One-class ensemble classifier for data imbalance problems

被引:21
|
作者
Hayashi, Toshitaka [1 ]
Fujita, Hamido [2 ,3 ]
机构
[1] Iwate Prefectural Univ, Fac Software & Informat Sci, Takizawa, Japan
[2] I Somet Org Inc Assoc, Morioka, Iwate, Japan
[3] Iwate Prefectural Univ, Reg Res Ctr, Takizawa, Japan
关键词
Imbalanced data classification; One-class classification; Ensemble learning; One-class ensemble; SMOTE; COMPLEXITY; SELECTION; SUPPORT;
D O I
10.1007/s10489-021-02671-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data classification is an important issue in machine learning. Despite various studies, solving the data imbalance problem is still difficult. Since the oversampling method uses fake minority data, such a method is untrusted and causing security instability. The main objective of this paper is to improve accuracy for data imbalance classification without generating fake minority data. For this purpose, a reliable strategy is proposed using an ensemble of one-class classifiers. Such a classifier does not suffer data imbalance problems since the model learns from a single class. In particular, training data is split into minority and majority sets. Then, one-class classifiers are trained separately and applied to compute minority and majority scores for testing data. Finally, classification is made based on the combination of both scores. The proposed method is experimented with using imbalanced-learn datasets. Moreover, the result is compared with sampling methods via Decision Tree and K Nearest Neighbors classifiers. One-class ensemble classifier outperforms sampling methods in 20 datasets.
引用
收藏
页码:17073 / 17089
页数:17
相关论文
共 50 条
  • [21] Norm ball classifier for one-class classification
    Kim, Sehwa
    Lee, Kyungsik
    Jeong, Young-Seon
    ANNALS OF OPERATIONS RESEARCH, 2021, 303 (1-2) : 433 - 482
  • [22] One-class classifier based on principal curves
    Borges, Fernando Elias de Melo
    Mota, Otavio Fidelis
    Ferreira, Danton Diego
    Barbosa, Bruno Henrique Groenner
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26): : 19015 - 19024
  • [23] Diversity measures for one-class classifier ensembles
    Krawczyk, Bartosz
    Wozniak, Michal
    NEUROCOMPUTING, 2014, 126 : 36 - 44
  • [24] Norm ball classifier for one-class classification
    Sehwa Kim
    Kyungsik Lee
    Young-Seon Jeong
    Annals of Operations Research, 2021, 303 : 433 - 482
  • [25] Hybrid One-Class Ensemble for High-Dimensional Data Classification
    Krawczyk, Bartosz
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT II, 2016, 9622 : 136 - 144
  • [26] One-class Classifier Ensemble based Enhanced Semisupervised Classification of Hyperspectral Remote Sensing Images
    Singh, Pangambam Sendash
    Singh, Vijendra Pratap
    Pandey, Manish Kumar
    Karthikeyan, Subbiah
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 22 - 27
  • [27] Cluster-Based One-Class Ensemble for Classification Problems in Information Retrieval
    Lipka, Nedim
    Stein, Benno
    Anderka, Maik
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1041 - 1042
  • [28] OC-WAD: A One-Class Classifier Ensemble Approach for Anomaly Detection in Web Traffic
    Parhizkar, Elham
    Abadi, Mahdi
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 631 - 636
  • [29] Condition Monitoring System Design with One-class and Imbalanced-Data Classifier
    Wang, Shijin
    Xi, Lifeng
    2009 IEEE 16TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2009, : 779 - +
  • [30] Fast structural ensemble for One-Class Classification
    Liu, Jiachen
    Miao, Qiguang
    Sun, Yanan
    Song, Jianfeng
    Quan, Yining
    PATTERN RECOGNITION LETTERS, 2016, 80 : 179 - 187