Instance selection using one-versus-all and one-versus-one decomposition approaches in multiclass classification datasets

被引:3
|
作者
Fang, Ching-Lin [1 ]
Wang, Ming-Chang [1 ]
Tsai, Chih-Fong [2 ]
Lin, Wei-Chao [3 ,4 ]
Liao, Pei-Qi [2 ]
机构
[1] Natl Chung Cheng Univ, Dept Business Adm, Chiayi, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Taoyuan, Taiwan
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Chang Gung Mem Hosp Linkou, Dept Thorac Surg, Taoyuan, Taiwan
关键词
data mining; instance selection; machine learning; multiclass classification; one-versus-all; one-versus-one; REDUCTION; BINARY;
D O I
10.1111/exsy.13217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance is important in data analysis and mining; it filters out unrepresentative, redundant, or noisy data from a given training set to obtain effective model learning. Various instance selection algorithms are proposed in the literature, and their potential and applicability in data cleaning and preprocessing steps are demonstrated. For multiclass classification datasets, the existing instance selection algorithms must deal with all the instances across the different classes simultaneously to produce a reduced training set. Generally, every multiclass classification dataset can be regarded as a complex domain problem, which can be effectively solved using the divide-and-conquer principle. In this study, the one-versus-all (OVA) and one-versus-one (OVO) decomposition approaches were used to decompose a multiclass dataset into multiple binary class datasets. These approaches have been widely employed when constructing the classifier but have never been considered in instance selection. The results of instance selection performance obtained with the OVA, OVO, and baseline approaches were assessed and compared for 20 different domain multiclass datasets as the first study and five medical domain datasets as the validation study. Furthermore, three instance selection algorithms were compared, including IB3, DROP3, and GA. The results demonstrate that using the OVO approach to perform instance selection can make the support vector machine (SVM) and k-nearest neighbour (k-NN) classifiers perform significantly better than the OVA and baseline approaches in terms of the area under the ROC curve (AUC) rate, regardless of the instance selection algorithm used. Moreover, the OVO approach can provide reasonably good data reduction rates and processing times, which are all better than those of the OVA approach.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] One-versus-one and one-versus-all multiclass SVM-RFE for gene selection in cancer classification
    Duan, Kai-Bo
    Rajapakse, Jagath C.
    Nguyen, Minh N.
    [J]. EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2007, 4447 : 47 - +
  • [2] Multiclass from Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches
    Rocha, Anderson
    Goldenstein, Siome Klein
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) : 289 - 302
  • [3] Combining One-Versus-One and One-Versus-All Strategies to Improve Multiclass SVM Classifier
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 37 - 45
  • [4] Multiclass imbalanced learning with one-versus-one decomposition and spectral clustering
    Li, Qianmu
    Song, Yanjun
    Zhang, Jing
    Sheng, Victor S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 147 (147)
  • [5] A multiclass classification using one-versus-all approach with the differential partition sampling ensemble
    Gao, Xin
    He, Yang
    Zhang, Mi
    Diao, Xinping
    Jing, Xiao
    Ren, Bing
    Ji, Weijia
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 97 (97)
  • [6] Unified Classification and Rejection: A One-versus-all Framework
    Cheng, Zhen
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    [J]. MACHINE INTELLIGENCE RESEARCH, 2024, 21 (05) : 870 - 887
  • [7] Ensemble selection in one-versus-one scheme - case study for cutting tools classification
    Rojek, Izabela
    Burduk, Robert
    Heda, Paulina
    [J]. BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2021, 69 (01)
  • [8] Universum Selection for Boosting the Performance of Multiclass Support Vector Machines Based on One-versus-One Strategy
    Songsiri, Patoomsiri
    Cherkassky, Vladimir
    Kijsirikul, Boonserm
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 159 : 9 - 19
  • [9] Adapted One-versus-All Decision Trees for Data Stream Classification
    Hashemi, Sattar
    Yang, Ying
    Mirzamomen, Zahra
    Kangavari, Mohammadreza
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (05) : 624 - 637
  • [10] Dialogue Act classification using RNNs introducing one-versus-all and attention mechanism
    Izumia H.
    Kato S.
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2019, 139 (12) : 1407 - 1414