Exploring of clustering algorithm on class-imbalanced data

被引:0
|
作者
Li Xuan [1 ]
Chen Zhigang [1 ]
Yang Fan [1 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China
关键词
Class-imbalanced Data; Clustering Algorithm; Imbalanced-ratios; CLASSIFICATION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Imbalanced data distribution still remains an unsolved problem in data mining and machine learning. This paper introduces the problem of the class-imbalanced data in classification learning and naturally introduces it into the clustering learning since data clustering is an important and frequently used unsupervised learning method. In this paper, two verification methods based on two different aspects of original data are proposed to test and verify the influence of class-imbalanced data on clustering. Furthermore, we also conduct some experiments on different imbalanced-ratios to exploring its importance in clustering algorithm since is a very important factor for the performance in classification learning. Experimental results indicate that the class-imbalance of the dataset can seriously influence the final performance and efficiency of the clustering algorithm, and the higher the ratio, the higher the adverse effects of the clustering performance based on class-imbalanced data.
引用
下载
收藏
页码:89 / 93
页数:5
相关论文
共 50 条
  • [31] Online feature selection for high-dimensional class-imbalanced data
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    Wu, Xindong
    KNOWLEDGE-BASED SYSTEMS, 2017, 136 : 187 - 199
  • [32] Active Broad-Transfer Learning Algorithm for Class-Imbalanced Fault Diagnosis
    Liu, Guokai
    Shen, Weiming
    Gao, Liang
    Kusiak, Andrew
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [33] A Class-Imbalanced Deep Learning Fall Detection Algorithm Using Wearable Sensors
    Zhang, Jing
    Li, Jia
    Wang, Weibing
    SENSORS, 2021, 21 (19)
  • [34] Active Broad-Transfer Learning Algorithm for Class-Imbalanced Fault Diagnosis
    Liu, Guokai
    Shen, Weiming
    Gao, Liang
    Kusiak, Andrew
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [35] Margin calibration in SVM class-imbalanced learning
    Yang, Chan-Yun
    Yang, Jr-Syu
    Wang, Jian-Jun
    NEUROCOMPUTING, 2009, 73 (1-3) : 397 - 411
  • [36] Prototypical Classifier for Robust Class-Imbalanced Learning
    Wei, Tong
    Shi, Jiang-Xin
    Li, Yu-Feng
    Zhang, Min-Ling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 44 - 57
  • [37] Feature selection and classification by minimizing overlap degree for class-imbalanced data in metabolomics
    Fu, Guang-Hui
    Wu, Yuan-Jiao
    Zong, Min-Jie
    Yi, Lun-Zhao
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 196
  • [38] Stable variable selection of class-imbalanced data with precision-recall criterion
    Fu, Guang-Hui
    Xu, Feng
    Zhang, Bing-Yang
    Yi, Lun-Zhao
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 171 : 241 - 250
  • [39] 2v-SSPC: A new classification method for class-imbalanced data
    Dept. of Applied Mathematics, Xidian Univ., Xi'an 710071, China
    不详
    不详
    Xi Tong Cheng Yu Dian Zi Ji Shu/Syst Eng Electron, 2008, 12 (2471-2476): : 2471 - 2476
  • [40] Prediction of DTIs for high-dimensional and class-imbalanced data based on CGAN
    Yang, Kang
    Zhang, Zhongnan
    He, Song
    Bo, Xiaochen
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 788 - 791