A clustering-based discretization for supervised learning

被引:39
|
作者
Gupta, Ankit [2 ]
Mehrotra, Kishan G. [1 ]
Mohan, Chilukuri [1 ]
机构
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Ctr Sci & Technol 4 106, Syracuse, NY 13244 USA
[2] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
关键词
Discretization; Clustering; Binning; Supervised learning;
D O I
10.1016/j.spl.2010.01.015
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We address the problem of discretization of continuous variables for machine learning classification algorithms. Existing procedures do not use interdependence between the variables towards this goal. Our proposed method uses clustering to exploit such interdependence. Numerical results show that this improves the classification performance in almost all cases. Even if an existing algorithm can successfully operate with continuous variables, better performance is obtained if the variables are first discretized. An additional advantage of discretization is that it reduces the overall computation time. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:816 / 824
页数:9
相关论文
共 50 条
  • [21] Clustering-based incremental learning for imbalanced data classification
    Liu, Yuxin
    Du, Guangyu
    Yin, Chenke
    Zhang, Haichao
    Wang, Jia
    KNOWLEDGE-BASED SYSTEMS, 2024, 292
  • [22] Clustering-based incremental learning for imbalanced data classification
    Liu, Yuxin
    Du, Guangyu
    Yin, Chenke
    Zhang, Hachao
    Wang, Jia
    Knowledge-Based Systems, 2024, 292
  • [23] Clustering-based attack detection for adversarial reinforcement learning
    Majadas, Ruben
    Garcia, Javier
    Fernandez, Fernando
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2631 - 2647
  • [24] Consensus Clustering-Based Undersampling Approach to Imbalanced Learning
    Onan, Aytug
    SCIENTIFIC PROGRAMMING, 2019, 2019
  • [25] A Clustering-Based Method for Team Formation in Learning Environments
    Guijarro-Mata-Garcia, Marta
    Guijarro, Maria
    Fuentes-Fernandez, Ruben
    Hybrid Artificial Intelligent Systems, 2016, 9648 : 475 - 486
  • [26] Representation Learning by Denoising Autoencoders for Clustering-based Classification
    Owhadi-Kareshk, Moein
    Akbarzadeh-T, Mohammad-R
    2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2015, : 228 - 233
  • [27] ClusterCNN: Clustering-Based Feature Learning for Hyperspectral Image Classification
    Yao, Wei
    Lian, Cheng
    Bruzzone, Lorenzo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (11) : 1991 - 1995
  • [28] Clustering-Based Ensemble Learning for Activity Recognition in Smart Homes
    Jurek, Anna
    Nugent, Chris
    Bi, Yaxin
    Wu, Shengli
    SENSORS, 2014, 14 (07) : 12285 - 12304
  • [29] Wind Speed Forecasting with a Clustering-Based Deep Learning Model
    Kosanoglu, Fuat
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [30] LEARNING CLUSTERING-BASED LINEAR MAPPINGS FOR QUANTIZATION NOISE REMOVAL
    Alain, Martin
    Guillemot, Christine
    Thoreau, Dominique
    Guillotel, Philippe
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4200 - 4204