A clustering-based discretization for supervised learning

Cited by: 39
Authors
Gupta, Ankit [2 ]
Mehrotra, Kishan G. [1 ]
Mohan, Chilukuri [1 ]
Affiliations
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Ctr Sci & Technol 4 106, Syracuse, NY 13244 USA
[2] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
Keywords
Discretization; Clustering; Binning; Supervised learning
DOI
10.1016/j.spl.2010.01.015
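Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract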
We address the problem of discretization of continuous variables for machine learning classification algorithms. Existing procedures do not use interdependence between the variables towards this goal. Our proposed method uses clustering to exploit such interdependence. Numerical results show that this improves the classification performance in almost all cases. Even if an existing algorithm can successfully operate with continuous variables, better performance is obtained if the variables are first discretized. An additional advantage of discretization is that it reduces the overall computation time. (C) 2010 Elsevier B.V. All rights reserved.
Pages: 816-824
Number of pages: 9