A Bayesian approach for supervised discretization

Authors
Boullé, M
Keywords
supervised learning; data preparation; discretization; Bayesianism;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In supervised machine learning, some algorithms are restricted to discrete data and therefore require continuous attributes to be discretized. In this paper, we present a new discretization method called MODL, based on a Bayesian approach. The MODL method relies on a model space of discretizations and on a prior distribution defined on this model space. This yields an evaluation criterion for discretizations that is minimal for the most probable discretization given the data, i.e. the Bayes optimal discretization. We compare this approach with the MDL approach and with the statistical approaches used in other discretization methods, from both a theoretical and an experimental point of view. Extensive experiments show that the MODL method builds high-quality discretizations.
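The abstract does not state the evaluation criterion itself. As a rough sketch only, assuming the form in which the MODL criterion is usually quoted in the discretization literature (a uniform prior on the number of intervals, on the interval bounds, and on the class distribution within each interval, followed by a multinomial likelihood term), the cost of a candidate discretization might be computed as below. The names `log_binomial` and `modl_cost` and the input layout are illustrative assumptions, not taken from the paper.

```python
from math import lgamma, log


def log_binomial(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def modl_cost(interval_counts):
    """Negative log posterior (up to a constant) of a discretization.

    interval_counts[i][j] is the number of instances that fall in
    interval i and carry class label j. Lower cost means a more
    probable discretization under the assumed prior.
    """
    num_intervals = len(interval_counts)
    num_classes = len(interval_counts[0])
    n = sum(sum(row) for row in interval_counts)

    # Assumed prior: number of intervals uniform between 1 and n.
    cost = log(n)
    # Assumed prior: all placements of the interval bounds equally likely.
    cost += log_binomial(n + num_intervals - 1, num_intervals - 1)
    for row in interval_counts:
        n_i = sum(row)
        # Assumed prior: all class-count vectors in the interval equally likely.
        cost += log_binomial(n_i + num_classes - 1, num_classes - 1)
        # Likelihood term: log of the multinomial coefficient n_i! / prod_j n_ij!.
        cost += lgamma(n_i + 1) - sum(lgamma(c + 1) for c in row)
    return cost


# Two candidate discretizations of the same 20 instances with 2 classes.
separated = [[10, 0], [0, 10]]  # one cut point that separates the classes
merged = [[10, 10]]             # no cut point at all
print(modl_cost(separated), modl_cost(merged))
```

Under these assumptions, the split that separates the classes receives the lower cost, while splitting uninformative data (for example `[[5, 5], [5, 5]]`) costs more than leaving it as a single interval, which is the behaviour one would expect from a criterion that is minimal for the most probable discretization.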
Pages: 199-208
Page count: 10