A Bayesian approach for supervised discretization

被引:0
|
作者
Boullé, M
机构
关键词
supervised learning; data preparation; discretization; Bayesianism;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In supervised machine learning, some algorithms are restricted to discrete data and thus need to discretize continuous attributes. In this paper, we present a new discretization method called MODL, based on a Bayesian approach. The MODL method relies on a model space of discretizations and on a prior distribution defined on this model space. This allows the setting up of an evaluation criterion of discretization, which is minimal for the most probable discretization given the data, i.e. the Bayes optimal discretization. We compare this approach with the MDL approach and statistical approaches used in other discretization methods, from a theoretical and experimental point of view. Extensive experiments show that the MODL method builds high quality discretizations.
引用
收藏
页码:199 / 208
页数:10
相关论文
共 50 条
  • [41] A semi-supervised coarse-to-fine approach with bayesian optimization for lithology identification
    Xie, Yunxin
    Jin, Liangyu
    Zhu, Chenyang
    Wu, Siyu
    EARTH SCIENCE INFORMATICS, 2023, 16 (3) : 2285 - 2305
  • [42] A semi-supervised coarse-to-fine approach with bayesian optimization for lithology identification
    Yunxin Xie
    Liangyu Jin
    Chenyang Zhu
    Siyu Wu
    Earth Science Informatics, 2023, 16 : 2285 - 2305
  • [43] Application of an efficient Bayesian discretization method to biomedical data
    Lustgarten, Jonathan L.
    Visweswaran, Shyam
    Gopalakrishnan, Vanathi
    Cooper, Gregory F.
    BMC BIOINFORMATICS, 2011, 12
  • [44] A non-parametric semi-supervised discretization method
    Bondu, Alexis
    Boulle, Marc
    Lemaire, Vincent
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (01) : 35 - 57
  • [45] FUZZY DISCRETIZATION TECHNIQUE FOR BAYESIAN FLOOD DISASTER MODEL
    Ahmad-Azami, Nor Idayu
    Yusoff, Nooraini
    Ku-Mahamud, Ku Ruhana
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2018, 17 (02): : 167 - 189
  • [46] IDD: A supervised interval distance-based method for discretization
    Ruiz, Francisco J.
    Angulo, Cecilio
    Agell, Nuria
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (09) : 1230 - 1238
  • [47] A non-parametric semi-supervised discretization method
    Alexis Bondu
    Marc Boullé
    Vincent Lemaire
    Knowledge and Information Systems, 2010, 24 : 35 - 57
  • [48] A Non-parametric Semi-supervised Discretization Method
    Bondu, A.
    Boulle, M.
    Lemaire, V
    Loiseau, S.
    Duval, B.
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 53 - +
  • [49] Application of an efficient Bayesian discretization method to biomedical data
    Jonathan L Lustgarten
    Shyam Visweswaran
    Vanathi Gopalakrishnan
    Gregory F Cooper
    BMC Bioinformatics, 12
  • [50] Inference in hybrid Bayesian networks using dynamic discretization
    Martin Neil
    Manesh Tailor
    David Marquez
    Statistics and Computing, 2007, 17 : 219 - 233