A Bayesian approach for supervised discretization

被引:0
|
作者
Boullé, M
机构
关键词
supervised learning; data preparation; discretization; Bayesianism;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In supervised machine learning, some algorithms are restricted to discrete data and thus need to discretize continuous attributes. In this paper, we present a new discretization method called MODL, based on a Bayesian approach. The MODL method relies on a model space of discretizations and on a prior distribution defined on this model space. This allows the setting up of an evaluation criterion of discretization, which is minimal for the most probable discretization given the data, i.e. the Bayes optimal discretization. We compare this approach with the MDL approach and statistical approaches used in other discretization methods, from a theoretical and experimental point of view. Extensive experiments show that the MODL method builds high quality discretizations.
引用
收藏
页码:199 / 208
页数:10
相关论文
共 50 条
  • [1] Optimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach
    Marc Boullé
    Advances in Data Analysis and Classification, 2009, 3 : 39 - 61
  • [2] Optimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach
    Boulle, Marc
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2009, 3 (01) : 39 - 61
  • [3] Multivariate supervised discretization, a neighborhood graph approach
    Muhlenbach, F
    Rakotomalala, R
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 314 - 321
  • [4] A Bayesian approach to modeling finite element discretization error
    Poot, Anne
    Kerfriden, Pierre
    Rocha, Iuri
    van der Meer, Frans
    STATISTICS AND COMPUTING, 2024, 34 (05)
  • [5] A Bayesian Hybrid Approach to Unsupervised Time Series Discretization
    Kameya, Yoshitaka
    Synnaeve, Gabriel
    Doncescu, Andrei
    Inoue, Katsumi
    Sato, Taisuke
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 342 - 349
  • [6] Supervised Discretization with GK - τ
    Huang, Wenxue
    Pan, Yuanyi
    Wu, Jianhong
    FIRST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2013, 17 : 114 - 120
  • [7] A SUPERVISED BAYESIAN APPROACH FOR SIMULTANEOUS SEGMENTATION AND CLASSIFICATION
    Zanotta, Daniel C.
    Ferreira, Matheus P.
    Zortea, Maciel
    Espinozal, Jean A.
    Shimabukuro, Yosio
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 2382 - 2384
  • [8] A greedy algorithm for supervised discretization
    Butterworth, R
    Simovici, DA
    Santos, GS
    Ohno-Machado, L
    JOURNAL OF BIOMEDICAL INFORMATICS, 2004, 37 (04) : 285 - 292
  • [9] Supervised Discretization for optimal prediction
    1ST INTERNATIONAL CONFERENCE ON DATA SCIENCE, ICDS 2014, 2014, 30 : 75 - 80
  • [10] Discretization of continuous predictor variables in Bayesian networks: An ecological threshold approach
    Lucena-Moya, Paloma
    Brawata, Renee
    Kath, Jarrod
    Harrison, Evan
    ElSawah, Sondoss
    Dyer, Fiona
    ENVIRONMENTAL MODELLING & SOFTWARE, 2015, 66 : 36 - 45