Combining Univariate and Multivariate Bottom-up Discretization

被引:0
|
作者
Sang, Yu [1 ]
Li, Keqiu [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
关键词
Discretization; Univariate and multivariate; Merging criterion; Stopping criterion; Significance of interval pair; Minimum Description Length Principle (MDLP); ALGORITHM; SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most inductive learning methods require that the training data set contains only discrete at which makes it necessary to discretize continuous numeric attributes. Current efforts mainly focus on discretizing data for Individual attributes, without taking into account the correlations among, attributes and the number of inconsistant records produced by discretizalion. In addition, existing methods focus only on one-dimensional problem without extensively considering the effect of interval size and class number on discretization schemes. In this paper, we present a method by combining univariate and multivariate bottom-up discretization that employs a novel merging and stopping criterion. First, we present a new merging criterion based on both univariate and multivariate measurement, which synthetically evaluates the variance among the adjacent interval pairs to find the best merge and effectively captures the correlations among the continuous attributes. This is achieved by using the Minimum Description Length Principle and developing a measurement of significance of interval pair among attributes. The advantage of our proposed merging criterion is further analyzed. Second, we present a new stopping criterion with the aim to control the degree of misclassification while maximizing the merging accuracy. Moreover, we develop an algorithm to find the best cliscretization based on the new merging and stopping criterion. Detailed analysis shows that the proposed method brings higher accuracy to the discretization process. Finally, empirical experiments on IS real data sets show that our method generates a better discretization scheme that significantly improves the accuracy of classification than existing methods by using popular learning systems such as C4.5 decision tree.
引用
收藏
页码:161 / 187
页数:27
相关论文
共 50 条
  • [1] Combining bottom-up and top-down
    Boehringer, Christoph
    Rutherford, Thomas F.
    [J]. ENERGY ECONOMICS, 2008, 30 (02) : 574 - 596
  • [2] On Combining Top-down and Bottom-up Strategies in Reading
    张荣
    [J]. 读与写(教育教学刊), 2010, 7 (09) : 5 - 7
  • [3] Combining bottom-up and top-down attentional influences
    Navalpakkam, Vidhya
    Itti, Laurent
    [J]. HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
  • [4] An attentional system combining top-down and bottom-up influences
    Rasolzadeh, Babak
    Targhi, Alireza Tavakoli
    Eklundh, Jan-Olof
    [J]. ATTENTION IN COGNITIVE SYSTEMS: THEORIES AND SYSTEMS FROM AN INTERDISCIPLINARY VIEWPOINT, 2007, 4840 : 123 - 140
  • [5] Combining Top-Down and Bottom-Up Techniques in Program Derivation
    Chaudhari, Dipak L.
    Damani, Om
    [J]. LOGIC-BASED PROGRAM SYNTHESIS AND TRANSFORMATION (LOPSTR 2015), 2015, 9527 : 244 - 258
  • [6] Bottom-up excitonics
    Aspuru-Guzik, Alan
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [7] Bottom-Up Management
    Gordon, Paul J.
    [J]. INDUSTRIAL & LABOR RELATIONS REVIEW, 1950, 3 (04): : 620 - 621
  • [8] Bottom-Up Management
    不详
    [J]. HUMAN ORGANIZATION, 1950, 9 (01) : 38 - 38
  • [9] A bottom-up review
    Standing, G
    [J]. FOREIGN POLICY, 2001, (122) : 8 - +
  • [10] BOTTOM-UP TESTING
    MEHTA, KD
    [J]. IEEE SOFTWARE, 1990, 7 (05) : 4 - 4