MML inference of decision graphs with multi-way joins and dynamic attributes

被引:0
|
作者
Tan, PJ [1 ]
Dowe, DL [1 ]
机构
[1] Monash Univ, Sch Comp Sci & Software Engn, Clayton, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A decision tree is a comprehensible representation that has been widely used in many supervised machine learning domains. But decision trees have two notable problems - those of replication and fragmentation. One way of solving these problems is to introduce the notion of decision graphs - a generalization of the decision tree - which addresses the above problems by allowing for disjunctions, or joins. While various decision graph systems are available, all of these systems impose some forms of restriction on the proposed representations, often leading to either a new redundancy or the original redundancy not being removed. Tan and Dowe (2002) introduced an unrestricted representation called the decision graph with multi-way joins, which has improved representative power and is able to use training data with improved efficiency. In this paper, we resolve the problem of encoding internal repeated structures by introducing dynamic attributes in decision graphs. A refined search heuristic to infer these decision graphs with dynamic attributes using the Minimum Message Length (MML) principle (see Wallace and Boulton (1968), Wallace and aeeman (1987) and Wallace and Dowe (1999)) is also introduced. On both real-world and artificial data, and in terms of both "right"/ "wrong" classification accuracy and logarithm of probability "bit-costing" predictive accuracy (for binary and multinomial target attributes), our enhanced multi-way join decision graph program with dynamic attributes improves our Tan and Dowe (2002) multi-way join decision graph program, which in turn significantly outperforms both C4.5 and C5.0. The resultant graphs from the new decision graph scheme axe also more concise than both those from C4.5 and from C5.0. We also comment on logarithm of probability as a means of scoring (probabilistic) predictions.
引用
收藏
页码:269 / 281
页数:13
相关论文
共 50 条
  • [1] MML inference of decision graphs with multi-way joins
    Tan, PJ
    Dowe, DL
    [J]. AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 131 - 142
  • [2] Building multi-way decision trees with numerical attributes
    Berzal, F
    Cubero, JC
    Marín, N
    Sánchez, D
    [J]. INFORMATION SCIENCES, 2004, 165 (1-2) : 73 - 90
  • [3] Are Multi-way Joins Actually Useful?
    Henderson, Michael
    Lawrence, Ramon
    [J]. ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 13 - 22
  • [4] Accelerating multi-way joins on the GPU
    Zhuohang Lai
    Xibo Sun
    Qiong Luo
    Xiaolong Xie
    [J]. The VLDB Journal, 2022, 31 : 529 - 553
  • [5] Accelerating multi-way joins on the GPU
    Lai, Zhuohang
    Sun, Xibo
    Luo, Qiong
    Xie, Xiaolong
    [J]. VLDB JOURNAL, 2022, 31 (03): : 529 - 553
  • [6] Optimizing Multiple Multi-Way Stream Joins
    Dossinger, Manuel
    Michel, Sebastian
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1985 - 1990
  • [7] On multi-way spatial joins with direction predicates
    Zhu, HJ
    Su, JW
    Ibarra, OH
    [J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2001, 2121 : 217 - 235
  • [8] Native Execution of GraphQL Queries over RDF Graphs Using Multi-Way Joins
    Karalis, Nikolaos
    Bigerl, Alexander
    Ngonga Ngomo, Axel-Cyrille
    [J]. KNOWLEDGE GRAPHS: SEMANTICS, MACHINE LEARNING, AND LANGUAGES, 2023, 56 : 77 - 93
  • [9] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    [J]. DATA & KNOWLEDGE ENGINEERING, 1999, 29 (02) : 179 - 200
  • [10] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    [J]. DATA & KNOWLEDGE ENGINEERING, 1998, 28 (03) : 277 - 298