MML inference of decision graphs with multi-way joins and dynamic attributes

被引：0

作者：

Tan, PJ ^{[1
]}

Dowe, DL ^{[1
]}

机构：

[1] Monash Univ, Sch Comp Sci & Software Engn, Clayton, Vic 3800, Australia

来源：

AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2003年 / 2903卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A decision tree is a comprehensible representation that has been widely used in many supervised machine learning domains. But decision trees have two notable problems - those of replication and fragmentation. One way of solving these problems is to introduce the notion of decision graphs - a generalization of the decision tree - which addresses the above problems by allowing for disjunctions, or joins. While various decision graph systems are available, all of these systems impose some forms of restriction on the proposed representations, often leading to either a new redundancy or the original redundancy not being removed. Tan and Dowe (2002) introduced an unrestricted representation called the decision graph with multi-way joins, which has improved representative power and is able to use training data with improved efficiency. In this paper, we resolve the problem of encoding internal repeated structures by introducing dynamic attributes in decision graphs. A refined search heuristic to infer these decision graphs with dynamic attributes using the Minimum Message Length (MML) principle (see Wallace and Boulton (1968), Wallace and aeeman (1987) and Wallace and Dowe (1999)) is also introduced. On both real-world and artificial data, and in terms of both "right"/ "wrong" classification accuracy and logarithm of probability "bit-costing" predictive accuracy (for binary and multinomial target attributes), our enhanced multi-way join decision graph program with dynamic attributes improves our Tan and Dowe (2002) multi-way join decision graph program, which in turn significantly outperforms both C4.5 and C5.0. The resultant graphs from the new decision graph scheme axe also more concise than both those from C4.5 and from C5.0. We also comment on logarithm of probability as a means of scoring (probabilistic) predictions.

引用

页码：269 / 281

页数：13

共 50 条

[1] MML inference of decision graphs with multi-way joins
Tan, PJ
Dowe, DL
[J]. AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 131 - 142
[2] Building multi-way decision trees with numerical attributes
Berzal, F
Cubero, JC
Marín, N
Sánchez, D
[J]. INFORMATION SCIENCES, 2004, 165 (1-2) : 73 - 90
[3] Are Multi-way Joins Actually Useful?
Henderson, Michael
Lawrence, Ramon
[J]. ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 13 - 22
[4] Accelerating multi-way joins on the GPU
Zhuohang Lai
Xibo Sun
Qiong Luo
Xiaolong Xie
[J]. The VLDB Journal, 2022, 31 : 529 - 553
[5] Accelerating multi-way joins on the GPU
Lai, Zhuohang
Sun, Xibo
Luo, Qiong
Xie, Xiaolong
[J]. VLDB JOURNAL, 2022, 31 (03): : 529 - 553
[6] Optimizing Multiple Multi-Way Stream Joins
Dossinger, Manuel
Michel, Sebastian
[J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1985 - 1990
[7] On multi-way spatial joins with direction predicates
Zhu, HJ
Su, JW
Ibarra, OH
[J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2001, 2121 : 217 - 235
[8] Native Execution of GraphQL Queries over RDF Graphs Using Multi-Way Joins
Karalis, Nikolaos
Bigerl, Alexander
Ngonga Ngomo, Axel-Cyrille
[J]. KNOWLEDGE GRAPHS: SEMANTICS, MACHINE LEARNING, AND LANGUAGES, 2023, 56 : 77 - 93
[9] Faster joins, self-joins and multi-way joins using join indices
Lei, H
Ross, KA
[J]. DATA & KNOWLEDGE ENGINEERING, 1999, 29 (02) : 179 - 200
[10] Faster joins, self-joins and multi-way joins using join indices
Lei, H
Ross, KA
[J]. DATA & KNOWLEDGE ENGINEERING, 1998, 28 (03) : 277 - 298

← 1 2 3 4 5 →