AN INFORMATION THEORETIC APPROACH TO RULE INDUCTION FROM DATABASES

被引：171

作者：

SMYTH, P ^{[1
]}

GOODMAN, RM ^{[1
]}

机构：

[1] CALTECH, DEPT ELECT ENGN, PASADENA, CA 91125 USA

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 1992年 / 4卷 / 04期

关键词：

CROSS ENTROPY; EXPERT SYSTEMS; INFORMATION THEORY; MACHINE LEARNING; KNOWLEDGE ACQUISITION; KNOWLEDGE DISCOVERY; RULE-BASED SYSTEMS; RULE INDUCTION;

D O I：

10.1109/69.149926

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The knowledge acquisition bottleneck in obtaining rules directly from an expert is well known. Hence, the problem of automated rule acquisition from data is a well-motivated one, particularly for domains where a database of sample data exists. In this paper we introduce a novel algorithm for the induction of rules from examples. The algorithm is novel in the sense that it not only learns rules for a given concept (classification), but it simultaneously learns rules relating multiple concepts. This type of learning, known as generalized rule induction is considerably more general than existing algorithms which tend to be classification oriented. Initially we focus on the problem of determining a quantitative, well-defined rule preference measure. In particular, we propose a quantity called the J-measure as an information theoretic alternative to existing approaches. The J-measure quantifies the information content of a rule or a hypothesis. We will outline the information theoretic origins of this measure and examine its plausibility as a hypothesis preference measure. We then define the ITRULE algorithm which uses the newly proposed measure to learn a set of optimal rules from a set of data samples, and we conclude the paper with an analysis of experimental results on real-world data.

引用

页码：301 / 316

页数：16

共 50 条

[1] Parallel Rule Induction with Information Theoretic Pre-Pruning
Stahl, Frederic
Bramer, Max
Adda, Mo
RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 151 - 164
[2] A DOMAIN THEORETIC APPROACH TO INCOMPLETE INFORMATION IN NESTED RELATIONAL DATABASES
LEVENE, M
LOIZOU, G
LECTURE NOTES IN COMPUTER SCIENCE, 1989, 367 : 439 - 456
[3] An information-theoretic approach to quantitative association rule mining
Ke, Yiping
Cheng, James
Ng, Wilfred
KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (02) : 213 - 244
[4] An information-theoretic approach to quantitative association rule mining
Yiping Ke
James Cheng
Wilfred Ng
Knowledge and Information Systems, 2008, 16 : 213 - 244
[5] A global rule induction approach to information extraction
Xiao, J
Chua, TS
Liu, JM
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 530 - 536
[6] Utility-Privacy Tradeoffs in Databases: An Information-Theoretic Approach
Sankar, Lalitha
Rajagopalan, S. Raj
Poor, H. Vincent
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2013, 8 (06) : 838 - 852
[7] A Rule-Induction Approach for Building an Arabic Language Interfaces to Databases
Bais, Hanane
Machkour, Mustapha
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (01) : 49 - 56
[8] Hierarchical fuzzy rule based systems using an information theoretic approach
Waldock, Antony
Carse, Brian
Melhuish, Chris
SOFT COMPUTING, 2006, 10 (10) : 867 - 879
[9] Hierarchical fuzzy rule based systems using an information theoretic approach
Antony Waldock
Brian Carse
Chris Melhuish
Soft Computing, 2006, 10 : 867 - 879
[10] The transition from authoritarian rule - A game theoretic approach
Sutter, D
JOURNAL OF THEORETICAL POLITICS, 2000, 12 (01) : 67 - 89

← 1 2 3 4 5 →