AN INFORMATION THEORETIC APPROACH TO RULE INDUCTION FROM DATABASES

被引:171
|
作者
SMYTH, P [1 ]
GOODMAN, RM [1 ]
机构
[1] CALTECH, DEPT ELECT ENGN, PASADENA, CA 91125 USA
关键词
CROSS ENTROPY; EXPERT SYSTEMS; INFORMATION THEORY; MACHINE LEARNING; KNOWLEDGE ACQUISITION; KNOWLEDGE DISCOVERY; RULE-BASED SYSTEMS; RULE INDUCTION;
D O I
10.1109/69.149926
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The knowledge acquisition bottleneck in obtaining rules directly from an expert is well known. Hence, the problem of automated rule acquisition from data is a well-motivated one, particularly for domains where a database of sample data exists. In this paper we introduce a novel algorithm for the induction of rules from examples. The algorithm is novel in the sense that it not only learns rules for a given concept (classification), but it simultaneously learns rules relating multiple concepts. This type of learning, known as generalized rule induction is considerably more general than existing algorithms which tend to be classification oriented. Initially we focus on the problem of determining a quantitative, well-defined rule preference measure. In particular, we propose a quantity called the J-measure as an information theoretic alternative to existing approaches. The J-measure quantifies the information content of a rule or a hypothesis. We will outline the information theoretic origins of this measure and examine its plausibility as a hypothesis preference measure. We then define the ITRULE algorithm which uses the newly proposed measure to learn a set of optimal rules from a set of data samples, and we conclude the paper with an analysis of experimental results on real-world data.
引用
收藏
页码:301 / 316
页数:16
相关论文
共 50 条
  • [11] Likelihood-Based Sampling from Databases for Rule Induction Methods
    Tsumoto, Shusaku
    Hirano, Shoji
    Abe, Hidenao
    ROUGH SET AND KNOWLEDGE TECHNOLOGY (RSKT), 2010, 6401 : 265 - 272
  • [12] SUPERVISING ISODATA WITH AN INFORMATION THEORETIC STOPPING RULE
    CARMAN, CS
    MERICKEL, MB
    PATTERN RECOGNITION, 1990, 23 (1-2) : 185 - 197
  • [13] Information theoretic limits of learning a sparse rule
    Luneau, Clement
    Macris, Nicolas
    Barbier, Jean
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [14] Information theoretic rule discovery in neural networks
    Kamimura, R
    Kamimura, T
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 2569 - 2574
  • [15] Information theoretic limits of learning a sparse rule
    Luneau, Clement
    Macris, Nicolas
    Barbier, Jean
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (04):
  • [16] An information theoretic analysis of architectures for multilevel secure databases
    Aisbett, JE
    DATABASE SECURITY VOLUME X - STATUS AND PROSPECTS, 1997, : 212 - 225
  • [17] A hybrid approach to rule discovery in databases
    Zhong, N
    Dong, JZ
    Ohsuga, S
    INFORMATION SCIENCES, 2000, 126 (1-4) : 99 - 127
  • [18] Information theoretic approach to information extraction
    Amati, Giambattista
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2006, 4027 : 519 - 529
  • [19] Rule Induction Based on Rough Sets from Possibilistic Information under Lipski's Approach
    Nakata, Michinori
    Sakai, Hiroshi
    2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 218 - 223
  • [20] Improved spliced alignment from an information theoretic approach
    Zhang, M
    Gish, W
    BIOINFORMATICS, 2006, 22 (01) : 13 - 20