AN INFORMATION THEORETIC APPROACH TO RULE INDUCTION FROM DATABASES

被引:171
|
作者
SMYTH, P [1 ]
GOODMAN, RM [1 ]
机构
[1] CALTECH, DEPT ELECT ENGN, PASADENA, CA 91125 USA
关键词
CROSS ENTROPY; EXPERT SYSTEMS; INFORMATION THEORY; MACHINE LEARNING; KNOWLEDGE ACQUISITION; KNOWLEDGE DISCOVERY; RULE-BASED SYSTEMS; RULE INDUCTION;
D O I
10.1109/69.149926
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The knowledge acquisition bottleneck in obtaining rules directly from an expert is well known. Hence, the problem of automated rule acquisition from data is a well-motivated one, particularly for domains where a database of sample data exists. In this paper we introduce a novel algorithm for the induction of rules from examples. The algorithm is novel in the sense that it not only learns rules for a given concept (classification), but it simultaneously learns rules relating multiple concepts. This type of learning, known as generalized rule induction is considerably more general than existing algorithms which tend to be classification oriented. Initially we focus on the problem of determining a quantitative, well-defined rule preference measure. In particular, we propose a quantity called the J-measure as an information theoretic alternative to existing approaches. The J-measure quantifies the information content of a rule or a hypothesis. We will outline the information theoretic origins of this measure and examine its plausibility as a hypothesis preference measure. We then define the ITRULE algorithm which uses the newly proposed measure to learn a set of optimal rules from a set of data samples, and we conclude the paper with an analysis of experimental results on real-world data.
引用
收藏
页码:301 / 316
页数:16
相关论文
共 50 条
  • [31] An information theoretic approach to processing management
    Kreucher, Chris
    Carter, Kevin
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1869 - 1872
  • [32] Information theoretic approach to the authentication of multimedia
    Martinian, E
    Chen, B
    Wornell, G
    SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS III, 2001, 4314 : 185 - 196
  • [33] An Information Theoretic Approach to Econometrics.
    Park, Byoung Gun
    JOURNAL OF ECONOMIC LITERATURE, 2013, 51 (03) : 886 - 888
  • [34] An information theoretic approach to image segmentation
    Baranwal, R
    Singh, R
    Bora, PK
    IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 218 - 222
  • [35] An information theoretic approach for privacy metrics
    Bezzi, Michele
    TRANSACTIONS ON DATA PRIVACY, 2010, 3 (03) : 199 - 215
  • [36] AN INFORMATION THEORETIC APPROACH TO REGULATION SYSTEMS
    SABOURIN, MG
    CAINES, PE
    PROCEEDINGS OF THE 22ND CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1 & 2, 1988, : 469 - 469
  • [37] An information theoretic approach to network tomography
    Cho, Wendy K. Tam
    Judge, George
    APPLIED ECONOMICS LETTERS, 2015, 22 (01) : 1 - 6
  • [38] Information theoretic approach to Bayesian inference
    Jewell, J
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2002, 617 : 433 - 448
  • [39] An information theoretic approach to sensor scheduling
    McIntyre, GA
    Hintz, KJ
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION V, 1996, 2755 : 304 - 312
  • [40] An Information Theoretic Approach to RF Fingerprinting
    Gungor, Onur
    Koksal, C. Emre
    El Gamal, Hesham
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 61 - 65