A statistical framework for mining substitution rules

被引:0
|
作者
Wei-Guang Teng
Ming-Jyh Hsieh
Ming-Syan Chen
机构
[1] National Taiwan University,Department of Electrical Engineering
来源
关键词
Concrete itemset; Correlation analysis; Statistical significance; Substitution rule;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a new mining capability, called mining of substitution rules, is explored. A substitution refers to the choice made by a customer to replace the purchase of some items with that of others. The mining of substitution rules in a transaction database, the same as that of association rules, will lead to very valuable knowledge in various aspects, including market prediction, user behaviour analysis and decision support. The process of mining substitution rules can be decomposed into two procedures. The first procedure is to identify concrete itemsets among a large number of frequent itemsets, where a concrete itemset is a frequent itemset whose items are statistically dependent. The second procedure is then on the substitution rule generation. In this paper, we first derive theoretical properties for the model of substitution rule mining and devise a technique on the induction of positive itemset supports to improve the efficiency of support counting for negative itemsets. Then, in light of these properties, the SRM (substitution rule mining) algorithm is designed and implemented to discover the substitution rules efficiently while attaining good statistical significance. Empirical studies are performed to evaluate the performance of the SRM algorithm proposed. It is shown that the SRM algorithm not only has very good execution efficiency but also produces substitution rules of very high quality.
引用
收藏
页码:158 / 178
页数:20
相关论文
共 50 条
  • [1] A statistical framework for mining substitution rules
    Teng, WG
    Hsieh, MJ
    Chen, MS
    KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (02) : 158 - 178
  • [2] A framework for mining association rules
    Luo, J
    Rajasekaran, S
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 509 - 517
  • [3] Statistical mining of interesting association rules
    Christian H. Weiß
    Statistics and Computing, 2008, 18 : 185 - 194
  • [4] Statistical mining of interesting association rules
    Weiss, Christian H.
    STATISTICS AND COMPUTING, 2008, 18 (02) : 185 - 194
  • [5] On the mining of substitution rules for statistically dependent items
    Teng, WG
    Hsieh, MJ
    Chen, MS
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 442 - 449
  • [6] Association Rules Mining Based on Statistical Correlation
    Jian Hu
    Xiang Yang-Li
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11054 - 11057
  • [7] A statistical framework for biomedical literature mining
    Chung, Dongjun
    Lawson, Andrew
    Zheng, W. Jim
    STATISTICS IN MEDICINE, 2017, 36 (22) : 3461 - 3474
  • [8] An ordinal framework for data mining of fuzzy rules
    Lee, JWT
    NINTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2000), VOLS 1 AND 2, 2000, : 399 - 404
  • [9] TSD based framework for mining the induction rules
    Pandey, Subhash Chandra
    Nandi, Gora Chand
    JOURNAL OF COMPUTATIONAL SCIENCE, 2014, 5 (02) : 184 - 195
  • [10] A framework for mining association rules in data warehouses
    Tjioe, HC
    Taniar, D
    INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 159 - 165