A novel algorithm for mining couples of enhanced association rules based on the number of output couples and its application

被引:0
|
作者
Petr Máša
Jan Rauch
机构
[1] Prague University of Economics and Business,
关键词
Python; GUHA method; Enhanced association rules; Subgroup discovery; CleverMiner;
D O I
暂无
中图分类号
学科分类号
摘要
Besides the need for more advanced predictive methods, there is increasing demand for easily interpretable results. Couples of enhanced association rules (a generalization of association rules/apriori/frequent itemsets) are excellent candidates for this task. They can be interpreted in various ways, subgroup discovery being an example. A typical result in rule mining is that there are too low or too many rules in the resulting ruleset. Analysts must usually iterate 5–15 times to get a reasonable number of rules. Inspired by research in a similar area of frequent itemsets to simplify input and parameter-free frequent itemsets, we have proposed a novel algorithm that finds rules based not on parameters like support and confidence but the best rules by a given range of required rule count in output. We propose this algorithm for couples of rules – SD4ft-Miner procedure and benefits from a brand new implementation of methods of mechanizing hypothesis formation in Python called Cleverminer that allows easy implementation of this algorithm. We have verified the algorithm by several applications on eight public data sets. Our original case was a case study, and it was also the reason why we developed the algorithm. However, implementation is in Python, and the algorithm itself can be used on a broader class of methods in any language. The algorithm iterates quickly, in all experiments we needed a maximum of 10 iterations. Possible enhancements to this algorithm are also outlined.
引用
收藏
页码:431 / 458
页数:27
相关论文
共 50 条
  • [41] The algorithm of objective association rules mining based on binary
    Fang, Gang
    Wei, Zu-Kuan
    Yin, Qian
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 214 - +
  • [42] A new ontology based association rules mining algorithm
    Zhu, Peng
    Jia, Fei
    Journal of Theoretical and Applied Information Technology, 2012, 45 (01): : 192 - 197
  • [43] A Novel Efficient Mining Association Rules Algorithm for Distributed Databases
    Shen, Liangzhong
    PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 50 - 56
  • [44] Comparative study on the algorithm for mining association rules based on Data Mining
    Guo, Jia
    Ren, Jing-yi
    Zhang, Yu-jing
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, 2015, 17 : 44 - 48
  • [45] Research On Novel Model of Data Mining Based on Improved Association Rules and Clustering Algorithm
    Tan, Qing
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY (EMCS 2017), 2017, 61 : 522 - 526
  • [46] Extension of Local Association Rules Mining Algorithm Based on Apriori Algorithm
    Zhang Chun-sheng
    Li Yan
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 340 - 343
  • [47] An Effective Algorithm Based on Association Graph and Matrix for Mining Association Rules
    Pan, Haiwei
    Tan, Xiaolei
    Han, Qilong
    Yin, Guisheng
    2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [48] Data Mining Technique and Application Based on Association Rules
    Li, Tong
    Cheng, Yuepeng
    Liu, Yuli
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 304 - 306
  • [49] Parallelism of association rules mining and its application in insurance operations
    Tian, JL
    Zhu, L
    Zhang, SQ
    Huang, G
    COMPUTATIONAL SCIENCE - ICCS 2004, PROCEEDINGS, 2004, 3039 : 907 - 914
  • [50] Mining model of fuzzy association rules and its application in calciner
    Wang Jie
    Dang Qinhua
    Wu Zhenjie
    ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 904 - 907