Towards a framework for large itemsets generation in association rules mining

Authors
Fortier, PJ [1 ]
Mukhin, D [1 ]
Sánchez-Ruíz, AJS [1 ]
Affiliations
[1] Univ Massachusetts Dartmouth, ECE Dept, N Dartmouth, MA 02747 USA
Keywords
itemset generation; association rules mining; data mining; generic programming; frameworks; skip lists
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The problem of computing all association rules for a prescribed minimum support and confidence, by analyzing a large database of transactions, can be decomposed into two building blocks: finding all the large itemsets, and using these to generate the rules. In this paper we present a framework for the former, expressed as a generic algorithm that operates upon function objects and containers with their associated iterators. We show how the framework can be instantiated to produce implementations of Apriori and Partition, two well-known approaches to large itemset generation. We also show performance curves for implementations obtained by instantiating the framework for Apriori with two different backing data structures, namely hash trees and skip lists, the latter being a novel implementation in its own right. Our framework has been implemented in C++ using the generic programming techniques popularized by the Standard Template Library (STL).
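The abstract does not show the framework's actual interface, so the following is only a minimal sketch of the general idea: a level-wise driver written as a function template over transaction iterators and two function objects. All names here (`large_itemsets`, `AprioriGen`, `ScanCounter`) are hypothetical and not taken from the paper, and a plain linear scan stands in for the hash-tree and skip-list backing structures the paper evaluates.

```cpp
#include <algorithm>
#include <cstddef>
#include <set>
#include <vector>

using Item = int;
using Itemset = std::set<Item>;      // items kept in sorted order
using Transaction = std::set<Item>;

// Generic level-wise driver (hypothetical interface, not the paper's).
// `gen` maps the large k-itemsets to candidate (k+1)-itemsets;
// `count` returns the support of one candidate over [first, last).
template <typename TxIter, typename CandidateGen, typename SupportCounter>
std::vector<Itemset> large_itemsets(TxIter first, TxIter last,
                                    std::size_t min_support,
                                    CandidateGen gen, SupportCounter count) {
    // Level 1 candidates: every distinct item seen in the transactions.
    std::set<Item> items;
    for (TxIter t = first; t != last; ++t) items.insert(t->begin(), t->end());
    std::vector<Itemset> candidates;
    for (Item i : items) candidates.push_back(Itemset{i});

    std::vector<Itemset> result;
    while (!candidates.empty()) {
        std::vector<Itemset> level;  // candidates that meet min_support
        for (const Itemset& c : candidates)
            if (count(first, last, c) >= min_support) level.push_back(c);
        result.insert(result.end(), level.begin(), level.end());
        candidates = gen(level);     // empty once no larger itemset can exist
    }
    return result;
}

// Apriori-style candidate generation: join pairs of large k-itemsets that
// differ in one item, then prune any candidate with a non-large k-subset.
struct AprioriGen {
    std::vector<Itemset> operator()(const std::vector<Itemset>& level) const {
        const std::set<Itemset> known(level.begin(), level.end());
        std::set<Itemset> out;       // a set removes duplicate candidates
        for (std::size_t a = 0; a < level.size(); ++a) {
            for (std::size_t b = a + 1; b < level.size(); ++b) {
                Itemset u = level[a];
                u.insert(level[b].begin(), level[b].end());
                if (u.size() != level[a].size() + 1) continue;
                bool all_subsets_large = true;
                for (Item x : u) {
                    Itemset sub = u;
                    sub.erase(x);
                    if (known.count(sub) == 0) { all_subsets_large = false; break; }
                }
                if (all_subsets_large) out.insert(u);
            }
        }
        return std::vector<Itemset>(out.begin(), out.end());
    }
};

// Support counting by linear scan (an assumption made for brevity).
struct ScanCounter {
    template <typename TxIter>
    std::size_t operator()(TxIter first, TxIter last, const Itemset& c) const {
        return static_cast<std::size_t>(
            std::count_if(first, last, [&c](const Transaction& t) {
                return std::includes(t.begin(), t.end(), c.begin(), c.end());
            }));
    }
};
```

Given a `std::vector<Transaction> db`, a call such as `large_itemsets(db.begin(), db.end(), 3, AprioriGen{}, ScanCounter{})` returns the itemsets contained in at least three transactions. In this sketch, swapping in a different counter or candidate generator changes the strategy without touching the driver, which is the kind of instantiation the abstract describes.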
Pages: 48-53
Page count: 6