Demand-driven frequent itemset mining using pattern structures

Authors
Haixun Wang
Chang-Shing Perng
Sheng Ma
Philip S. Yu
Affiliation
[1] IBM T.J. Watson Research Center
Keywords
Association rule mining; Database integration; Data mining
Abstract
Frequent itemset mining aims to discover patterns whose support exceeds a given threshold. In many applications, including the network event management systems that motivated this work, patterns are composed of items, each described by a subset of the attributes of a relational table. Because the mining space is exponential, implementing user preferences and mining constraints efficiently becomes the first priority for a mining algorithm. User preferences and mining constraints are often expressed in terms of patterns' attribute structures. Unlike traditional methods, which mine all frequent patterns indiscriminately, we regard frequent itemset mining as a two-step process: mining the pattern structures, and then mining the patterns within each pattern structure. In this paper, we present a novel architecture that uses pattern structures to organize the mining space. Compared with previous techniques, our approach has two advantages: (i) by exploiting the interrelationships among pattern structures, it can reduce mining execution times significantly; and (ii) more importantly, it lets us incorporate simple, high-level user preferences and mining constraints into the mining process efficiently. Our experiments on both synthetic and real-life datasets demonstrate these advantages.
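The two-step view described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' architecture or algorithm: it is a toy under stated assumptions, restricted to single-item patterns, where a "structure" is simply the subset of attributes used to describe an item. The table, the attribute names (`host`, `type`), and every function name (`project`, `item_structures`, `mine_structure`, `mine`) are hypothetical. The `wanted` parameter stands in for a user preference over structures, and the pruning step uses one simple interrelationship: if a structure has no frequent item, no structure extending it by one attribute can have one either.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each transaction is a list of rows (events),
# and every row is described by the same relational attributes.
ATTRS = ("host", "type")
transactions = [
    [{"host": "a", "type": "down"}, {"host": "b", "type": "up"}],
    [{"host": "a", "type": "down"}, {"host": "c", "type": "up"}],
    [{"host": "a", "type": "up"}, {"host": "b", "type": "up"}],
]

def project(row, structure):
    """Describe a row by a subset of its attributes, yielding one item."""
    return tuple((attr, row[attr]) for attr in structure)

def item_structures(attrs):
    """Step 1: enumerate attribute subsets, smallest first."""
    return [c for r in range(1, len(attrs) + 1) for c in combinations(attrs, r)]

def mine_structure(transactions, structure, min_support):
    """Step 2: count single-item patterns within one structure."""
    counts = Counter()
    for rows in transactions:
        # Count each item at most once per transaction.
        counts.update({project(row, structure) for row in rows})
    return {item: n for item, n in counts.items() if n >= min_support}

def mine(transactions, attrs, min_support, wanted=None):
    """Mine structure by structure; `wanted` restricts which structures are mined."""
    result, empty = {}, set()
    for s in item_structures(attrs):
        if wanted is not None and s not in wanted:
            continue  # user preference: skip structures nobody asked for
        # Prune: a sub-structure already known to be empty rules this one out.
        if any(sub in empty for sub in combinations(s, len(s) - 1)):
            empty.add(s)
            continue
        frequent = mine_structure(transactions, s, min_support)
        if frequent:
            result[s] = frequent
        else:
            empty.add(s)
    return result

if __name__ == "__main__":
    for structure, patterns in mine(transactions, ATTRS, min_support=2).items():
        print(structure, patterns)
```

Passing `wanted={("type",)}` mines only the `type` structure, which is the sense in which a structure-level preference can cut work before any pattern is counted. Multi-item patterns, and structures over them, are deliberately left out of this sketch.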
Pages: 82-102
Page count: 20