Discovering frequent itemsets in the presence of highly frequent items

被引:0
|
作者
Groth, DP [1 ]
Robertson, EL [1 ]
机构
[1] Indiana Univ, Sch Informat, Bloomington, IN 47405 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents new techniques for focusing the discovery of frequent itemsets within large, dense datasets containing highly frequent items. The existence of highly frequent items adds significantly to the cost of computing the complete set of frequent itemsets. Our approach allows for the exclusion of such items during-the candidate generation phase of the Apriori algorithm. Afterwards, the highly frequent items can be reintroduced, via an inferencing framework, providing for a capability to generate frequent itemsets without counting their frequency. We demonstrate the use of these new techniques within the well-studied framework of the Apriori algorithm. Furthermore, we provide empirical results using our techniques on both synthetic and real datasets - both relevant since the real datasets exhibit statistical characteristics different from the probabilistic assumptions behind the synthetic data. The source we used for real data was the U.S. Census.
引用
收藏
页码:251 / 264
页数:14
相关论文
共 50 条
  • [21] Associating absent frequent itemsets with infrequent items to identify abnormal transactions
    Li-Jen Kao
    Yo-Ping Huang
    Frode Eika Sandnes
    [J]. Applied Intelligence, 2015, 42 : 694 - 706
  • [22] Discovering Frequent Itemsets Over Event Logs Using ECLAT Algorithm
    Sundeep, A. S.
    Veena, G. S.
    [J]. INTERNATIONAL PROCEEDINGS ON ADVANCES IN SOFT COMPUTING, INTELLIGENT SYSTEMS AND APPLICATIONS, ASISA 2016, 2018, 628 : 63 - 69
  • [23] Associating absent frequent itemsets with infrequent items to identify abnormal transactions
    Kao, Li-Jen
    Huang, Yo-Ping
    Sandnes, Frode Eika
    [J]. APPLIED INTELLIGENCE, 2015, 42 (04) : 694 - 706
  • [24] Axiomatization of frequent itemsets
    Calders, T
    Paredaens, J
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 290 (01) : 669 - 693
  • [25] Efficient algorithms for deriving complete frequent itemsets from frequent closed itemsets
    Wu, Cheng-Wei
    Huang, JianTao
    Lin, Yun-Wei
    Chuang, Chien-Yu
    Tseng, Yu-Chee
    [J]. APPLIED INTELLIGENCE, 2022, 52 (06) : 7002 - 7023
  • [26] Efficient algorithms for deriving complete frequent itemsets from frequent closed itemsets
    Cheng-Wei Wu
    JianTao Huang
    Yun-Wei Lin
    Chien-Yu Chuang
    Yu-Chee Tseng
    [J]. Applied Intelligence, 2022, 52 : 7002 - 7023
  • [27] Mining frequent closed itemsets with the frequent pattern list
    Tseng, FC
    Hsu, CC
    Chen, H
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 653 - 654
  • [28] Generating Closed Frequent Itemsets with the Frequent Pattern List
    Li, Qin
    Chang, Sheng
    [J]. 2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [29] Mining maximal frequent itemsets with frequent pattern list
    Qian, Jin
    Ye, Feiyue
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 628 - 632
  • [30] Frequent and Non-frequent Sequential Itemsets Detection
    Xylogiannopoulos, Konstantinos F.
    Karampelas, Panagiotis
    Alhajj, Reda
    [J]. FROM SOCIAL DATA MINING AND ANALYSIS TO PREDICTION AND COMMUNITY DETECTION, 2017, : 211 - 238