An Efficient Approach for Mining Frequent Item sets with Transaction Deletion Operation

被引:0
|
作者
Bay Vo [1 ,2 ]
Thien-Phuong Le [3 ]
Tzung-Pei Hong [4 ]
Bac Le [5 ]
Jung, Jason [6 ]
机构
[1] Ton Duc Thang Univ, Dept Data Sci, Ho Chi Minh, Vietnam
[2] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh, Vietnam
[3] Pacific Ocean Univ, Fac Technol, Nha Trang, Khanh Hoa Provi, Vietnam
[4] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung, Taiwan
[5] Univ Sci, Dept Comp Sci, Ho Chi Minh City, Vietnam
[6] Chung Ang Univ, Dept Comp Engn, Seoul, South Korea
关键词
Data mining; frequent item sets; incremental mining; pre-large item sets; item set-tidset tree; PATTERN TREES; MAINTENANCE; ALGORITHMS; TRIE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deletion of transactions in databases is common in real-world applications. Developing an efficient and effective mining algorithm to maintain discovered information is thus quite important in data mining fields. A lot of algorithms have been proposed in recent years, and the best of them is the pre-large-tree-based algorithm. However, this algorithm only rebuilds the final pre-large tree every deleted transactions. After that, the FP-growth algorithm is applied for mining all frequent item sets. The pre-large-tree-based approach requires twice the computation time needed for a single procedure. In this paper, we present an incremental mining algorithm to solve above issues. An itemset tidset-tree structure will be used to maintain large and pre-large item sets. The proposed algorithm only processes deleted transactions for updating some nodes in this tree, and all frequent item sets are directly derived from the tree traversal process. Experimental results show that the proposed algorithm has good performance.
引用
收藏
页码:595 / 602
页数:8
相关论文
共 50 条
  • [1] Efficient Mining of Frequent Item Sets on Large Uncertain Databases
    Wang, Liang
    Cheung, David Wai-Lok
    Cheng, Reynold
    Lee, Sau Dan
    Yang, Xuan S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (12) : 2170 - 2183
  • [2] MAINTENANCE ALGORITHM FOR UPDATING THE DISCOVERED MULTIPLE FUZZY FREQUENT ITEM SETS FOR TRANSACTION DELETION
    Lin, Chun-Wei
    Wu, Tsu-Yang
    Lin, Guo
    Hong, Tzung-Pei
    [J]. PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2014, : 475 - 480
  • [3] Energy efficient in-sensor data cleaning for mining frequent item sets
    Bahi, Jacques M.
    Makhoul, Abdallah
    Medlej, Maguy
    [J]. Sensors and Transducers, 2012, 14 (SPEC. 2): : 64 - 78
  • [4] An Efficient Count Based Transaction Reduction Approach For Mining Frequent Patterns
    Vijayalakshmi, V.
    Pethalakshmi, A.
    [J]. GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 52 - 61
  • [5] Mining Maximum frequent item sets over data streams using Transaction Sliding Window Techniques
    Dhull, Annuradha
    Yadav, Neeraj
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2014, 14 (02): : 85 - 90
  • [6] An Efficient Method for Mining Frequent Weighted Closed Itemsets from Weighted Item Transaction Databases
    Bay Vo
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2017, 33 (01) : 199 - 216
  • [7] On Frequent Item Sets Acquisition Based on the Set Operations of Item-Transaction List
    Zeng, Chuanhua
    Jin, Ai
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 172 - +
  • [8] A vertical format algorithm for mining frequent item sets
    Guo Yi-ming
    Wang Zhi-jun
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 11 - 13
  • [9] A new mining algorithm based on frequent item sets
    Wen Yun
    [J]. FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 410 - 413
  • [10] Bayesian classifier based on frequent item sets mining
    Xu, Junming
    Jiang, Yuan
    Zhou, Zhihua
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2007, 44 (08): : 1293 - 1300