FTKHUIM: A Fast and Efficient Method for Mining Top-K High-Utility Itemsets

被引:8
|
作者
Vu, Vinh V. [1 ]
Lam, Mi T. H. [1 ]
Duong, Thuy T. M. [1 ]
Manh, Ly T. [1 ]
Nguyen, Thuy T. T. [1 ]
Nguyen, Le V. [1 ]
Yun, Unil [2 ]
Snasel, Vaclav [3 ]
Vo, Bay [4 ]
机构
[1] Ho Chi Minh City Univ Ind & Trade, Fac Informat Technol, Ho Chi Minh 700000, Vietnam
[2] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea
[3] VSB Tech Univ Ostrava, Fac Elect Engn & Comp Sci, Ostrava 70800, Czech Republic
[4] HUTECH Univ, Fac Informat Technol, Ho Chi Minh 700000, Vietnam
关键词
Knowledge data discovery; high-utility itemset; top-k HUIM; threshold-raising strategy; ALGORITHMS;
D O I
10.1109/ACCESS.2023.3314984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-utility itemset mining (HUIM) is an important task in the field of knowledge data discovery. The large search space and huge number of HUIs are the consequences of applying HUIM algorithms with an inappropriate user-defined minimum utility threshold value. Determining a suitable threshold value to obtain the expected results is not a simple task and requires spending a lot of time. For common users, it is difficult to define a minimum threshold utility for exploring the right number of HUIs. On the one hand, if the threshold is set too high then the number of HUIs would not be enough. On the other hand, if the threshold is set too low, too many HUIs will be mined, thus wasting both time and memory. The top-k HUIs mining problem was proposed to solve this issue, and many effective algorithms have since been introduced by researchers. In this research, a novel approach, namely FTKHUIM (Fast top-k HUI Mining), is introduced to explore the top-k HUIs. One new threshold-raising strategy called RTU, a transaction utility (TU)-based threshold-raising strategy, has also been shown to rapidly increase the speed of top-k HUIM. The study also proposes a global structure to store utility values in the process of applying raising-threshold strategies to optimize these strategies. The results of experiments on various datasets prove that the FTKHUIM algorithm achieves better results with regard to both the time and search space needed.
引用
收藏
页码:104789 / 104805
页数:17
相关论文
共 50 条
  • [31] An Efficient Algorithm for Mining High-Utility Itemsets with Discount Notion
    Bansal, Ruchita
    Dawar, Siddharth
    Goyal, Vikram
    BIG DATA ANALYTICS, BDA 2015, 2015, 9498 : 84 - 98
  • [32] Efficient algorithms for mining high-utility itemsets in uncertain databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    KNOWLEDGE-BASED SYSTEMS, 2016, 96 : 171 - 187
  • [33] Efficient algorithms of mining top-k frequent closed itemsets
    Lan Yongjie
    Qiu Yong
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 551 - 554
  • [34] TKU-PSO: An Efficient Particle Swarm Optimization Model for Top-K High-Utility Itemset Mining
    Carstensen, Simen
    Lin, Jerry Chun-Wei
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024,
  • [35] Efficient incremental mining of top-K frequent closed itemsets
    Pietracaprina, Andrea
    Vandin, Fabio
    DISCOVERY SCIENCE, PROCEEDINGS, 2007, 4755 : 275 - +
  • [36] Top-k high average-utility itemsets mining with effective pruning strategies
    Ronghui Wu
    Zhan He
    Applied Intelligence, 2018, 48 : 3429 - 3445
  • [37] Top-k high average-utility itemsets mining with effective pruning strategies
    Wu, Ronghui
    He, Zhan
    APPLIED INTELLIGENCE, 2018, 48 (10) : 3429 - 3445
  • [38] An efficient algorithm for mining the top-k high utility itemsets, using novel threshold raising and pruning strategies
    Quang-Huy Duong
    Liao, Bo
    Fournier-Viger, Philippe
    Thu-Lan Dam
    KNOWLEDGE-BASED SYSTEMS, 2016, 104 : 106 - 122
  • [39] An efficient method for mining High-Utility itemsets from unstable negative profit databases
    Tung, N. T.
    Nguyen, Trinh D. D.
    Nguyen, Loan T. T.
    Vo, Bay
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [40] Fast algorithms for mining high-utility itemsets with various discount strategies
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    ADVANCED ENGINEERING INFORMATICS, 2016, 30 (02) : 109 - 126