Targeted mining of top-k high utility itemsets

被引:3
|
作者
Huang, Shan [1 ]
Gan, Wensheng [1 ,2 ]
Miao, Jinbao [1 ]
Han, Xuming [3 ]
Fournier-Viger, Philippe [4 ]
机构
[1] Jinan Univ, Coll Cyber Secur, Guangzhou 510632, Peoples R China
[2] Pazhou Lab, Guangzhou 510330, Peoples R China
[3] Jinan Univ, Coll Informat Sci & Technol, Guangzhou 510632, Peoples R China
[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Utility mining; Target itemset; Targeted mining; Top-k; EFFICIENT ALGORITHMS; FREQUENT PATTERNS;
D O I
10.1016/j.engappai.2023.107047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding high-importance patterns in data is an emerging data mining task known as High-utility itemset mining (HUIM). Given a minimum utility threshold, a HUIM algorithm extracts all the high-utility itemsets (HUIs) whose utility values are not less than the threshold. This can reveal a wealth of useful information, but the precise needs of users are not well taken into account. In particular, users often want to focus on patterns that have some specific items rather than find all patterns. To overcome that difficulty, targeted mining has emerged, focusing on user preferences, but only preliminary work has been conducted. For example, the targeted high-utility itemset querying algorithm (TargetUM) was proposed, which uses a lexicographic tree to query itemsets containing a target pattern. However, selecting the minimum utility threshold is difficult when the user is not familiar with the processed database. As a solution, this paper formulates the task of targeted mining of the top -k high-utility itemsets and proposes an efficient algorithm called TMKU based on the TargetUM algorithm to discover the top -k target high-utility itemsets (top -k THUIs). At the same time, several pruning strategies are used to reduce memory consumption and execution time. Extensive experiments show that the proposed TMKU algorithm has good performance on real and synthetic datasets.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Mining top-k high average-utility itemsets based on breadth-first search
    Xuan Liu
    Genlang Chen
    Fangyu Wu
    Shiting Wen
    Wanli Zuo
    [J]. Applied Intelligence, 2023, 53 : 29319 - 29337
  • [22] Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies
    Singh, Kuldeep
    Biswas, Bhaskar
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [23] Mining top-k high average-utility itemsets based on breadth-first search
    Liu, Xuan
    Chen, Genlang
    Wu, Fangyu
    Wen, Shiting
    Zuo, Wanli
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 29319 - 29337
  • [24] An efficient algorithm for mining the top-k high utility itemsets, using novel threshold raising and pruning strategies
    Quang-Huy Duong
    Liao, Bo
    Fournier-Viger, Philippe
    Thu-Lan Dam
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 104 : 106 - 122
  • [25] Mining Top-K constrained cross-level high-utility itemsets over data streams
    Han, Meng
    Liu, Shujuan
    Gao, Zhihui
    Mu, Dongliang
    Li, Ang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (05) : 2885 - 2924
  • [26] Mining top-k high-utility itemsets from a data stream under sliding window model
    Siddharth Dawar
    Veronica Sharma
    Vikram Goyal
    [J]. Applied Intelligence, 2017, 47 : 1240 - 1255
  • [27] Mining Top-K constrained cross-level high-utility itemsets over data streams
    Meng Han
    Shujuan Liu
    Zhihui Gao
    Dongliang Mu
    Ang Li
    [J]. Knowledge and Information Systems, 2024, 66 : 2885 - 2924
  • [28] Mining top-k high-utility itemsets from a data stream under sliding window model
    Dawar, Siddharth
    Sharma, Veronica
    Goyal, Vikram
    [J]. APPLIED INTELLIGENCE, 2017, 47 (04) : 1240 - 1255
  • [29] Efficiently Mining Top-K High Utility Sequential Patterns
    Yin, Junfu
    Zheng, Zhigang
    Cao, Longbing
    Song, Yin
    Wei, Wei
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1259 - 1264
  • [30] TKUS: Mining top-k high utility sequential patterns
    Zhang, Chunkai
    Du, Zilin
    Gan, Wensheng
    Yu, Philip S.
    [J]. INFORMATION SCIENCES, 2021, 570 : 342 - 359