An efficient algorithm for mining top-k on-shelf high utility itemsets

被引:0
|
作者
Thu-Lan Dam
Kenli Li
Philippe Fournier-Viger
Quang-Huy Duong
机构
[1] Hunan University,College of Computer Science and Electronic Engineering
[2] Hanoi University of Industry,Faculty of Information Technology
[3] National University of Defense Technology,CIC of HPC
[4] National Supercomputing Center in Changsha,School of Natural Sciences and Humanities, Harbin Institute of Technology
[5] Shenzhen Graduate School,Faculty of Information Technology, Mathematics and Electrical Engineering
[6] Norwegian University of Science and Technology,undefined
来源
关键词
Data mining; High utility mining; On-shelf high utility mining; Top-; on-shelf high utility mining;
D O I
暂无
中图分类号
学科分类号
摘要
High on-shelf utility itemset (HOU) mining is an emerging data mining task which consists of discovering sets of items generating a high profit in transaction databases. The task of HOU mining is more difficult than traditional high utility itemset (HUI) mining, because it also considers the shelf time of items, and items having negative unit profits. HOU mining can be used to discover more useful and interesting patterns in real-life applications than traditional HUI mining. Several algorithms have been proposed for this task. However, a major drawback of these algorithms is that it is difficult for users to find a suitable value for the minimum utility threshold parameter. If the threshold is set too high, not enough patterns are found. And if the threshold is set too low, too many patterns will be found and the algorithm may use an excessive amount of time and memory. To address this issue, we propose to address the problem of top-k on-shelf high utility itemset mining, where the user directly specifies k, the desired number of patterns to be output instead of specifying a minimum utility threshold value. An efficient algorithm named KOSHU (fast top-K on-shelf high utility itemset miner) is proposed to mine the top-k HOUs efficiently, while considering on-shelf time periods of items, and items having positive and/or negative unit profits. KOSHU introduces three novel strategies, named efficient estimated co-occurrence maximum period rate pruning, period utility pruning and concurrence existing of a pair 2-itemset pruning to reduce the search space. KOSHU also incorporates several novel optimizations and a faster method for constructing utility-lists. An extensive performance study on real-life and synthetic datasets shows that the proposed algorithm is efficient both in terms of runtime and memory consumption and has excellent scalability.
引用
收藏
页码:621 / 655
页数:34
相关论文
共 50 条
  • [1] An efficient algorithm for mining top-k on-shelf high utility itemsets
    Thu-Lan Dam
    Li, Kenli
    Fournier-Viger, Philippe
    Quang-Huy Duong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 52 (03) : 621 - 655
  • [2] Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies
    Singh, Kuldeep
    Biswas, Bhaskar
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [3] TKEH: an efficient algorithm for mining top-k high utility itemsets
    Kuldeep Singh
    Shashank Sheshar Singh
    Ajay Kumar
    Bhaskar Biswas
    [J]. Applied Intelligence, 2019, 49 : 1078 - 1097
  • [4] TKEH: an efficient algorithm for mining top-k high utility itemsets
    Singh, Kuldeep
    Singh, Shashank Sheshar
    Kumar, Ajay
    Biswas, Bhaskar
    [J]. APPLIED INTELLIGENCE, 2019, 49 (03) : 1078 - 1097
  • [5] Efficient Algorithms for Mining Top-K High Utility Itemsets
    Tseng, Vincent S.
    Wu, Cheng-Wei
    Fournier-Viger, Philippe
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 54 - 67
  • [6] Mining of top-k high utility itemsets with negative utility
    Sun, Rui
    Han, Meng
    Zhang, Chunyan
    Shen, Mingyao
    Du, Shiyu
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5637 - 5652
  • [7] Targeted mining of top-k high utility itemsets
    Huang, Shan
    Gan, Wensheng
    Miao, Jinbao
    Han, Xuming
    Fournier-Viger, Philippe
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [8] DEVELOPMENT OF AN EFFICIENT TECHNIQUE FOR MINING TOP-K CLOSED HIGH UTILITY ITEMSETS
    Velayudhan, Baby
    Sakthivel
    Subasree
    [J]. IIOAB JOURNAL, 2016, 7 (09) : 150 - 155
  • [9] Efficient mining of top-k high utility itemsets through genetic algorithms
    Luna, Jose Maria
    Kiran, Rage Uday
    Fournier-Viger, Philippe
    Ventura, Sebastian
    [J]. INFORMATION SCIENCES, 2023, 624 : 529 - 553
  • [10] Implementing a Hybrid of Efficient Algorithms For Mining Top-K High Utility Itemsets
    Rajendra, Ingle Mayur
    Vyas, Chaitanya
    Moghe, Sanika Sameer
    Deshmukh, Deepali
    Sakhare, Sachin
    Gonge, Sudhanshu
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,