A residual utility-based concept for high-utility itemset mining

Cited by: 0
Authors
Pushp Sra
Satish Chand
Affiliation
[1] Jawaharlal Nehru University,School of Computer and Systems Sciences
Keywords
High-utility itemset mining; Data mining; Knowledge discovery; Memory efficient mining
DOI
Not available
Abstract
Knowledge discovery in databases aims at finding useful information for decision-making. The problem of high-utility itemset mining (HUIM) has garnered considerable research attention, as it aims to find patterns in a database that conform to a user-defined utility function. The mined patterns are used for making data-backed decisions in fields such as healthcare, e-commerce, and web analytics. Various algorithms for mining high-utility itemsets exist in the literature; however, most of them require multiple database scans or deploy complex data structures. The utility-list is an efficient list-based data structure that is widely adopted in the design of HUIM algorithms. The existing utility-list-based algorithms, however, suffer from drawbacks such as extensive use of inefficient join operations and multiple definitions of the join operation. Although HUIM is an important research area, very little research has been directed towards improving the design of the data structures used in the mining process. In this paper, we introduce the concept of residual utility to design two new data structures, called residue-map and master-map. Using these two data structures, a new algorithm, called R-Miner, is introduced for mining high-utility itemsets. To further optimise the mining process, the cumulative utility value is used as an upper bound, and additional pruning conditions are also discussed. Several experiments are carried out on both real and synthetic datasets to compare the performance of R-Miner with existing list-based algorithms. The experimental results show that R-Miner improves performance by up to the order of 2 as compared to the list-based algorithms EFIM, H-Miner, HUI-Miner, FHM, and ULB-Miner.
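For readers unfamiliar with the problem setting, the following is a minimal sketch of the standard HUIM task in its usual quantity-times-unit-profit form. It is a brute-force illustration only and is not R-Miner; the residue-map and master-map structures are not described in the abstract and are not reproduced here. The toy transactions, the `unit_profit` table, the function names, and the threshold of 15 are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 3},
]
# Hypothetical per-unit profit of each item.
unit_profit = {"a": 5, "b": 2, "c": 1}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum quantity * unit profit over transactions containing every item."""
    total = 0
    for tx in transactions:
        if all(item in tx for item in itemset):
            total += sum(tx[item] * unit_profit[item] for item in itemset)
    return total


def brute_force_huis(transactions, unit_profit, min_utility):
    """Enumerate every itemset and keep those whose utility meets the threshold."""
    items = sorted({item for tx in transactions for item in tx})
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, unit_profit)
            if utility >= min_utility:
                result[itemset] = utility
    return result


high_utility = brute_force_huis(transactions, unit_profit, min_utility=15)
print(high_utility)
```

In this toy example the itemset `("a", "b")` meets the threshold although `("b",)` alone does not, so utility cannot be pruned the way support is in frequent itemset mining. This is why HUIM algorithms rely on upper bounds and pruning conditions of the kind the abstract mentions.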
Pages: 211-235
Page count: 24