A residual utility-based concept for high-utility itemset mining

Authors
Pushp Sra
Satish Chand
Affiliations
[1] Jawaharlal Nehru University, School of Computer and Systems Sciences
Keywords
High-utility itemset mining; Data mining; Knowledge discovery; Memory efficient mining
DOI
Not available
Abstract
Knowledge discovery in databases aims to find information that is useful for decision-making. High-utility itemset mining (HUIM) has attracted considerable research attention because it finds patterns in a database that satisfy a user-defined utility function. The mined patterns support data-backed decisions in fields such as healthcare, e-commerce, and web analytics. Many algorithms for mining high-utility itemsets exist in the literature; however, most of them require multiple database scans or rely on complex data structures. The utility-list is an efficient list-based data structure that is widely adopted in the design of HUIM algorithms. Existing utility-list-based algorithms nevertheless have drawbacks, such as extensive use of inefficient join operations and multiple definitions of the join operation. Although HUIM is an important research area, very little research has been directed towards improving the design of the data structures used in the mining process. In this paper, we introduce the concept of residual utility and use it to design two new data structures, called residue-map and master-map. Using these two data structures, we introduce a new algorithm, called R-Miner, for mining high-utility itemsets. To further optimise the mining process, the cumulative utility value is used as an upper bound, and additional pruning conditions are discussed. Several experiments on both real and synthetic datasets compare the performance of R-Miner with existing list-based algorithms. The experimental results show that R-Miner improves performance by up to the order of 2 compared with the list-based algorithms EFIM, H-Miner, HUI-Miner, FHM, and ULB-Miner.
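For readers unfamiliar with the problem the abstract addresses, the following is a minimal brute-force sketch of the HUIM task itself. It is not R-Miner and does not implement the residue-map or master-map structures, which the abstract does not describe in detail. The toy database, the `unit_profit` table, the threshold, and the function names are all hypothetical, and the utility of an itemset is assumed to be the common "quantity times unit profit, summed over containing transactions" definition.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 2, "d": 1},
]

# Hypothetical external utility (for example, unit profit) of each item.
unit_profit = {"a": 5, "b": 2, "c": 1, "d": 4}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum quantity * unit profit over every transaction containing the whole itemset."""
    total = 0
    for t in transactions:
        if all(item in t for item in itemset):
            total += sum(t[item] * unit_profit[item] for item in itemset)
    return total


def high_utility_itemsets(transactions, unit_profit, min_util):
    """Enumerate every non-empty itemset and keep those with utility >= min_util.

    Exhaustive enumeration is exponential in the number of items; it is shown
    only to make the problem statement concrete, not as a practical miner.
    """
    items = sorted({item for t in transactions for item in t})
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, unit_profit)
            if utility >= min_util:
                result[itemset] = utility
    return result


huis = high_utility_itemsets(transactions, unit_profit, min_util=15)
print(huis)
```

In this toy data, the itemset `("a", "b")` has a lower utility than `("a",)` while `("a", "c")` has a higher one, so utility is neither monotone nor anti-monotone under set extension. This is why HUIM algorithms rely on upper bounds and pruning conditions, such as those mentioned in the abstract, rather than on plain support-style pruning.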
Pages: 211-235
Page count: 24