A residual utility-based concept for high-utility itemset mining

Cited by: 0

Authors:

Pushp Sra

Satish Chand

Affiliation:

[1] Jawaharlal Nehru University, School of Computer and Systems Sciences

Source:

Knowledge and Information Systems | 2024, Vol. 66, No. 1

Keywords:

High-utility itemset mining; Data mining; Knowledge discovery; Memory efficient mining

DOI:

Not available

Abstract:

Knowledge discovery in databases aims to find information useful for decision-making. The problem of high-utility itemset mining (HUIM) has attracted considerable research attention, as it aims to find patterns in a database that satisfy a user-defined utility function. The mined patterns support data-backed decisions in fields such as healthcare, e-commerce, and web analytics. Various algorithms for mining high-utility itemsets from databases exist in the literature; however, most of them require multiple database scans or deploy complex data structures. The utility-list is an efficient list-based data structure that is widely adopted in the design of HUIM algorithms. The existing utility-list-based algorithms, however, suffer from drawbacks such as extensive use of inefficient join operations and multiple definitions of join operations. Although HUIM is an important research area, very little research has been directed towards improving the design of the data structures used in the mining process. In this paper, we introduce the concept of residual utility to design two new data structures, called residue-map and master-map. Using these two data structures, a new algorithm, called R-Miner, is introduced for mining high-utility itemsets. To further optimise the mining process, the cumulative utility value is used as an upper bound, and additional pruning conditions are also discussed. Several experiments are carried out on both real and synthetic datasets to compare the performance of R-Miner with the existing list-based algorithms. The experimental results show that R-Miner improves the performance by up to the order of 2 compared to the list-based algorithms EFIM, H-Miner, HUI-Miner, FHM, and ULB-Miner.
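To make the HUIM problem statement in the abstract concrete, the following is a minimal brute-force sketch in Python. It is not R-Miner and does not use the residue-map or master-map structures, whose details are not given here. The toy transactions, the `unit_profit` table, the utility function (quantity times unit profit), and the function names are all assumptions chosen for illustration.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 2, "d": 1},
]

# Hypothetical per-unit profit of each item (assumed utility weights).
unit_profit = {"a": 5, "b": 2, "c": 1, "d": 4}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum quantity * unit profit over every transaction containing the whole itemset."""
    total = 0
    for tx in transactions:
        if all(item in tx for item in itemset):
            total += sum(tx[item] * unit_profit[item] for item in itemset)
    return total


def brute_force_huim(transactions, unit_profit, min_utility):
    """Enumerate every non-empty itemset and keep those reaching min_utility."""
    items = sorted({item for tx in transactions for item in tx})
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, unit_profit)
            if utility >= min_utility:
                result[itemset] = utility
    return result


high_utility = brute_force_huim(transactions, unit_profit, min_utility=15)
print(high_utility)
```

In this toy data, item `c` alone has utility 6, below the threshold of 15, while `{a, c}` has utility 19. Utility is therefore not anti-monotone, so an itemset cannot be pruned merely because one of its subsets is low-utility. This is why HUIM algorithms rely on upper bounds for pruning instead of exhaustive enumeration, which grows exponentially with the number of items.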

Pages: 211-235

Page count: 24
