A residual utility-based concept for high-utility itemset mining

被引:1
|
作者
Sra, Pushp [1 ]
Chand, Satish [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
关键词
High-utility itemset mining; Data mining; Knowledge discovery; Memory efficient mining;
D O I
10.1007/s10115-023-01948-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge discovery in databases aims at finding useful information for decision-making. The problem of high-utility itemset mining (HUIM) has specifically garnered huge research attention, as it aims to find relevant information on patterns in a database, which conform to a user-defined utility function. The mined patterns are used for making data-backed decisions in the fields of healthcare, e-commerce, web analytics, etc. Various algorithms exist in the literature related to mining the high-utility items from the databases; however, most of them require multiple database scans, or deploy complex data structures. The utility-list is an efficient list-based data structure that is being widely adopted in the design of HUIM algorithms. The existing utility-list-based algorithms, however, suffer from some drawbacks like extensive use of inefficient join operations, multiple definitions of join operations, etc. Though the HUIM is an important research area, yet very little research has been directed towards improving the design of data structures used for the mining process. In this paper, we introduce the concept of residual utility to design two new data structures, called residue-map and master-map. Using these two data structures, a new algorithm, called R-Miner, is introduced for mining the high-utility items. In order to further optimise the mining process, the cumulative utility value is used as an upper bound and additional pruning conditions are also discussed. Several experiments are carried out on both real and synthetic datasets to compare the performance of R-Miner with the existing list-based algorithms. The experimental results show that the R-Miner improves the performance by up to the order of 2 as compared to the list-based algorithms: EFIM, H-Miner, HUI-Miner, FHM, and ULB-Miner.
引用
下载
收藏
页码:211 / 235
页数:25
相关论文
共 50 条
  • [21] A predictive GA-based model for closed high-utility itemset mining
    Lin, Jerry Chun-Wei
    Djenouri, Youcef
    Srivastava, Gautam
    Yun, Unil
    Fournier-Viger, Philippe
    APPLIED SOFT COMPUTING, 2021, 108
  • [22] Targeted High-Utility Itemset Querying
    Miao J.
    Wan S.
    Gan W.
    Sun J.
    Chen J.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (04): : 871 - 883
  • [23] Targeted High-Utility Itemset Querying
    Miao, Jinbao
    Wan, Shicheng
    Gan, Wensheng
    Sun, Jiayi
    Chen, Jiahui
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5534 - 5543
  • [24] Whitebox Induction of Default Rules Using High-Utility Itemset Mining
    Shakerin, Farhad
    Gupta, Gopal
    PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES (PADL 2020), 2020, 12007 : 168 - 176
  • [25] HUITWU: An Efficient Algorithm for High-Utility Itemset Mining in Transaction Databases
    Shi-Ming Guo
    Hong Gao
    Journal of Computer Science and Technology, 2016, 31 : 776 - 786
  • [26] Efficient evolutionary computation model of closed high-utility itemset mining
    Lin, Jerry Chun-Wei
    Djenouri, Youcef
    Srivastava, Gautam
    Fourier-Viger, Philippe
    APPLIED INTELLIGENCE, 2022, 52 (09) : 10604 - 10616
  • [27] HUITWU: An Efficient Algorithm for High-Utility Itemset Mining in Transaction Databases
    Guo, Shi-Ming
    Gao, Hong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2016, 31 (04) : 776 - 786
  • [28] Investigating Crossover Operators in Genetic Algorithms for High-Utility Itemset Mining
    Nawaz, M. Saqib
    Fournier-Viger, Philippe
    Song, Wei
    Lin, Jerry Chun-Wei
    Noack, Bernd
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 16 - 28
  • [29] EFIM: a fast and memory efficient algorithm for high-utility itemset mining
    Zida, Souleymane
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Wu, Cheng-Wei
    Tseng, Vincent S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (02) : 595 - 625
  • [30] EFIM: a fast and memory efficient algorithm for high-utility itemset mining
    Souleymane Zida
    Philippe Fournier-Viger
    Jerry Chun-Wei Lin
    Cheng-Wei Wu
    Vincent S. Tseng
    Knowledge and Information Systems, 2017, 51 : 595 - 625