An efficient PSO-based evolutionary model for closed high-utility itemset mining

被引:0
|
作者
Carstensen, Simen [1 ]
Lin, Jerry Chun-Wei [2 ]
机构
[1] Univ Bergen, Bergen, Norway
[2] Western Norway Univ Appl Sci, Bergen, Norway
关键词
Evolutionary computation; Closed high-utility itemset; Data mining; Optimization; Particle swarm optimization; DISCOVERY;
D O I
10.1007/s10489-024-06151-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-utility itemset mining (HUIM) is a widely adopted data mining technique for discovering valuable patterns in transactional databases. Although HUIM can provide useful knowledge in various types of data, it can be challenging to interpret the results when many patterns are found. To alleviate this, closed high-utility itemset mining (CHUIM) has been suggested, which provides users with a more concise and meaningful set of solutions. However, CHUIM is a computationally demanding task, and current approaches can require prolonged runtimes. This paper aims to solve this problem and proposes a meta-heuristic model based on particle swarm optimization (PSO) to discover CHUIs, called CHUI-PSO. Moreover, the algorithm incorporates several new strategies to reduce the computational cost associated with similar existing techniques. First, we introduce Extended TWU pruning (ETP), which aims to decrease the number of possible candidates to improve the discovery of solutions in large search spaces. Second, we propose two new utility upper bounds, used to estimate itemset utilities and bypass expensive candidate evaluations. Finally, to increase population diversity and prevent redundant computations, we suggest a structure called ExploredSet to maintain and utilize the evaluated candidates. Extensive experimental results show that CHUI-PSO outperforms the current state-of-the-art algorithms regarding execution time, accuracy, and convergence.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] HUITWU: An Efficient Algorithm for High-Utility Itemset Mining in Transaction Databases
    Shi-Ming Guo
    Hong Gao
    Journal of Computer Science and Technology, 2016, 31 : 776 - 786
  • [22] EFIM: a fast and memory efficient algorithm for high-utility itemset mining
    Zida, Souleymane
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Wu, Cheng-Wei
    Tseng, Vincent S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (02) : 595 - 625
  • [23] EFIM: a fast and memory efficient algorithm for high-utility itemset mining
    Souleymane Zida
    Philippe Fournier-Viger
    Jerry Chun-Wei Lin
    Cheng-Wei Wu
    Vincent S. Tseng
    Knowledge and Information Systems, 2017, 51 : 595 - 625
  • [24] HUITWU: An Efficient Algorithm for High-Utility Itemset Mining in Transaction Databases
    Guo, Shi-Ming
    Gao, Hong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2016, 31 (04) : 776 - 786
  • [25] A Parallel High-Utility Itemset Mining Algorithm Based on Hadoop
    Cheng Z.
    Shen W.
    Fang W.
    Lin J.C.-W.
    Complex System Modeling and Simulation, 2023, 3 (01): : 47 - 58
  • [26] A survey of incremental high-utility itemset mining
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    Hong, Tzung-Pei
    Fujita, Hamido
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
  • [27] Efficient High-Utility Itemset Mining Over Variety of Databases: A Survey
    Suvarna, U.
    Srinivas, Y.
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 803 - 816
  • [28] Efficient high-utility occupancy itemset mining algorithm on massive data
    He, Jingxuan
    Han, Xixian
    Wang, Jinbao
    Zhang, Kaiqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
  • [29] Ignoring Internal Utilities in High-Utility Itemset Mining
    Oguz, Damla
    SYMMETRY-BASEL, 2022, 14 (11):
  • [30] High-Utility Itemset Mining with Effective Pruning Strategies
    Wu, Jimmy Ming-Tai
    Lin, Jerry Chun-Wei
    Tamrakar, Ashish
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (06)