Improved Genetic Algorithm for High-Utility Itemset Mining

被引:21
|
作者
Zhang, Qiang [1 ]
Fang, Wei [1 ,2 ]
Sun, Jun [1 ,2 ]
Wang, Quan [3 ]
机构
[1] Jiangnan Univ, Sch IoT Engn, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Wuxi SensingNet Industrializat Res Inst, Wuxi 214315, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; high-utility itemset mining; genetic algorithm; neighborhood exploration; diversity maintenance; EFFICIENT ALGORITHMS; DISCOVERY; VERSION;
D O I
10.1109/ACCESS.2019.2958150
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-utility itemset mining (HUIM) is an important research topic in the data mining field. Typically, traditional HUIM algorithms must handle the exponential problem of huge search space when the database size or number of distinct items is very large. As an alternative and effective approach, evolutionary computation (EC)-based algorithms have been proposed to solve HUIM problems because they can obtain a set of nearly optimal solutions in limited time. However, it is still time-consuming for EC-based algorithms to find complete high-utility itemsets (HUIs) in transactional databases. To address this problem, we propose an HUIM algorithm based on an improved genetic algorithm (HUIM-IGA). In addition, a neighborhood exploration strategy is proposed to improve search efficiency for HUIs. To reduce missing HUIs, a population diversity maintenance strategy is employed in the proposed HUIM-IGA. An individual repair method is also introduced to reduce invalid combinations for discovering HUIs. In addition, an elite strategy is employed to prevent the loss of HUIs. Experimental results obtained on a set of real-world datasets demonstrate that the proposed algorithm can find complete HUIs in terms of the given minimum utility threshold, and the time-consuming of HUIM-IGA is relatively lower when mining the same number of HUIs than state-of-the-art EC-based HUIM algorithms.
引用
收藏
页码:176799 / 176813
页数:15
相关论文
共 50 条
  • [41] Personalized Recommendation Approach for Academic Literature Using High-Utility Itemset Mining Technique
    Dhanda, Mahak
    Verma, Vijay
    PROGRESS IN INTELLIGENT COMPUTING TECHNIQUES: THEORY, PRACTICE, AND APPLICATIONS, VOL 2, 2018, 719 : 247 - 254
  • [42] High Utility Itemset Mining over Uncertain Datasets Based on a Quantum Genetic Algorithm
    Wang, Ju
    Liu, Fuxian
    Jin, Chunjie
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (08): : 3606 - 3629
  • [43] A high utility itemset mining algorithm based on subsume index
    Song, Wei
    Zhang, Zihan
    Li, Jinhong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (01) : 315 - 340
  • [44] Performance comparison of inertia weight and acceleration coefficients of BPSO in the context of high-utility itemset mining
    Ridowati Gunawan
    Edi Winarko
    Reza Pulungan
    Evolutionary Intelligence, 2023, 16 : 943 - 961
  • [45] An efficient algorithm for mining periodic high-utility sequential patterns
    Duy-Tai Dinh
    Bac Le
    Fournier-Viger, Philippe
    Van-Nam Huynh
    APPLIED INTELLIGENCE, 2018, 48 (12) : 4694 - 4714
  • [46] A high utility itemset mining algorithm based on subsume index
    Wei Song
    Zihan Zhang
    Jinhong Li
    Knowledge and Information Systems, 2016, 49 : 315 - 340
  • [47] A High Utility Itemset Mining Algorithm Based on Particle Filter
    Yang, Yang
    Ding, Jiaman
    Wang, Honghai
    Xing, Huifen
    Li, En
    Mathematical Problems in Engineering, 2023, 2023
  • [48] An Efficient Algorithm for Incremental and Interactive High Utility Itemset Mining
    Guo, Shiming
    Gao, Hong
    2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 996 - 1001
  • [49] High-utility itemsets mining integrating an improved crow search algorithm and particle search optimization
    Ledmi M.
    Ledmi A.
    Souidi M.E.H.
    Hamdi-Cherif A.
    Maarouk T.M.
    Hamdi-Cherif C.K.-M.
    Soft Computing, 2024, 28 (13-14) : 8471 - 8496
  • [50] Cross-Level High-Utility Itemset Mining Using Multi-core Processing
    Tung, N. T.
    Nguyen, Loan T. T.
    Nguyen, Trinh D. D.
    Kozierkiewicz, Adrianna
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 12876 : 467 - 479