Improved Genetic Algorithm for High-Utility Itemset Mining

被引:21
|
作者
Zhang, Qiang [1 ]
Fang, Wei [1 ,2 ]
Sun, Jun [1 ,2 ]
Wang, Quan [3 ]
机构
[1] Jiangnan Univ, Sch IoT Engn, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Wuxi SensingNet Industrializat Res Inst, Wuxi 214315, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; high-utility itemset mining; genetic algorithm; neighborhood exploration; diversity maintenance; EFFICIENT ALGORITHMS; DISCOVERY; VERSION;
D O I
10.1109/ACCESS.2019.2958150
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-utility itemset mining (HUIM) is an important research topic in the data mining field. Typically, traditional HUIM algorithms must handle the exponential problem of huge search space when the database size or number of distinct items is very large. As an alternative and effective approach, evolutionary computation (EC)-based algorithms have been proposed to solve HUIM problems because they can obtain a set of nearly optimal solutions in limited time. However, it is still time-consuming for EC-based algorithms to find complete high-utility itemsets (HUIs) in transactional databases. To address this problem, we propose an HUIM algorithm based on an improved genetic algorithm (HUIM-IGA). In addition, a neighborhood exploration strategy is proposed to improve search efficiency for HUIs. To reduce missing HUIs, a population diversity maintenance strategy is employed in the proposed HUIM-IGA. An individual repair method is also introduced to reduce invalid combinations for discovering HUIs. In addition, an elite strategy is employed to prevent the loss of HUIs. Experimental results obtained on a set of real-world datasets demonstrate that the proposed algorithm can find complete HUIs in terms of the given minimum utility threshold, and the time-consuming of HUIM-IGA is relatively lower when mining the same number of HUIs than state-of-the-art EC-based HUIM algorithms.
引用
收藏
页码:176799 / 176813
页数:15
相关论文
共 50 条
  • [21] A residual utility-based concept for high-utility itemset mining
    Sra, Pushp
    Chand, Satish
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023,
  • [22] A residual utility-based concept for high-utility itemset mining
    Pushp Sra
    Satish Chand
    Knowledge and Information Systems, 2024, 66 (1) : 211 - 235
  • [23] A residual utility-based concept for high-utility itemset mining
    Sra, Pushp
    Chand, Satish
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (01) : 211 - 235
  • [24] MMC: efficient and effective closed high-utility itemset mining
    Mofid, Amir Hossein
    Daneshpour, Negin
    Torabi, Zeinab
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 18900 - 18918
  • [25] Large-Scale Closed High-Utility Itemset Mining
    Lin, Jerry Chun-Wei
    Djenouri, Youcef
    Srivastava, Gautam
    Wu, Jimmy Ming-Tai
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 591 - 598
  • [26] TopHUI: Top-k high-utility itemset mining with negative utility
    Gan, Wensheng
    Wan, Shicheng
    Chen, Jiahui
    Chen, Chien-Ming
    Qiu, Lina
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5350 - 5359
  • [27] Whitebox Induction of Default Rules Using High-Utility Itemset Mining
    Shakerin, Farhad
    Gupta, Gopal
    PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES (PADL 2020), 2020, 12007 : 168 - 176
  • [28] Efficient evolutionary computation model of closed high-utility itemset mining
    Lin, Jerry Chun-Wei
    Djenouri, Youcef
    Srivastava, Gautam
    Fourier-Viger, Philippe
    APPLIED INTELLIGENCE, 2022, 52 (09) : 10604 - 10616
  • [29] Efficient evolutionary computation model of closed high-utility itemset mining
    Jerry Chun-Wei Lin
    Youcef Djenouri
    Gautam Srivastava
    Philippe Fourier-Viger
    Applied Intelligence, 2022, 52 : 10604 - 10616
  • [30] Efficient High-Utility Itemset Mining Over Variety of Databases: A Survey
    Suvarna, U.
    Srinivas, Y.
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 803 - 816