Improved Genetic Algorithm for High-Utility Itemset Mining

被引:21
|
作者
Zhang, Qiang [1 ]
Fang, Wei [1 ,2 ]
Sun, Jun [1 ,2 ]
Wang, Quan [3 ]
机构
[1] Jiangnan Univ, Sch IoT Engn, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Wuxi SensingNet Industrializat Res Inst, Wuxi 214315, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; high-utility itemset mining; genetic algorithm; neighborhood exploration; diversity maintenance; EFFICIENT ALGORITHMS; DISCOVERY; VERSION;
D O I
10.1109/ACCESS.2019.2958150
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-utility itemset mining (HUIM) is an important research topic in the data mining field. Typically, traditional HUIM algorithms must handle the exponential problem of huge search space when the database size or number of distinct items is very large. As an alternative and effective approach, evolutionary computation (EC)-based algorithms have been proposed to solve HUIM problems because they can obtain a set of nearly optimal solutions in limited time. However, it is still time-consuming for EC-based algorithms to find complete high-utility itemsets (HUIs) in transactional databases. To address this problem, we propose an HUIM algorithm based on an improved genetic algorithm (HUIM-IGA). In addition, a neighborhood exploration strategy is proposed to improve search efficiency for HUIs. To reduce missing HUIs, a population diversity maintenance strategy is employed in the proposed HUIM-IGA. An individual repair method is also introduced to reduce invalid combinations for discovering HUIs. In addition, an elite strategy is employed to prevent the loss of HUIs. Experimental results obtained on a set of real-world datasets demonstrate that the proposed algorithm can find complete HUIs in terms of the given minimum utility threshold, and the time-consuming of HUIM-IGA is relatively lower when mining the same number of HUIs than state-of-the-art EC-based HUIM algorithms.
引用
收藏
页码:176799 / 176813
页数:15
相关论文
共 50 条
  • [31] Efficient High-utility Itemset Mining Based on a Novel Data Structure
    Shen, Wei
    Zhang, Chao
    Fang, Wei
    Zhang, Xin
    Than, Zhi-Hui
    Lin, Jerry Chun-Wei
    2021 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2021,
  • [32] Targeted High-Utility Itemset Querying
    Miao J.
    Wan S.
    Gan W.
    Sun J.
    Chen J.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (04): : 871 - 883
  • [33] Targeted High-Utility Itemset Querying
    Miao, Jinbao
    Wan, Shicheng
    Gan, Wensheng
    Sun, Jiayi
    Chen, Jiahui
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5534 - 5543
  • [34] CLS-Miner: efficient and effective closed high-utility itemset mining
    Thu-Lan Dam
    Li, Kenli
    Fournier-Viger, Philippe
    Quang-Huy Duong
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (02) : 357 - 381
  • [35] Effective sanitization approaches to protect sensitive knowledge in high-utility itemset mining
    Liu, Xuan
    Wen, Shiting
    Zuo, Wanli
    APPLIED INTELLIGENCE, 2020, 50 (01) : 169 - 191
  • [36] CLS-Miner: efficient and effective closed high-utility itemset mining
    Thu-Lan Dam
    Kenli Li
    Philippe Fournier-Viger
    Quang-Huy Duong
    Frontiers of Computer Science, 2019, 13 : 357 - 381
  • [37] Effective sanitization approaches to protect sensitive knowledge in high-utility itemset mining
    Xuan Liu
    Shiting Wen
    Wanli Zuo
    Applied Intelligence, 2020, 50 : 169 - 191
  • [38] A predictive GA-based model for closed high-utility itemset mining
    Lin, Jerry Chun-Wei
    Djenouri, Youcef
    Srivastava, Gautam
    Yun, Unil
    Fournier-Viger, Philippe
    APPLIED SOFT COMPUTING, 2021, 108
  • [39] A BPSO-based method for high-utility itemset mining without minimum utility threshold
    Gunawan, Ridowati
    Winarko, Edi
    Pulungan, Reza
    KNOWLEDGE-BASED SYSTEMS, 2020, 190
  • [40] GPU-Based Efficient Parallel Heuristic Algorithm for High-Utility Itemset Mining in Large Transaction Datasets
    Fang, Wei
    Jiang, Haipeng
    Lu, Hengyang
    Sun, Jun
    Wu, Xiaojun
    Lin, Jerry Chun-Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 652 - 667