Multi-Objective Optimization for High-Dimensional Maximal Frequent Itemset Mining

被引:5
|
作者
Zhang, Yalong [1 ]
Yu, Wei [1 ]
Ma, Xuan [2 ]
Ogura, Hisakazu [3 ]
Ye, Dongfen [1 ]
机构
[1] Quzhou Univ, Coll Elect & Informat Engn, Quzhou, Peoples R China
[2] Xian Univ Technol, Fac Automat & Informat Engn, Xian 710048, Peoples R China
[3] Univ Fukui, Grad Sch Engn, Fukui 9108507, Japan
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 19期
关键词
association rules; frequent itemset mining; big data; multi-objective optimization; maximal frequent itemset; ALGORITHM;
D O I
10.3390/app11198971
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The solution space of a frequent itemset generally presents exponential explosive growth because of the high-dimensional attributes of big data. However, the premise of the big data association rule analysis is to mine the frequent itemset in high-dimensional transaction sets. Traditional and classical algorithms such as the Apriori and FP-Growth algorithms, as well as their derivative algorithms, are unacceptable in practical big data analysis in an explosive solution space because of their huge consumption of storage space and running time. A multi-objective optimization algorithm was proposed to mine the frequent itemset of high-dimensional data. First, all frequent 2-itemsets were generated by scanning transaction sets based on which new items were added in as the objects of population evolution. Algorithms aim to search for the maximal frequent itemset to gather more non-void subsets because non-void subsets of frequent itemsets are all properties of frequent itemsets. During the operation of algorithms, lethal gene fragments in individuals were recorded and eliminated so that individuals may resurge. Finally, the set of the Pareto optimal solution of the frequent itemset was gained. All non-void subsets of these solutions were frequent itemsets, and all supersets are non-frequent itemsets. Finally, the practicability and validity of the proposed algorithm in big data were proven by experiments.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] An itemset reduction based multi-objective evolutionary algorithm for mining high-dimensional frequent and high utility itemsets
    Zhang L.
    Li L.
    Yang H.-P.
    Sun X.
    Cheng F.
    Sun X.-Y.
    Su Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (10): : 2832 - 2840
  • [2] The Discussions of Maximal Frequent Itemset Mining Optimization
    Li, Haifeng
    2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE AND INTERNET TECHNOLOGY (CII 2016), 2016, : 96 - 100
  • [3] Multi-Objective Optimization Algorithm for High-Dimensional Portfolios
    Song, Yingjie
    Han, Lihuan
    Computer Engineering and Applications, 2024, 60 (19) : 309 - 322
  • [4] Multi-objective Optimization in High-Dimensional Molecular Systems
    Slanzi, Debora
    Mameli, Valentina
    Khoroshiltseva, Marina
    Poli, Irene
    ARTIFICIAL LIFE AND EVOLUTIONARY COMPUTATION, WIVACE 2017, 2018, 830 : 284 - 295
  • [5] A Closed Itemset Property based Multi-objective Evolutionary Approach for Mining Frequent and High Utility Itemsets
    Cao, Heng
    Yang, Shangshang
    Wang, Qingren
    Wang, Qijun
    Zhang, Lei
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 3356 - 3363
  • [6] BAYESIAN OPTIMIZATION FOR MULTI-OBJECTIVE HIGH-DIMENSIONAL TURBINE AERO DESIGN
    Zhang, Yiming
    Ghosh, Sayan
    Vandeputte, Thomas
    Wang, Liping
    PROCEEDINGS OF ASME TURBO EXPO 2021: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, VOL 9B, 2021,
  • [7] Multi-Objective Bayesian Optimization over High-Dimensional Search Spaces
    Daulton, Samuel
    Eriksson, David
    Balandat, Maximillian
    Bakshy, Eytan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 507 - 517
  • [8] High-dimensional expensive multi-objective optimization via additive structure
    Wang, Hongyan
    Xu, Hua
    Yuan, Yuan
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
  • [9] Frequent Itemset Mining in High Dimensional Data: A Review
    Zaki, Fatimah Audah Md
    Zulkurnain, Nurul Fariza
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, 2019, 481 : 325 - 334
  • [10] Feature selection in high-dimensional EEG data by parallel multi-objective optimization
    Kimovski, Dragi
    Ortega, Julio
    Ortiz, Andres
    Banos, Raul
    2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2014, : 314 - 322