Mining High Utility Itemsets Using Prefix Trees and Utility Vectors

被引:13
|
作者
Qu, Jun-Feng [1 ]
Fournier-Viger, Philippe [2 ]
Liu, Mengchi [3 ]
Hang, Bo [1 ]
Hu, Chunyang [1 ]
机构
[1] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang 441053, Hubei, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518055, Guangdong, Peoples R China
[3] South China Normal Univ, Sch Comp Sci, Guangzhou Key Lab Big Data & Intelligent Educ, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
High utility itemset; mining algorithm; prefix tree; utility vector; ALGORITHM; GENERATION; DISCOVERY; PATTERNS;
D O I
10.1109/TKDE.2023.3256126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility itemsets can reveal combinations of items that have a high profit, expense, or importance. Mining high utility itemsets in a database with n items generally results in a huge search space, composed of 2(n )itemsets, and heavy utility calculations for the explored itemsets. Previous algorithms using prefix tree structures perform two phases, namely candidate generation and testing. To avoid generating candidate itemsets, one-phase algorithms use list or hyper-link structures and have been proven to be superior to two-phase algorithms. However, it should be noted that a prefix tree is still an efficient structure for itemset mining problems, and especially algorithms using prefix trees such as FP-Growth have shown excellent performance for mining frequent itemsets. This paper proposes Hamm, a High-performance AlgorithM for Mining high utility itemsets. Hamm employs a novel TV (prefix Tree and utility Vector) structure and mines high utility itemsets in one phase without candidate generation. We also develop an efficient optimization which is incorporated into Hamm as a component. Using prefix trees and utility vectors, Hamm outperforms state-of-the-art algorithms on various databases in experiments. Experimental results also show that the proposed optimization remarkably reduces the search space and speeds up Hamm.
引用
收藏
页码:10224 / 10236
页数:13
相关论文
共 50 条
  • [31] Mining of top-k high utility itemsets with negative utility
    Sun, Rui
    Han, Meng
    Zhang, Chunyan
    Shen, Mingyao
    Du, Shiyu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5637 - 5652
  • [32] Mining correlated high-utility itemsets using various measures
    Fournier-Viger, Philippe
    Zhang, Yimin
    Lin, Jerry Chun-Wei
    Dinh, Duy-Tai
    Hoai Bac Le
    LOGIC JOURNAL OF THE IGPL, 2020, 28 (01) : 19 - 32
  • [33] On Minimal and Maximal High Utility Itemsets Mining using Propositional Satisfiability
    Hidouri, Amel
    Jabbour, Said
    Dlala, Imen Ouled
    Raddaoui, Badran
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 622 - 628
  • [34] Mining Correlated High-Utility Itemsets Using the Cosine Measure
    Huynh Anh Duy
    Huynh Anh Khoa
    Phan Duy Hung
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 307 - 319
  • [35] Mining Correlated High-Utility Itemsets Using the Bond Measure
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Tai Dinh
    Hoai Bac Le
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, 2016, 9648 : 53 - 65
  • [36] Mining long high utility itemsets in transaction databases
    Yu, Guangzhu
    Shao, Shihuang
    Sun, Daoqing
    Luo, Bin
    NEW ADVANCES IN SIMULATION, MODELLING AND OPTIMIZATION (SMO '07), 2007, : 326 - +
  • [37] Fast mining local high-utility itemsets
    Song, Wei
    Ren, Guibin
    Gan, Wensheng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
  • [38] Efficiently mining uncertain high-utility itemsets
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Vincent S. Tseng
    Soft Computing, 2017, 21 : 2801 - 2820
  • [39] Mining Closed High Utility Itemsets in Uncertain Databases
    Nguyen Bui
    Bay Vo
    Van-Nam Huynh
    Lin, Chun-Wei
    Nguyen, Loan T. T.
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 7 - 14
  • [40] High average-utility itemsets mining: a survey
    Kuldeep Singh
    Rajiv Kumar
    Bhaskar Biswas
    Applied Intelligence, 2022, 52 : 3901 - 3938