Mining High Utility Itemsets Using Prefix Trees and Utility Vectors

被引:13
|
作者
Qu, Jun-Feng [1 ]
Fournier-Viger, Philippe [2 ]
Liu, Mengchi [3 ]
Hang, Bo [1 ]
Hu, Chunyang [1 ]
机构
[1] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang 441053, Hubei, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518055, Guangdong, Peoples R China
[3] South China Normal Univ, Sch Comp Sci, Guangzhou Key Lab Big Data & Intelligent Educ, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
High utility itemset; mining algorithm; prefix tree; utility vector; ALGORITHM; GENERATION; DISCOVERY; PATTERNS;
D O I
10.1109/TKDE.2023.3256126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility itemsets can reveal combinations of items that have a high profit, expense, or importance. Mining high utility itemsets in a database with n items generally results in a huge search space, composed of 2(n )itemsets, and heavy utility calculations for the explored itemsets. Previous algorithms using prefix tree structures perform two phases, namely candidate generation and testing. To avoid generating candidate itemsets, one-phase algorithms use list or hyper-link structures and have been proven to be superior to two-phase algorithms. However, it should be noted that a prefix tree is still an efficient structure for itemset mining problems, and especially algorithms using prefix trees such as FP-Growth have shown excellent performance for mining frequent itemsets. This paper proposes Hamm, a High-performance AlgorithM for Mining high utility itemsets. Hamm employs a novel TV (prefix Tree and utility Vector) structure and mines high utility itemsets in one phase without candidate generation. We also develop an efficient optimization which is incorporated into Hamm as a component. Using prefix trees and utility vectors, Hamm outperforms state-of-the-art algorithms on various databases in experiments. Experimental results also show that the proposed optimization remarkably reduces the search space and speeds up Hamm.
引用
收藏
页码:10224 / 10236
页数:13
相关论文
共 50 条
  • [41] Efficient Vertical Mining of High Utility Quantitative Itemsets
    Li, Chia Hua
    Wu, Cheng-Wei
    Tseng, Vincent S.
    2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 155 - 160
  • [42] Mining High Utility Itemsets in Massive Transactional Datasets
    Thi, Vu Due
    Nguyen Huy Due
    ACTA CYBERNETICA, 2011, 20 (02): : 331 - 346
  • [43] PHM: Mining Periodic High-Utility Itemsets
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Quang-Huy Duong
    Thu-Lan Dam
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2016, 9728 : 64 - 79
  • [44] A New Method for Mining High Average Utility Itemsets
    Lu, Tien
    Vo, Bay
    Nguyen, Hien T.
    Hong, Tzung-Pei
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2014, 2014, 8838 : 33 - 42
  • [45] An efficient structure for fast mining high utility itemsets
    Zhi-Hong Deng
    Applied Intelligence, 2018, 48 : 3161 - 3177
  • [46] High average-utility itemsets mining: a survey
    Singh, Kuldeep
    Kumar, Rajiv
    Biswas, Bhaskar
    APPLIED INTELLIGENCE, 2022, 52 (04) : 3901 - 3938
  • [47] Mining High Utility Itemsets over Uncertain Databases
    Lan, Yuqing
    Wang, Yang
    Wang, Yanni
    Yi, Shengwei
    Yu, Dan
    2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 235 - 238
  • [48] An effective tree structure for mining high utility itemsets
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Lu, Wen-Hsiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) : 7419 - 7424
  • [49] Mining High Transaction-Weighted Utility Itemsets
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 314 - 318
  • [50] Pushing regularity constraint on high utility itemsets mining
    Amphawan, Komate
    Surarerks, Athasit
    2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,