Mining High Utility Itemsets Using Prefix Trees and Utility Vectors

被引：13

作者：

Qu, Jun-Feng ^{[1
]}

Fournier-Viger, Philippe ^{[2
]}

Liu, Mengchi ^{[3
]}

Hang, Bo ^{[1
]}

Hu, Chunyang ^{[1
]}

机构：

[1] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang 441053, Hubei, Peoples R China

[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518055, Guangdong, Peoples R China

[3] South China Normal Univ, Sch Comp Sci, Guangzhou Key Lab Big Data & Intelligent Educ, Guangzhou 510631, Guangdong, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2023年 / 35卷 / 10期

基金：

中国国家自然科学基金;

关键词：

High utility itemset; mining algorithm; prefix tree; utility vector; ALGORITHM; GENERATION; DISCOVERY; PATTERNS;

D O I：

10.1109/TKDE.2023.3256126

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High utility itemsets can reveal combinations of items that have a high profit, expense, or importance. Mining high utility itemsets in a database with n items generally results in a huge search space, composed of 2(n )itemsets, and heavy utility calculations for the explored itemsets. Previous algorithms using prefix tree structures perform two phases, namely candidate generation and testing. To avoid generating candidate itemsets, one-phase algorithms use list or hyper-link structures and have been proven to be superior to two-phase algorithms. However, it should be noted that a prefix tree is still an efficient structure for itemset mining problems, and especially algorithms using prefix trees such as FP-Growth have shown excellent performance for mining frequent itemsets. This paper proposes Hamm, a High-performance AlgorithM for Mining high utility itemsets. Hamm employs a novel TV (prefix Tree and utility Vector) structure and mines high utility itemsets in one phase without candidate generation. We also develop an efficient optimization which is incorporated into Hamm as a component. Using prefix trees and utility vectors, Hamm outperforms state-of-the-art algorithms on various databases in experiments. Experimental results also show that the proposed optimization remarkably reduces the search space and speeds up Hamm.

引用

页码：10224 / 10236

页数：13

共 50 条

[21] Generalized maximal utility for mining high average-utility itemsets
Song, Wei
Liu, Lu
Huang, Chaomin
KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (11) : 2947 - 2967
[22] New approaches for mining high utility itemsets with multiple utility thresholds
Bao Huynh
N. T. Tung
Trinh D. D. Nguyen
Cuong Trinh
Vaclav Snasel
Loan Nguyen
Applied Intelligence, 2024, 54 : 767 - 790
[23] Mining High Utility Itemsets with Regular Occurrence
Amphawan, Komate
Lenca, Philippe
Jitpattanakul, Anuchit
Surarerks, Athasit
JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2016, 10 (02) : 153 - 176
[24] A Fast Algorithm for Mining High Utility Itemsets
Shankar, S.
Purusothaman, T.
Jayanthi, S.
Babu, Nishanth
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1459 - +
[25] Mining Locally Trending High Utility Itemsets
Fournier-Viger, Philippe
Yang, Yanjun
Lin, Jerry Chun-Wei
Frnda, Jaroslav
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 99 - 111
[26] Generalized maximal utility for mining high average-utility itemsets
Wei Song
Lu Liu
Chaomin Huang
Knowledge and Information Systems, 2021, 63 : 2947 - 2967
[27] Mining Minimal High-Utility Itemsets
Fournier-Viger, Philippe
Lin, Jerry Chun-Wei
Wu, Cheng-Wei
Tseng, Vincent S.
Faghihi, Usef
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT I, 2016, 9827 : 88 - 101
[28] Mining High Average-Utility Itemsets
Hong, Tzung-Pei
Lee, Cho-Han
Wang, Shyue-Liang
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2526 - +
[29] Efficient mining of high utility itemsets with multiple minimum utility thresholds
Krishnamoorthy, Srikumar
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 69 : 112 - 126
[30] Mining of High Utility Itemsets with Negative Utility values for Incremental Datasets
Pushp
Chand, Satish
2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 431 - 436

← 1 2 3 4 5 →