An Algorithm for Mining Fixed-Length High Utility Itemsets

被引:1
|
作者
Wang, Le [1 ]
机构
[1] Ningbo Univ Finance & Econ, Ningbo 315175, Zhejiang, Peoples R China
关键词
Data mining; High utility itemsets; Pattern growth; Frequent pattern; DISCOVERY; STRATEGY;
D O I
10.1007/978-3-031-11217-1_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility pattern/itemset mining is a hotspot of data mining. Different from the traditional frequent pattern, high utility pattern takes into consideration not only the number of items in the transaction, but also the weight of these items, such as profit and price. Hence the computational complexity of this mining algorithm is higher than the traditional frequent pattern mining. Thus, one essential topic of this field is to reduce the search space and improve the mining efficiency. Constraint on pattern length can effectively reduce algorithm search space while fulfill a certain kind of actual requirement. Addressing fixed length high utility pattern mining, we propose a novel algorithm, called HUIK (High Utility Itemsets with K-lengthMiner), that first compresses transaction data into a tree, then recursively searches high utility patterns with designated length using a pattern growth approach. An effective pruning strategy is also proposed to reduce the number of candidate items on the compressed tree, to further reduce the search space and improve algorithm efficiency. The performance of the algorithm HUIK is verified on six classical datasets. Experimental results verify that the proposed algorithm has a significant improvement in time efficiency, especially for long datasets and dense datasets.
引用
收藏
页码:3 / 20
页数:18
相关论文
共 50 条
  • [21] Mining Local High Utility Itemsets
    Fournier-Viger, Philippe
    Zhang, Yimin
    Lin, Jerry Chun-Wei
    Fujita, Hamido
    Koh, Yun Sing
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 450 - 460
  • [22] An Efficient Algorithm for Mining High-Utility Itemsets with Discount Notion
    Bansal, Ruchita
    Dawar, Siddharth
    Goyal, Vikram
    BIG DATA ANALYTICS, BDA 2015, 2015, 9498 : 84 - 98
  • [23] An efficient biobjective evolutionary algorithm for mining frequent and high utility itemsets
    Fang, Wei
    Li, Chongyang
    Zhang, Qiang
    Zhang, Xin
    Lin, Jerry Chun-Wei
    APPLIED SOFT COMPUTING, 2023, 140
  • [24] Vertical Mining for High Utility Itemsets
    Song, Wei
    Liu, Yu
    Li, Jinhong
    2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012, : 429 - 434
  • [25] Binary partition for itemsets expansion in mining high utility itemsets
    Song, Wei
    Wang, Chunhua
    Li, Jinhong
    INTELLIGENT DATA ANALYSIS, 2016, 20 (04) : 915 - 931
  • [26] Mining of high-utility itemsets with negative utility
    Singh, Kuldeep
    Shakya, Harish Kumar
    Singh, Abhimanyu
    Biswas, Bhaskar
    EXPERT SYSTEMS, 2018, 35 (06)
  • [27] An efficient algorithm for mining temporal high utility itemsets from data streams
    Chu, Chun-Jung
    Tseng, Vincent S.
    Liang, Tyne
    JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (07) : 1105 - 1117
  • [28] TKEH: an efficient algorithm for mining top-k high utility itemsets
    Singh, Kuldeep
    Singh, Shashank Sheshar
    Kumar, Ajay
    Biswas, Bhaskar
    APPLIED INTELLIGENCE, 2019, 49 (03) : 1078 - 1097
  • [29] TKEH: an efficient algorithm for mining top-k high utility itemsets
    Kuldeep Singh
    Shashank Sheshar Singh
    Ajay Kumar
    Bhaskar Biswas
    Applied Intelligence, 2019, 49 : 1078 - 1097
  • [30] HMiner: Efficiently mining high utility itemsets
    Krishnamoorthy, Srikumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 : 168 - 183