An Algorithm for Mining Fixed-Length High Utility Itemsets

被引:1
|
作者
Wang, Le [1 ]
机构
[1] Ningbo Univ Finance & Econ, Ningbo 315175, Zhejiang, Peoples R China
关键词
Data mining; High utility itemsets; Pattern growth; Frequent pattern; DISCOVERY; STRATEGY;
D O I
10.1007/978-3-031-11217-1_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility pattern/itemset mining is a hotspot of data mining. Different from the traditional frequent pattern, high utility pattern takes into consideration not only the number of items in the transaction, but also the weight of these items, such as profit and price. Hence the computational complexity of this mining algorithm is higher than the traditional frequent pattern mining. Thus, one essential topic of this field is to reduce the search space and improve the mining efficiency. Constraint on pattern length can effectively reduce algorithm search space while fulfill a certain kind of actual requirement. Addressing fixed length high utility pattern mining, we propose a novel algorithm, called HUIK (High Utility Itemsets with K-lengthMiner), that first compresses transaction data into a tree, then recursively searches high utility patterns with designated length using a pattern growth approach. An effective pruning strategy is also proposed to reduce the number of candidate items on the compressed tree, to further reduce the search space and improve algorithm efficiency. The performance of the algorithm HUIK is verified on six classical datasets. Experimental results verify that the proposed algorithm has a significant improvement in time efficiency, especially for long datasets and dense datasets.
引用
收藏
页码:3 / 20
页数:18
相关论文
共 50 条
  • [41] Efficient algorithm for mining high average-utility itemsets in incremental transaction databases
    Kim, Donggyu
    Yun, Unil
    APPLIED INTELLIGENCE, 2017, 47 (01) : 114 - 131
  • [42] An efficient algorithm for mining high utility itemsets with negative item values in large databases
    Chu, Chun-Jung
    Tseng, Vincent S.
    Liang, Tyne
    APPLIED MATHEMATICS AND COMPUTATION, 2009, 215 (02) : 767 - 778
  • [43] An efficient algorithm for mining top-k on-shelf high utility itemsets
    Thu-Lan Dam
    Li, Kenli
    Fournier-Viger, Philippe
    Quang-Huy Duong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 52 (03) : 621 - 655
  • [44] PHUIMUS: A Potential High Utility Itemsets Mining Algorithm Based on Stream Data with Uncertainty
    Wang, Ju
    Liu, Fuxian
    Jin, Chunjie
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [45] An efficient algorithm for mining top-k on-shelf high utility itemsets
    Thu-Lan Dam
    Kenli Li
    Philippe Fournier-Viger
    Quang-Huy Duong
    Knowledge and Information Systems, 2017, 52 : 621 - 655
  • [46] Efficient algorithm for mining high average-utility itemsets in incremental transaction databases
    Donggyu Kim
    Unil Yun
    Applied Intelligence, 2017, 47 : 114 - 131
  • [47] FHN: An efficient algorithm for mining high-utility itemsets with negative unit profits
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Gan, Wensheng
    KNOWLEDGE-BASED SYSTEMS, 2016, 111 : 283 - 298
  • [48] FSKYMINE: A Faster Algorithm For Mining Skyline Frequent Utility Itemsets
    Hung Manh Nguyen
    Anh Viet Phan
    Lai Van Pham
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 251 - 255
  • [49] Mining long high utility itemsets in transaction databases
    Yu, Guangzhu
    Shao, Shihuang
    Sun, Daoqing
    Luo, Bin
    NEW ADVANCES IN SIMULATION, MODELLING AND OPTIMIZATION (SMO '07), 2007, : 326 - +
  • [50] A New Method for Mining High Average Utility Itemsets
    Lu, Tien
    Vo, Bay
    Nguyen, Hien T.
    Hong, Tzung-Pei
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2014, 2014, 8838 : 33 - 42