Efficient list based mining of high average utility patterns with maximum average pruning strategies

被引:41
|
作者
Kim, Heonho [1 ]
Yun, Unil [1 ]
Baek, Yoonji [1 ]
Kim, Jongseong [1 ]
Vo, Bay [2 ]
Yoon, Eunchul [3 ]
Fujita, Hamido [4 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Konkuk Univ, Dept Elect Engn, Seoul, South Korea
[4] Iwate Prefectural Univ, Fac Software & Informat Sci, Takizawa, Iwate, Japan
基金
新加坡国家研究基金会;
关键词
High average-utility pattern mining; Pruning technique; Maximum average utility upper-bound; Maximum remaining average utility; List structure; FREQUENT PATTERNS; DATA STREAMS; ITEMSETS; ALGORITHM; RULES; TIME;
D O I
10.1016/j.ins.2020.07.043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High average utility pattern mining is the concept proposed to complement drawbacks of high utility pattern mining by considering lengths of patterns along with the utilities of the patterns. High average utility pattern mining should be able to gratify the anti-monotone property like other pattern mining techniques. Many high average utility pattern mining studies to satisfy the anti-monotone property have been proposed in order to improve various upper-bounds because the performance of pattern mining can be improved efficiently by satisfying the anti-monotone property. Although those upper-bounds can effectively reduce the search space, they still take a lot of cost to calculate all unpromising patterns or cannot find them in advance. Therefore, in this paper, a novel high average utility pattern mining approach is proposed by employing two novel upper-bounds called tight maximum average utility upper-bound and maximum remaining average utility upper-bound. Moreover, a newly suggested list-based structure, TA-List, is designed to adopt two pruning strategies. The proposed technique can efficiently extract high average utility patterns by reducing search space. To evaluate the performance of the proposed method, various experiments using real and synthetic datasets are conducted in terms of runtime, memory usage and scalability and the proposed approach is compared with the state-of-the-art high average utility pattern mining algorithms. The results of experiments show that the suggested algorithm has better performance with regard to runtime, memory usage and scalability. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:85 / 105
页数:21
相关论文
共 50 条
  • [41] Efficient algorithm for mining high average-utility itemsets in incremental transaction databases
    Kim, Donggyu
    Yun, Unil
    APPLIED INTELLIGENCE, 2017, 47 (01) : 114 - 131
  • [42] EHAUPM: Efficient High Average-Utility Pattern Mining With Tighter Upper Bounds
    Lin, Jerry Chun-Wei
    Ren, Shifeng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    IEEE ACCESS, 2017, 5 : 12927 - 12940
  • [43] Efficient algorithm for mining high average-utility itemsets in incremental transaction databases
    Donggyu Kim
    Unil Yun
    Applied Intelligence, 2017, 47 : 114 - 131
  • [44] Generalized maximal utility for mining high average-utility itemsets
    Song, Wei
    Liu, Lu
    Huang, Chaomin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (11) : 2947 - 2967
  • [45] Generalized maximal utility for mining high average-utility itemsets
    Wei Song
    Lu Liu
    Chaomin Huang
    Knowledge and Information Systems, 2021, 63 : 2947 - 2967
  • [46] Effective utility mining with the measure of average utility
    Hong, Tzung-Pei
    Lee, Cho-Han
    Wang, Shyue-Liang
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8259 - 8265
  • [47] High average-utility sequential pattern mining based on uncertain databases
    Jerry Chun-Wei Lin
    Ting Li
    Matin Pirouz
    Ji Zhang
    Philippe Fournier-Viger
    Knowledge and Information Systems, 2020, 62 : 1199 - 1228
  • [48] H-Map-Based Technique for Mining High Average Utility Itemset
    Bhuvaneswari, M. S.
    Balaganesh, N.
    Muneeswaran, K.
    IETE JOURNAL OF RESEARCH, 2023, 69 (12) : 8816 - 8828
  • [49] High average-utility sequential pattern mining based on uncertain databases
    Lin, Jerry Chun-Wei
    Li, Ting
    Pirouz, Matin
    Zhang, Ji
    Fournier-Viger, Philippe
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (03) : 1199 - 1228
  • [50] TUB-HAUPM: Tighter Upper Bound for Mining High Average-Utility Patterns
    Wu, Jimmy Ming-Tai
    Lin, Jerry Chun-Wei
    Pirouz, Matin
    Fournier-Viger, Philippe
    IEEE ACCESS, 2018, 6 : 18655 - 18669