Generalized maximal utility for mining high average-utility itemsets

Cited by: 21
Authors
Song, Wei [1 ,2 ]
Liu, Lu [1 ]
Huang, Chaomin [1 ]
Affiliations
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
[2] Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing 100144, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data mining; High average-utility itemset; Generalized maximal utility; Generalized average-utility upper bound; Critical support count; EFFICIENT ALGORITHM; PATTERNS; WINDOW;
DOI
10.1007/s10115-021-01614-z
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mining high average-utility itemsets (HAUIs) is a promising research topic in data mining because, in contrast to high utility itemsets, they are not biased toward long itemsets. Regardless of which upper bounds and pruning strategies are used, most existing HAUI mining algorithms are founded on the concept of maximal utility, namely the highest utility of a single item in each transaction. In this paper, we study this problem by generalizing the typical maximal utility and average-utility upper bound from a single item to an itemset, and propose an efficient HAUI mining algorithm based on generalized maximal utility (HAUIM-GMU). For this algorithm, we first propose the concepts of generalized maximal utility and the generalized average-utility upper bound, and discuss how the proposed upper bound can be made tighter to generate fewer candidates. A new pruning strategy is then proposed based on the concept of support, and this is shown to be effective for filtering out unpromising itemsets. The final algorithm is described in detail. Extensive experimental results show that the HAUIM-GMU algorithm outperforms existing state-of-the-art algorithms.
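To make the abstract's terms concrete, the following is a minimal sketch of the classical single-item notions it builds on: the average utility of an itemset, the maximal utility of a transaction, and the resulting average-utility upper bound. It does not implement the paper's generalized maximal utility or HAUIM-GMU, whose definitions are not given in this record. The toy database, the function names, and the threshold are all assumptions made for illustration.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps an item to its utility
# in that transaction (quantity times unit profit, already multiplied out).
transactions = [
    {"a": 5, "b": 2, "c": 1},
    {"a": 10, "c": 6},
    {"b": 4, "c": 3, "d": 8},
]


def average_utility(itemset, db):
    """Sum, over transactions containing the itemset, of its utility
    in that transaction divided by the itemset's length."""
    items = set(itemset)
    total = 0.0
    for t in db:
        if items.issubset(t):
            total += sum(t[i] for i in items) / len(items)
    return total


def maximal_utility(transaction):
    """Classical maximal utility: the highest utility of a single item."""
    return max(transaction.values())


def upper_bound(itemset, db):
    """Classical average-utility upper bound: the sum of maximal utilities
    of the transactions that contain the itemset."""
    items = set(itemset)
    return sum(maximal_utility(t) for t in db if items.issubset(t))


def brute_force_hauis(db, min_au):
    """Enumerate every itemset and keep those whose average utility
    reaches min_au. Exponential; for illustration only."""
    all_items = sorted(set().union(*db))
    result = {}
    for k in range(1, len(all_items) + 1):
        for itemset in combinations(all_items, k):
            au = average_utility(itemset, db)
            if au >= min_au:
                result[itemset] = au
    return result


if __name__ == "__main__":
    print(average_utility(("a", "c"), transactions))
    print(upper_bound(("a", "c"), transactions))
    print(brute_force_hauis(transactions, min_au=10))
```

Because an average never exceeds its largest term, the upper bound is at least the average utility for every itemset, and it cannot grow when items are added. That is what lets a miner discard an itemset and all of its supersets once the bound falls below the threshold; a tighter bound discards more candidates.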
Pages: 2947-2967
Page count: 21