TopUMS: Top-k Utility Mining in Stream Data

Times cited: 2
Authors
Song, Wei [1]
Fang, Caiyu [1]
Gan, Wensheng [2]
Affiliations
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing, Peoples R China
[2] Jinan Univ, Coll Cyber Secur, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
utility mining; high utility itemset; top-k; stream data; efficient algorithms; itemsets; patterns
DOI
10.1109/ICDMW53433.2021.00081
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Top-k utility mining has attracted great attention in the past few years. Its goal is to discover the k patterns with the highest utilities in a database. Many algorithms have been proposed to efficiently discover top-k high utility itemsets (HUIs), but most of them assume that the database is static. However, most data are streaming data that are continuous, high-speed, and unbounded. Thus, the complete stream cannot be stored in the same manner as static data; it must be stored and processed in real time. In this study, we propose a novel algorithm called TopUMS to mine the top-k HUIs in stream data using the sliding window model. Additionally, we propose a data structure called uList, which is constructed horizontally and avoids ineffective comparison operations. Moreover, we use the batch utility shared by two consecutive windows to raise the minimum utility threshold for the next sliding window. The experimental results demonstrate that TopUMS outperforms the state-of-the-art algorithm in terms of execution time and memory usage.
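To make the problem setting in the abstract concrete, the following is a minimal brute-force sketch of top-k high utility itemset mining over a sliding window of batches. It is not the TopUMS algorithm: it uses neither uList nor threshold raising, and it enumerates every itemset, so it is only practical for tiny inputs. The function names, the dict-based transaction format (item mapped to its utility in that transaction), and the sample batches are all assumptions made for illustration.

```python
from collections import deque
from itertools import combinations
import heapq


def itemset_utilities(transactions):
    """Sum the utility of every itemset over all transactions (brute force)."""
    totals = {}
    for tx in transactions:  # tx maps item -> utility of that item in this transaction
        items = sorted(tx)
        for size in range(1, len(items) + 1):
            for combo in combinations(items, size):
                totals[combo] = totals.get(combo, 0) + sum(tx[i] for i in combo)
    return totals


def top_k_in_window(window, k):
    """Return the k itemsets with the highest utility in the current window."""
    transactions = [tx for batch in window for tx in batch]
    totals = itemset_utilities(transactions)
    # Highest utility first; ties are broken by itemset order (an arbitrary choice).
    return heapq.nsmallest(k, totals.items(), key=lambda kv: (-kv[1], kv[0]))


def mine_stream(batches, window_size, k):
    """Slide a window of `window_size` batches over the stream, mining each full window."""
    window = deque(maxlen=window_size)  # the oldest batch is dropped automatically
    results = []
    for batch in batches:
        window.append(batch)
        if len(window) == window_size:
            results.append(top_k_in_window(window, k))
    return results


# Hypothetical stream of three batches, illustrative values only.
sample_batches = [
    [{"a": 5, "b": 2}, {"b": 4, "c": 3}],
    [{"a": 1, "c": 6}],
    [{"a": 2, "b": 3, "c": 1}],
]

if __name__ == "__main__":
    for top in mine_stream(sample_batches, window_size=2, k=2):
        print(top)
```

Each window is recomputed from scratch here. The abstract indicates that TopUMS instead reuses the utility of batches shared by consecutive windows to raise the minimum utility threshold, which this sketch does not attempt.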
Pages: 615-622
Number of pages: 8