TopUMS: Top-k Utility Mining in Stream Data

被引:2
|
作者
Song, Wei [1 ]
Fang, Caiyu [1 ]
Gan, Wensheng [2 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing, Peoples R China
[2] Jinan Univ, Coll Cyber Secur, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
utility mining; high utility itemset; top-k; stream data; EFFICIENT ALGORITHMS; ITEMSETS; PATTERNS;
D O I
10.1109/ICDMW53433.2021.00081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Top-k utility mining has attracted great attention in the past few years. Its goal is to discover a set of patterns that have the k highest utilities in a database. Many algorithms have been proposed to efficiently discover top-k high utility itemsets (HUIs), but most of them assume that the database is static. The data are mostly streaming data with continuous, high-speed, and unrestricted features. Thus, it is not possible to store the complete stream data in the same manner as static data; real-time storage and processing are required. In this study, we propose a novel algorithm called TopUMS to mine the top-k HUIs in stream data using the sliding window model. Additionally, we propose a data structure called uList, which is constructed in a horizontal method without ineffective comparison operations. Moreover, we utilize the common batch utility between two consecutive windows to raise minimum utility threshold for the next sliding window. The experimental results demonstrate that TopUMS outperform the state-of-the-art algorithm in terms of execution time and memory.
引用
收藏
页码:615 / 622
页数:8
相关论文
共 50 条
  • [31] Top-k high utility itemset mining: current status and future directions
    Kumar, Rajiv
    Singh, Kuldeep
    [J]. Knowledge Engineering Review, 2024, 39
  • [32] Top-k high utility pattern mining with effective threshold raising strategies
    Ryang, Heungmo
    Yun, Unil
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 76 : 109 - 126
  • [33] DEVELOPMENT OF AN EFFICIENT TECHNIQUE FOR MINING TOP-K CLOSED HIGH UTILITY ITEMSETS
    Velayudhan, Baby
    Sakthivel
    Subasree
    [J]. IIOAB JOURNAL, 2016, 7 (09) : 150 - 155
  • [34] Mining Top-K multidimensional gradients
    Alves, Ronnie
    Belo, Orlando
    Ribeiro, Joel
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2007, 4654 : 375 - +
  • [35] Mining Top-K constrained cross-level high-utility itemsets over data streams
    Meng Han
    Shujuan Liu
    Zhihui Gao
    Dongliang Mu
    Ang Li
    [J]. Knowledge and Information Systems, 2024, 66 : 2885 - 2924
  • [36] Mining Top-K constrained cross-level high-utility itemsets over data streams
    Han, Meng
    Liu, Shujuan
    Gao, Zhihui
    Mu, Dongliang
    Li, Ang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (05) : 2885 - 2924
  • [37] Incremental Mining of Top-k Maximal Influential Paths in Network Data
    Xu, Enliang
    Hsu, Wynne
    Lee, Mong Li
    Patel, Dhaval
    [J]. TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE- CENTERED SYSTEMS X: SPECIAL ISSUE ON DATABASE- AND EXPERT-SYSTEMS APPLICATIONS, 2013, 8220 : 173 - 199
  • [38] FTKHUIM: A Fast and Efficient Method for Mining Top-K High-Utility Itemsets
    Vu, Vinh V.
    Lam, Mi T. H.
    Duong, Thuy T. M.
    Manh, Ly T.
    Nguyen, Thuy T. T.
    Nguyen, Le V.
    Yun, Unil
    Snasel, Vaclav
    Vo, Bay
    [J]. IEEE ACCESS, 2023, 11 : 104789 - 104805
  • [39] Top-k high average-utility itemsets mining with effective pruning strategies
    Ronghui Wu
    Zhan He
    [J]. Applied Intelligence, 2018, 48 : 3429 - 3445
  • [40] Top-k high average-utility itemsets mining with effective pruning strategies
    Wu, Ronghui
    He, Zhan
    [J]. APPLIED INTELLIGENCE, 2018, 48 (10) : 3429 - 3445