TopUMS: Top-k Utility Mining in Stream Data

被引:2
|
作者
Song, Wei [1 ]
Fang, Caiyu [1 ]
Gan, Wensheng [2 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing, Peoples R China
[2] Jinan Univ, Coll Cyber Secur, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
utility mining; high utility itemset; top-k; stream data; EFFICIENT ALGORITHMS; ITEMSETS; PATTERNS;
D O I
10.1109/ICDMW53433.2021.00081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Top-k utility mining has attracted great attention in the past few years. Its goal is to discover a set of patterns that have the k highest utilities in a database. Many algorithms have been proposed to efficiently discover top-k high utility itemsets (HUIs), but most of them assume that the database is static. The data are mostly streaming data with continuous, high-speed, and unrestricted features. Thus, it is not possible to store the complete stream data in the same manner as static data; real-time storage and processing are required. In this study, we propose a novel algorithm called TopUMS to mine the top-k HUIs in stream data using the sliding window model. Additionally, we propose a data structure called uList, which is constructed in a horizontal method without ineffective comparison operations. Moreover, we utilize the common batch utility between two consecutive windows to raise minimum utility threshold for the next sliding window. The experimental results demonstrate that TopUMS outperform the state-of-the-art algorithm in terms of execution time and memory.
引用
收藏
页码:615 / 622
页数:8
相关论文
共 50 条
  • [41] On Top-K Closed Sequential Patterns Mining
    Wang, Jing
    Zhang, Lei
    Liu, Guiquan
    Liu, Qi
    Chen, Enhong
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 295 - 300
  • [42] Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data
    Song, Chunyao
    Li, Zheng
    Ge, Tingjian
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 146 - 157
  • [43] Interactive mining of top-K frequent closed itemsets from data streams
    Li, Hua-Fu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (07) : 10779 - 10788
  • [44] Mining top-k frequent patterns over data streams sliding window
    Chen, Hui
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 111 - 131
  • [45] Using Bloom Filters for Mining Top-k Frequent Itemsets in Data Streams
    Kim, Younghee
    Cho, Kyungsoo
    Yoon, Jaeyeol
    Kim, Ieejoon
    Kim, Ungmo
    [J]. SECURE AND TRUST COMPUTING, DATA MANAGEMENT, AND APPLICATIONS, 2011, 186 : 209 - 216
  • [46] Mining Top-K Frequent Closed Patterns from Gene Expression Data
    Ji, Shufan
    Wang, Xuejiao
    Zong, Yi
    Gao, Xiaopeng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 732 - 739
  • [47] Mining top-K significant itemsets in landmark windows over data streams
    Yang, Bei
    Huang, Houkuan
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (03): : 463 - 473
  • [48] Mining top-k frequent patterns over data streams sliding window
    Hui Chen
    [J]. Journal of Intelligent Information Systems, 2014, 42 : 111 - 131
  • [49] Heuristically mining the top-k high-utility itemsets with cross-entropy optimization
    Wei Song
    Chuanlong Zheng
    Chaomin Huang
    Lu Liu
    [J]. Applied Intelligence, 2022, 52 : 17026 - 17041
  • [50] Top-k queries on temporal data
    Li, Feifei
    Yi, Ke
    Le, Wangchao
    [J]. VLDB JOURNAL, 2010, 19 (05): : 715 - 733