Top-k Frequent Items and Item Frequency Tracking over Sliding Windows of Any Sizes

被引:3
|
作者
Song, Chunyao [1 ,2 ]
Liu, Xuanming [3 ]
Ge, Tingjian [3 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
[2] UMass Lowell, Lowell, MA 01854 USA
[3] Univ Massachusetts, Lowell, MA USA
关键词
D O I
10.1109/ICDE.2017.74
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many big data applications today require querying highly dynamic and large-scale data streams for top-k frequent items in the most recent window of any specified size at any time. This is a challenging problem. We show that our novel solution is not only accurate, but it also one to two orders of magnitude faster than previous approaches. Moreover, its memory footprint grows only logarithmically with the window size, rather than linearly as in previous work. Our comprehensive experiments over real-world datasets show that our solution is very effective and scalable. In addition, we devise a concise and efficient solution to a related problem of tracking the frequency of selected items, improving upon previous work by twenty to thirty times in model conciseness while providing the same accuracy and efficiency.
引用
收藏
页码:199 / 202
页数:4
相关论文
共 45 条
  • [31] Location-Based Top-k Term Querying over Sliding Window
    Xu, Ying
    Chen, Lisi
    Yao, Bin
    Shang, Shuo
    Zhu, Shunzhi
    Zheng, Kai
    Li, Fang
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 : 299 - 314
  • [32] Spatio-temporal top-k term search over sliding window
    Chen, Lisi
    Shang, Shuo
    Yao, Bin
    Zheng, Kai
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (05): : 1953 - 1970
  • [33] Efficient Computation of Top-k Frequent Terms over Spatio-temporal Ranges
    Ahmed, Pritom
    Hasan, Mahbub
    Kashyap, Abhijith
    Hristidis, Vagelis
    Tsotras, Vassilis J.
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1227 - 1241
  • [34] A Differentially Private Scheme for Top-k Frequent Itemsets Mining Over Data Streams
    Liang, Wen-Juan
    Chen, Hong
    Zhao, Su-Yun
    Li, Cui-Ping
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (04): : 741 - 760
  • [35] Geo-Social Keyword Top-k Data Monitoring over Sliding Window
    Nishio, Shunya
    Amagata, Daichi
    Hara, Takahiro
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT I, 2017, 10438 : 409 - 424
  • [36] Sliding window top-k dominating query processing over distributed data streams
    Amagata, Daichi
    Hara, Takahiro
    Nishio, Shojiro
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 535 - 566
  • [37] Sliding window top-k dominating query processing over distributed data streams
    Daichi Amagata
    Takahiro Hara
    Shojiro Nishio
    [J]. Distributed and Parallel Databases, 2016, 34 : 535 - 566
  • [38] SKYPE: Top-k Spatial-keyword Publish/Subscribe Over Sliding Window
    Wang, Xiang
    Zhang, Ying
    Zhang, Wenjie
    Lin, Xuemin
    Huang, Zengfeng
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (07): : 588 - 599
  • [39] Mining Top-k Frequent-regular Itemsets from Data Streams Based on Sliding Window Technique
    Mesama, Tashinee
    Amphawan, Komate
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS (ICAICTA 2018), 2018, : 224 - 230
  • [40] Mining Frequent Item Sets in Asynchronous Transactional Data Streams over Time Sensitive Sliding Windows Model
    Javaid, Qaisar
    Memon, Farida
    Talpur, Shahnawaz
    Arif, Muhammad
    Awan, Muhammad Daud
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2016, 35 (04) : 625 - 644