Top-k spatial-keyword publish/subscribe over sliding window

被引:20
|
作者
Wang, Xiang [1 ]
Zhang, Wenjie [1 ]
Zhang, Ying [2 ]
Lin, Xuemin [1 ]
Huang, Zengfeng [1 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ Technol, Ctr Artificial Intelligence, Sydney, NSW, Australia
来源
VLDB JOURNAL | 2017年 / 26卷 / 03期
基金
澳大利亚研究理事会;
关键词
Publish/subscribe system; Top-k spatial-keyword queries; Stream; Sliding window; Distributed processing;
D O I
10.1007/s00778-016-0453-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the prevalence of social media and GPS-enabled devices, a massive amount of geo-textual data have been generated in a stream fashion, leading to a variety of applications such as location-based recommendation and information dissemination. In this paper, we investigate a novel real-time top- monitoring problem over sliding window of streaming data; that is, we continuously maintain the top-k most relevant geo-textual messages (e.g., geo-tagged tweets) for a large number of spatial-keyword subscriptions (e.g., registered users interested in local events) simultaneously. To provide the most recent information under controllable memory cost, sliding window model is employed on the streaming geo-textual data. To the best of our knowledge, this is the first work to study top- spatial-keyword publish/subscribe over sliding window. A novel centralized system, called Skype (Top-k Spatial-keyword Publish/Subscribe), is proposed in this paper. In Skype, to continuously maintain top- results for massive subscriptions, we devise a novel indexing structure upon subscriptions such that each incoming message can be immediately delivered on its arrival. To reduce the expensive top- re-evaluation cost triggered by message expiration, we develop a novel cost-based k -skyband technique to reduce the number of re-evaluations in a cost-effective way. Extensive experiments verify the great efficiency and effectiveness of our proposed techniques. Furthermore, to support better scalability and higher throughput, we propose a distributed version of Skype, namely DSkype, on top of Storm, which is a popular distributed stream processing system. With the help of fine-tuned subscription/message distribution mechanisms, DSkype can achieve orders of magnitude speed-up than its centralized version.
引用
收藏
页码:301 / 326
页数:26
相关论文
共 50 条
  • [21] Relevance Matters: Capitalizing on Less Top-k Matching in Publish/Subscribe
    Sadoghi, Mohammad
    Jacobsen, Hans-Arno
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 786 - 797
  • [22] Diversity-Aware Top-k Publish/Subscribe for Text Stream
    Chen, Lisi
    Cong, Gao
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 347 - 362
  • [23] Reverse spatial top-k keyword queries
    Pritom Ahmed
    Ahmed Eldawy
    Vagelis Hristidis
    Vassilis J. Tsotras
    [J]. The VLDB Journal, 2023, 32 : 501 - 524
  • [24] Sliding Window Top-K Monitoring over Distributed Data Streams
    Lv, Zhijin
    Chen, Ben
    Yu, Xiaohui
    [J]. WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 527 - 540
  • [25] Top-K Collective Spatial Keyword Queries
    Su, Danni
    Zhou, Xu
    Yang, Zhibang
    Zeng, Yifu
    Gao, Yunjun
    [J]. IEEE ACCESS, 2019, 7 : 180779 - 180792
  • [26] Reverse spatial top-k keyword queries
    Ahmed, Pritom
    Eldawy, Ahmed
    Hristidis, Vagelis
    Tsotras, Vassilis J.
    [J]. VLDB JOURNAL, 2023, 32 (03): : 501 - 524
  • [27] Sliding Window Top-K Monitoring over Distributed Data Streams
    Chen B.
    Lv Z.
    Yu X.
    Liu Y.
    [J]. Data Science and Engineering, 2017, 2 (4) : 289 - 300
  • [28] Interactive Top-k Spatial Keyword Queries
    Zheng, Kai
    Su, Han
    Zheng, Bolong
    Shang, Shuo
    Xu, Jiajie
    Liu, Jiajun
    Zhou, Xiaofang
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 423 - 434
  • [29] Top-k term publish/subscribe for geo-textual data streams
    Lisi Chen
    Shuo Shang
    Christian S. Jensen
    Jianliang Xu
    Panos Kalnis
    Bin Yao
    Ling Shao
    [J]. The VLDB Journal, 2020, 29 : 1101 - 1128
  • [30] Efficient Top-k Subscription Matching for Location-Aware Publish/Subscribe
    Hu, Jiafeng
    Cheng, Reynold
    Wu, Dingming
    Jin, Beihong
    [J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES (SSTD 2015), 2015, 9239 : 333 - 351