Efficient Computation of Top-k Frequent Terms over Spatio-temporal Ranges

Cited by: 20
Authors
Ahmed, Pritom [1]
Hasan, Mahbub [1]
Kashyap, Abhijith [1]
Hristidis, Vagelis [1]
Tsotras, Vassilis J. [1]
Affiliations
[1] UC Riverside, Riverside, CA 92521 USA
Funding
U.S. National Science Foundation;
Keywords
Top-K; Spatio-Temporal Databases; Social Networks;
DOI
10.1145/3035918.3064032
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The wide availability of tracking devices has drastically increased the role of geolocation in social networks, resulting in new commercial applications; for example, marketers can identify current trending topics within a region of interest and focus their products accordingly. In this paper we study a basic analytics query on geo-tagged data, namely: given a spatiotemporal region, find the most frequent terms among the social posts in that region. While there has been prior work on keyword search on spatial data (find the objects nearest to the query point that contain the query keywords), and on group keyword search on spatial data (retrieving groups of objects), our problem is different in that it returns keywords and aggregated frequencies as output, instead of having the keyword as input. Moreover, we differ from works addressing the streamed version of this query in that we operate on large, disk-resident data and we provide exact answers. We propose an index structure and algorithms to efficiently answer such top-k spatiotemporal range queries, which we refer to as Top-k Frequent Spatiotemporal Terms (kFST) queries. Our index structure employs an R-tree augmented by top-k sorted term lists (STLs), where a key challenge is to balance the size of the index to achieve faster execution and smaller space requirements. We theoretically study and experimentally validate the ideal length of the stored term lists, and perform detailed experiments to evaluate the performance of the proposed methods compared to baselines on real datasets.
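To make the query definition in the abstract concrete, the following is a minimal brute-force sketch of a kFST query: it scans every post, keeps those inside the spatiotemporal range, and returns the k most frequent terms with their counts. It is not the paper's R-tree/STL index, only a naive baseline of the kind such an index would be compared against. The names `Post`, `Range3D`, and `kfst_bruteforce` are hypothetical, and two details are assumptions not stated in the abstract: every occurrence of a term is counted, and ties are broken alphabetically.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Post:
    """A geo-tagged post (hypothetical record layout)."""
    x: float
    y: float
    t: float
    terms: Tuple[str, ...]


@dataclass(frozen=True)
class Range3D:
    """Axis-aligned spatiotemporal query box (hypothetical)."""
    x_min: float
    x_max: float
    y_min: float
    y_max: float
    t_min: float
    t_max: float

    def contains(self, p: Post) -> bool:
        # Inclusive bounds on all three dimensions.
        return (self.x_min <= p.x <= self.x_max
                and self.y_min <= p.y <= self.y_max
                and self.t_min <= p.t <= self.t_max)


def kfst_bruteforce(posts: Sequence[Post], query: Range3D, k: int) -> List[Tuple[str, int]]:
    """Return the k most frequent terms among posts inside `query`.

    Assumption: every term occurrence counts; ties are broken alphabetically.
    """
    counts: Counter = Counter()
    for p in posts:
        if query.contains(p):
            counts.update(p.terms)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:k]


posts = [
    Post(1, 1, 1, ("coffee", "rain")),
    Post(2, 2, 2, ("coffee",)),
    Post(9, 9, 9, ("coffee", "snow")),   # outside the query box
    Post(1, 2, 3, ("rain", "traffic")),
]
print(kfst_bruteforce(posts, Range3D(0, 5, 0, 5, 0, 5), k=2))
# [('coffee', 2), ('rain', 2)]
```

The full scan costs time linear in the number of posts for every query, which is the cost an index over large, disk-resident data is meant to avoid.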
Pages: 1227-1241
Number of pages: 15
Related papers
50 in total
  • [1] Efficient top-k recently-frequent term querying over spatio-temporal textual streams
    Dam, Thu-Lan
    Chester, Sean
    Norvag, Kjetil
    Duong, Quang-Huy
    [J]. INFORMATION SYSTEMS, 2021, 97
  • [2] Spatio-temporal top-k term search over sliding window
    Lisi Chen
    Shuo Shang
    Bin Yao
    Kai Zheng
    [J]. World Wide Web, 2019, 22 : 1953 - 1970
  • [3] Spatio-temporal top-k term search over sliding window
    Chen, Lisi
    Shang, Shuo
    Yao, Bin
    Zheng, Kai
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (05): : 1953 - 1970
  • [4] Scalable Top-k Spatio-Temporal Term Querying
    Skovsgaard, Anders
    Sidlauskas, Darius
    Jensen, Christian S.
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 148 - 159
  • [5] Approximate spatio-temporal top-k publish/subscribe
    Lisi Chen
    Shuo Shang
    [J]. World Wide Web, 2019, 22 : 2153 - 2175
  • [6] Approximate spatio-temporal top-k publish/subscribe
    Chen, Lisi
    Shang, Shuo
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (05): : 2153 - 2175
  • [7] Efficient computation of frequent and top-k elements in data streams
    Metwally, A
    Agrawal, D
    El Abbadi, A
    [J]. DATABASE THEORY - ICDT 2005, PROCEEDINGS, 2005, 3363 : 398 - 412
  • [8] Scalable Spatio-temporal Top-k Interaction Queries on Dynamic Communities
    Almaslukh, Abdulaziz
    Liu, Yongyi
    Magdy, Amr
    [J]. ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2024, 10 (01)
  • [9] What happened then and there: Top-k spatio-temporal keyword query
    Liu, Xiping
    Wan, Changxuan
    Xiong, Neal N.
    Liu, Dexi
    Liao, Guoqiong
    Deng, Song
    [J]. INFORMATION SCIENCES, 2018, 453 : 281 - 301
  • [10] Efficient Discovery of Top-K Sequential Patterns in Event-Based Spatio-Temporal Data
    Maciag, Piotr S.
    [J]. PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 47 - 56