FAST: Frequency-Aware Indexing for Spatio-Textual Data Streams

被引:17
|
作者
Mahmood, Ahmed R. [1 ]
Aly, Ahmed M. [2 ]
Aref, Walid G. [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Google Inc, Mountain View, CA USA
关键词
D O I
10.1109/ICDE.2018.00036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many applications need to process massive streams of spatio-textual data in real-time against continuous spatio-textual queries. For example, in location-aware ad targeting publish/subscribe systems, it is required to disseminate millions of ads and promotions to millions of users based on the locations and textual profiles of users. In this paper, we study indexing of continuous spatio-textual queries. There exist several related spatio-textual indexes that typically integrate a spatial index with a textual index. However, these indexes usually have a high demand for main-memory and assume that the entire vocabulary of keywords is known in advance. Also, these indexes do not successfully capture the variations in the frequencies of keywords across different spatial regions and treat frequent and infrequent keywords in the same way. Moreover, existing indexes do not adapt to the changes in workload over space and time. For example, some keywords may be trending at certain times in certain locations and this may change as time passes. This affects the indexing and searching performance of existing indexes significantly. In this paper, we introduce FAST, a Frequency-Aware Spatio-Textual index for continuous spatio-textual queries. FAST is a main-memory index that requires up to one third of the memory needed by the state-of-the-art index. FAST does not assume prior knowledge of the entire vocabulary of indexed objects. FAST adaptively accounts for the difference in the frequencies of keywords within their corresponding spatial regions to automatically choose the best indexing approach that optimizes the insertion and search times. Extensive experimental evaluation using real and synthetic datasets demonstrates that FAST is up to 3x faster in search time and 5x faster in insertion time than the state-of-the-art indexes.
引用
收藏
页码:305 / 316
页数:12
相关论文
共 31 条
  • [1] Approximate Indexing for Top-k Queries over Massive Spatio-textual Data Streams
    Cen, Hangjia
    Xie, Xike
    Cao, Xin
    Weng, Jiali
    2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 8 - 11
  • [2] Spatio-textual indexing for geographical search on the web
    Vaid, S
    Jones, CB
    Joho, H
    Sanderson, M
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2005, 3633 : 218 - 235
  • [3] A time-aware spatio-textual recommender system
    Kefalas, Pavlos
    Manolopoulos, Yannis
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 78 : 396 - 406
  • [4] SSTD: A Distributed System on Streaming Spatio-Textual Data
    Chen, Yue
    Chen, Zhida
    Cong, Gao
    Mahmood, Ahmed R.
    Aref, Walid G.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (11): : 2284 - 2296
  • [5] Preference-Aware Top-k Spatio-Textual Queries
    Gao, Yunpeng
    Wang, Yao
    Yi, Shengwei
    WEB-AGE INFORMATION MANAGEMENT, 2016, 9998 : 186 - 197
  • [6] LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams
    Patil, Mayur
    Magdy, Amr
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1607 - 1618
  • [7] Density and Frequency-Aware Cluster Identification for Spatio-Temporal Sequence Data
    Keon Myung Lee
    Sang Yeon Lee
    Kyung Mi Lee
    Sang Ho Lee
    Wireless Personal Communications, 2017, 93 : 47 - 65
  • [8] Density and Frequency-Aware Cluster Identification for Spatio-Temporal Sequence Data
    Lee, Keon Myung
    Lee, Sang Yeon
    Lee, Kyung Mi
    Lee, Sang Ho
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 93 (01) : 47 - 65
  • [9] FAST: Frequency-aware static timing analysis
    Seth, K
    Anantaraman, A
    Mueller, F
    Rotenberg, E
    RTSS 2003: 24TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2003, : 40 - 51
  • [10] A Location-Aware Publish/Subscribe Framework for Parameterized Spatio-Textual Subscriptions
    Hu, Huiqi
    Liu, Yiqun
    Li, Guoliang
    Feng, Jianhua
    Tan, Kian-Lee
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 711 - 722