Batch Processing of Top-k Spatial-Textual Queries

被引:17
|
作者
Choudhury, Farhana M. [1 ]
Culpepper, J. Shane [1 ]
Bao, Zhifeng [1 ]
Sellis, Timos [2 ]
机构
[1] RMIT Univ, Sch Sci, Melbourne, Vic 3000, Australia
[2] Swinburne Univ Technol, Data Sci Res Inst, Hawthorn, Vic 3122, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Spatial-textual queries; batch queries; spatial-textual indexing; efficient query processing;
D O I
10.1145/3196155
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Since the mid-2000s, everal indexing techniques have been proposed to efficiently answer top-k spatial-textual queries. However, all of these approaches focus on answering one query at a time. In contrast, how to design efficient algorithms that can exploit similarities between incoming queries to improve performance has received little attention. In this article, we study a series of efficient approaches to batch process multiple topk spatial-textual queries concurrently. We carefully design a variety of indexing structures for the problem space by exploring the effect of prioritizing spatial and textual properties on system performance. Specifically, we present an efficient traversal method, SF-Sep, over an existing space-prioritized index structure. Then, we propose a new space-prioritized index structure, the MIR-Tree to support a filter-and-refine based technique, SF-Grp. To support the processing of text-intensive data, we propose an augmented, inverted indexing structure that can easily be added into existing text search engine architectures and a novel traversal method for batch processing of the queries. In all of these approaches, the goal is to improve the overall performance by sharing the I/O costs of similar queries. Finally, we demonstrate significant I/O savings in our algorithms over traditional approaches by extensive experiments on three real datasets and compare how properties of different datasets affect the performance. Many applications in streaming, micro-batching of continuous queries, and privacy-aware search can benefit from this line of work.
引用
收藏
页数:40
相关论文
共 50 条
  • [41] Efficient Processing of Moving Top-k Spatial Keyword Queries in Directed and Dynamic Road Networks
    Attique, Muhammad
    Cho, Hyung-Ju
    Chung, Tae-Sun
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [42] Continuous k Nearest Neighbor Queries over Large-Scale Spatial-Textual Data Streams
    Yang, Rong
    Niu, Baoning
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (11)
  • [43] Approximate distributed top-k queries
    Boaz Patt-Shamir
    Allon Shafrir
    Distributed Computing, 2008, 21 : 1 - 22
  • [44] Top-k Combinatorial Skyline Queries
    Su, I-Fang
    Chung, Yu-Chi
    Lee, Chiang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 79 - +
  • [45] Evaluating top-k selection queries
    Chaudhuri, S
    Gravano, L
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 399 - 410
  • [46] Optimizing Distributed Top-k Queries
    Neumann, Thomas
    Bender, Matthias
    Michel, Sebastian
    Schenkel, Ralf
    Triantafillou, Peter
    Weikum, Gerhard
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2008, PROCEEDINGS, 2008, 5175 : 337 - +
  • [47] Top-k queries on temporal data
    Li, Feifei
    Yi, Ke
    Le, Wangchao
    VLDB JOURNAL, 2010, 19 (05): : 715 - 733
  • [48] Top-k Dominating Queries: an introduction
    Manolopoulos, Yannis
    2015 12th IEEE International Conference on Programming and Systems (ISPS), 2015,
  • [49] Top-k queries on RDF graphs
    Wang, Dong
    Zou, Lei
    Zhao, Dongyan
    INFORMATION SCIENCES, 2015, 316 : 201 - 217
  • [50] Top-k queries on temporal data
    Feifei Li
    Ke Yi
    Wangchao Le
    The VLDB Journal, 2010, 19 : 715 - 733