A Spatial Structure Matching Algorithm for Large Spatial-Textual Datasets

被引:0
|
作者
Liu Z.-D. [1 ]
Lin W.-X. [1 ]
Wu K.-S. [1 ]
机构
[1] College of Computer Science and Software Engineering, Shenzhen University, Shenzhen
来源
基金
中国国家自然科学基金;
关键词
Spatial query; Spatial structure; Spatial-textual data; Subgraph isomorphism; Sweep-line;
D O I
10.11897/SP.J.1016.2022.01261
中图分类号
学科分类号
摘要
Many techniques have been proposed to query and search on the massive spatial-textual data for supporting various location-based services. Traditional spatial keyword query (SKQ) and the recent spatial pattern matching (SPM), however, cannot accurately capture users' query intention on the interested objects and their spatial relations, resulting in a huge number of irrelevant results. As a result, existing techniques still cannot well satisfy users' query requirements. Therefore, this paper proposes a novel query problem-Spatial Structure Matching (SSM), which allows the user to provide a set of keyword objects and meanwhile define the constraints on both distance and direction for any two concerned objects. To answer an SSM query, this paper firstly presents a multi-way join based baseline approach, which decomposes the SSM query into the keyword matching for each individual object, edge matching for a pair of objects, and aggregation matching for a set of objects. To improve the efficiency, we further propose a sweep-line based edge matching algorithm by exploiting the geographic locations of objects to filter out the pairs of objects that cannot meet distance constraints. In addition, we build an object-connection graph using the objects that meet all constraints, and transform the SSM query into a subgraph isomorphism problem, which searches the subgraphs with similar structure as SSM query over the object-connection graph. This problem is well solved by adopting a fast subgraph matching algorithm. Experiments based on four large real-world spatial-textual datasets demonstrate that the proposed approach significantly outperforms the compared approaches by providing refined and effective results, while reducing the query processing time by at least 3 times. © 2022, Science Press. All right reserved.
引用
收藏
页码:1261 / 1275
页数:14
相关论文
共 36 条
  • [1] Junglas I A, Watson R T., Location-based services, Communications of the ACM, 51, 3, pp. 65-69, (2008)
  • [2] Feng K, Cong G, Bhowmick S S, Et al., Towards best region search for data exploration, Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 1055-1070, (2016)
  • [3] Zhao P, Zhu H, Liu Y, Et al., Where to go next: A spatio-temporal gated network for next POI recommendation, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 5877-5884, (2019)
  • [4] Cenamor I, de la Rosa T, Nuez S, Borrajo D., Planning for tourism routes using social networks, Expert Systems with Applications, 69, 1, pp. 1-9, (2017)
  • [5] Liu Y, Liu C, Lu X, Et al., Point-of-interest demand modeling with human mobility patterns, Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 947-955, (2017)
  • [6] Chen Z, Chen L, Cong G, Jensen C S., Location-and keyword-based querying of geo-textual data: A survey, The VLDB Journal, 30, pp. 1-38, (2021)
  • [7] De Felipe I, Hristidis V, Rishe N., Keyword search on spatial databases, Proceedings of the IEEE 24th International Conference on Data Engineering, pp. 656-665, (2008)
  • [8] Liu Xi-Ping, Wan Chang-Xuan, Liu De-Xi, Liao Guo-Qiong, Survey on Spatial Keyword Search, Journal of Software, 27, 2, pp. 329-347, (2016)
  • [9] Cao X, Chen L, Cong G, Et al., Spatial keyword querying, Proceedings of the International Conference on Conceptual Modeling, pp. 16-29, (2012)
  • [10] Cong G, Jensen C S, Wu D M., Efficient retrieval of the top-k most relevant spatial Web objects, Proceedings of the VLDB Endowment, 2, 1, pp. 337-348, (2009)