A Prefix-Filter based Method for Spatio-Textual Similarity Join

被引:13
|
作者
Liu, Sitong [1 ]
Li, Guoliang [1 ]
Feng, Jianhua [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatio-textual objects; similarity join; MBR prefix; hybrid signature;
D O I
10.1109/TKDE.2013.83
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Location-based services have attracted significant attention due to modern mobile phones equipped with GPS devices. These services generate large amounts of spatio-textual data which contain both spatial location and textual descriptions. Since a spatio-textual object may have different representations, possibly because of deviations of GPS or different user descriptions, it calls for efficient methods to integrate spatio-textual data from different sources. In this paper we study a new research problem called spatio-textual similarity join: given two sets of spatio-textual objects, find the similar object pairs. We make the following contributions: (1) We develop a filter-and-refine framework and devise several efficient algorithms. We extend the prefix filter technique to generate spatial and textual signatures for the objects and build inverted index on top of these signatures. Then we generate candidate pairs using the inverted lists of signatures. Finally we refine the candidates and generate the final result. (2) We study how to generate high-quality signatures for spatial information. We develop an MBR-prefix based signature to prune large numbers of dissimilar object pairs. (3) We propose a hybrid signature scheme to support both textual pruning and spatial pruning simultaneously. (4) Experimental results on real and synthetic datasets show that our algorithms achieve high performance and scale well.
引用
收藏
页码:2354 / 2367
页数:14
相关论文
共 45 条
  • [1] Efficient Spatio-textual Similarity Join Using MapReduce
    Zhang, Yu
    Ma, Youzhong
    Meng, Xiaofeng
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2014, : 52 - 59
  • [2] Top-k Spatio-Textual Similarity Join
    Hu, Huiqi
    Li, Guoliang
    Bao, Zhifeng
    Feng, Jianhua
    Wu, Yongwei
    Gong, Zhiguo
    Xu, Yaoqiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (02) : 551 - 565
  • [3] Top-k Spatio-Textual Similarity Join
    Hu, Huiqi
    Li, Guoliang
    Bao, Zhifeng
    Feng, Jianhua
    Wu, Yongwei
    Gong, Zhiguo
    Xu, Yaoqiang
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1576 - 1577
  • [4] Efficient Spatio-Textual Similarity Join Processing on NUMA Systems
    Gautam, Saransh
    Ray, Suprio
    Nickerson, Bradford G.
    2021 22ND IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2021), 2021, : 79 - 88
  • [5] Spatio-Textual Similarity Joins
    Bouros, Panagiotis
    Ge, Shen
    Mamoulis, Nikos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 6 (01): : 1 - 12
  • [6] An Efficient Block Index Scheme with Segmentation for Spatio-Textual Similarity Join
    Xiang, Yiming
    Zhuang, Yi
    Jiang, Nan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (07): : 3578 - 3593
  • [7] Authentication of spatio-textual similarity join queries in untrusted cloud environments
    Yan, Han
    Cheng, Xiang
    Wang, Dezheng
    Su, Sen
    Zhang, Qiying
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (18) : 5518 - 5532
  • [8] Spatio-Textual similarity joins
    Bouros, P. (fpbouros@cs.hku.hk), 1600, Association for Computing Machinery (06):
  • [9] Privacy-Preserving Top-k Spatio-Textual Similarity Join
    Teng, Yiping
    Jiang, Dongyue
    Sun, Mengmeng
    Zhao, Liang
    Xu, Li
    Fan, Chunlong
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 718 - 726
  • [10] SEAL: Spatio-Textual Similarity Search
    Fan, Ju
    Li, Guoliang
    Zhou, Lizhu
    Chen, Shanshan
    Hu, Jun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 824 - 835