Top-k approximate selection for typicality query results over spatio-textual data

被引:0
|
作者
Meng, Xiangfu [1 ]
Zhang, Xiaoyan [1 ]
Huo, Hongjin [1 ]
Leng, Qiangkui [1 ]
机构
[1] Liaoning Tech Univ, Sch Elect & Informat Engn, Huludao 125105, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatio-textual data; spatial keyword query; Probability density estimation; Typicality analysis; Top-k approximate selection; KEYWORD SEARCH;
D O I
10.1007/s10115-023-02013-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial keyword query is a classical query processing mode for spatio-textual data, which aims to provide users the spatio-textual objects with the highest spatial proximity and textual similarity to the given query. However, the top-k result objects obtained by using the spatial keyword query mode are often similar to each other, while users hope that the system can pick top-k typicality results from the candidate query results in order to make users understand the representative features of the candidate result set. To deal with the problem of typicality analysis and typical object selection of spatio-textual data query results, a typicality evaluation and top-k approximate selection approach is proposed. First, the approach calculates the synthetic distances on dimensions of geographic location, textual semantics, and numeric attributes between all spatio-textual objects. And then, a hybrid index structure that can simultaneously support the location, text, and numeric multi-dimension matching is presented in order to expeditiously obtain the candidate query results. According to the synthetic distances between spatio-textual objects, a Gaussian kernel probability density estimation-based method for measuring the typicality of query results is proposed. To facilitate the query result analysis and top-k typical object selection, the Tournament strategy-based and local neighborhood-based top-k typical object approximate selection algorithms are presented, respectively. The experimental results demonstrated that the text semantic relevancy measuring method for spatio-textual objects is accurate and reasonable, and the local neighborhood-based top-k typicality result approximate selection algorithm achieved both the low error rate and high execution efficiency. The source code and datasets used in this paper are available to be accessed from https://github.com/JiaShengS/Typicality_analysis/.
引用
收藏
页码:1425 / 1468
页数:44
相关论文
共 50 条
  • [1] Top-k approximate selection for typicality query results over spatio-textual data
    Xiangfu Meng
    Xiaoyan Zhang
    Hongjin Huo
    Qiangkui Leng
    [J]. Knowledge and Information Systems, 2024, 66 : 1425 - 1468
  • [2] Approximate Indexing for Top-k Queries over Massive Spatio-textual Data Streams
    Cen, Hangjia
    Xie, Xike
    Cao, Xin
    Weng, Jiali
    [J]. 2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 8 - 11
  • [3] Top-k Spatio-Textual Similarity Join
    Hu, Huiqi
    Li, Guoliang
    Bao, Zhifeng
    Feng, Jianhua
    Wu, Yongwei
    Gong, Zhiguo
    Xu, Yaoqiang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (02) : 551 - 565
  • [4] Top-k Spatio-textual Similarity Search
    Liu, Sitong
    Chu, Yaping
    Hu, Huiqi
    Feng, Jianhua
    Zhu, Xuan
    [J]. WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 602 - 614
  • [5] Top-k Spatio-Textual Similarity Join
    Hu, Huiqi
    Li, Guoliang
    Bao, Zhifeng
    Feng, Jianhua
    Wu, Yongwei
    Gong, Zhiguo
    Xu, Yaoqiang
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1576 - 1577
  • [6] Preference-Aware Top-k Spatio-Textual Queries
    Gao, Yunpeng
    Wang, Yao
    Yi, Shengwei
    [J]. WEB-AGE INFORMATION MANAGEMENT, 2016, 9998 : 186 - 197
  • [7] Privacy-Preserving Top-k Spatio-Textual Similarity Join
    Teng, Yiping
    Jiang, Dongyue
    Sun, Mengmeng
    Zhao, Liang
    Xu, Li
    Fan, Chunlong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 718 - 726
  • [8] Clustering Enhanced Error-tolerant Top-k Spatio-textual Search
    Yong Zhang
    Yu Chen
    Junye Yang
    Jin Wang
    Huiqi Hu
    Chunxiao Xing
    Xiaofang Zhou
    [J]. World Wide Web, 2021, 24 : 1185 - 1214
  • [9] Clustering Enhanced Error-tolerant Top-k Spatio-textual Search
    Zhang, Yong
    Chen, Yu
    Yang, Junye
    Wang, Jin
    Hu, Huiqi
    Xing, Chunxiao
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (04): : 1185 - 1214
  • [10] Top-K representative documents query over geo-textual data stream
    Bin Wang
    Rui Zhu
    Xiaochun Yang
    Guoren Wang
    [J]. World Wide Web, 2018, 21 : 537 - 555