A distributed framework for large-scale semantic trajectory similarity join

被引:0
|
作者
Tian, Ruijie [1 ]
Li, Jiajun [1 ]
Zhang, Weishi [1 ,2 ]
Wang, Fei [1 ,2 ]
机构
[1] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Liaoning, Peoples R China
[2] Key Lab Intelligent Software, Dalian 116026, Liaoning, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Semantic trajectory; Similarity join; Distributed process; TOP-K; SEARCH;
D O I
10.1007/s11042-023-15236-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The similarity join is a common yet expensive operator for large-scale semantic trajectories analytics. In this paper, we propose DFST, an efficient framework for semantic trajectory similarity join in distributed systems. We devise ITS index and summary index, which consider textual, temporal, and spatial domains, and theoretically demonstrate that they can effectively prune pairs of dissimilar trajectories. Moreover, DFST can support most existing similarity functions to quantify the spatial similarity between semantic trajectories. We have conducted extensive experiments on real world datasets, and experimental results show that DFST achieves a 13.6% improvement of join performance compared to existing semantic trajectory similarity join methods.
引用
收藏
页码:16205 / 16229
页数:25
相关论文
共 50 条
  • [31] A large-scale distributed framework for information retrieval in large dynamic search spaces
    Santos, Eugene, Jr.
    Santos, Eunice E.
    Hien Nguyen
    Pan, Long
    Korah, John
    APPLIED INTELLIGENCE, 2011, 35 (03) : 375 - 398
  • [32] Semantic tracking and recommendation using fourfold similarity measure from large scale data using hadoop distributed framework in cloud
    Priyadarshini, R.
    Tamilselvan, Latha
    Rajendran, N.
    INTERNATIONAL JOURNAL OF INTELLIGENT UNMANNED SYSTEMS, 2019, 7 (04) : 189 - 208
  • [33] Towards Big Linked Data: A Large-Scale, Distributed Semantic Data Storage
    Hu, Bo
    Carvalho, Nuno
    Matsutsuka, Takahide
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2013, 9 (04) : 19 - 43
  • [34] Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity
    Vulic, Ivan
    Baker, Simon
    Ponti, Edoardo Maria
    Petti, Ulla
    Leviant, Ira
    Wing, Kelly
    Majewska, Olga
    Bar, Eden
    Malone, Matt
    Poibeau, Thierry
    Reichart, Roi
    Korhonen, Anna
    COMPUTATIONAL LINGUISTICS, 2020, 46 (04) : 847 - 897
  • [35] Learning Multilevel Semantic Similarity for Large-Scale Multi-Label Image Retrieval
    Song, Ge
    Tan, Xiaoyang
    ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 64 - 72
  • [36] A High-Level Framework for Distributed Processing of Large-Scale Graphs
    Krepska, Elzbieta
    Kielmann, Thilo
    Fokkink, Wan
    Bal, Henri
    DISTRIBUTED COMPUTING AND NETWORKING, 2011, 6522 : 155 - 166
  • [37] Trajectory Similarity Join in Spatial Networks
    Shang, Shuo
    Chen, Lisi
    Wei, Zhewei
    Jensen, Christian S.
    Zheng, Kai
    Kalnis, Panos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (11): : 1178 - 1189
  • [38] An Optimized Straggler Mitigation Framework for Large-Scale Distributed Computing Systems
    Said, Samar A.
    Habashy, Shahira M.
    Salem, Sameh A.
    Saad, Elsayed M.
    IEEE Access, 2022, 10 : 97075 - 97088
  • [39] DGCF: A Distributed Greedy Clustering Framework for Large-scale Genomic Sequences
    Yin, Zekun
    Xu, Xiaoming
    Fan, Kaichao
    Li, Ruilin
    Li, Weizhong
    Liu, Weiguo
    Niu, Beifang
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2272 - 2279
  • [40] An Optimized Straggler Mitigation Framework for Large-Scale Distributed Computing Systems
    Said, Samar A.
    Habashy, Shahira M.
    Salem, Sameh A.
    Saad, Elsayed M.
    IEEE ACCESS, 2022, 10 : 97075 - 97088