Fast-join: An efficient method for fuzzy token matching based string similarity join

被引:0
|
作者
Wang, Jiannan [1 ]
Li, Guoliang [1 ]
Fe, Jianhua [1 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
关键词
Compendex;
D O I
5767865
中图分类号
学科分类号
摘要
引用
收藏
页码:458 / 469
相关论文
共 50 条
  • [31] QSJoin: a new string similarity join method based on Q-sample and statistical features
    Wang, Xiaoxia
    Sun, Decai
    Wu, Bo
    Ji, Puzhao
    [J]. INTERNATIONAL JOURNAL OF ARTS AND TECHNOLOGY, 2019, 11 (03) : 285 - 308
  • [32] Fuzzy Similarity Join Algorithm Based on Dynamic Double Prefixes
    Yu C.-Y.
    Wang W.-H.
    Wen X.-J.
    Zhao Y.-H.
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2022, 43 (03): : 321 - 327
  • [33] State-of-the-art in String Similarity Search and Join
    Wandelt, Sebastian
    Deng, Dong
    Gerdjikov, Stefan
    Mishra, Shashwat
    Mitankin, Petar
    Patil, Manish
    Siragusa, Enrico
    Tiskin, Alexander
    Wang, Wei
    Wang, Jiaying
    Leser, Ulf
    [J]. SIGMOD RECORD, 2014, 43 (01) : 64 - 76
  • [34] TokenJoin: Efficient Filtering for Set Similarity Join with Maximum Weighted Bipartite Matching
    Zeakis, Alexandros
    Skoutas, Dimitrios
    Sacharidis, Dimitris
    Papapetrou, Odysseas
    Koubarakis, Manolis
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 16 (04): : 790 - 802
  • [35] An Efficient Similarity Join Algorithm with Cosine Similarity Predicate
    Lee, Dongjoo
    Park, Jaehui
    Shim, Junho
    Lee, Sang-goo
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT 2, 2010, 6262 : 422 - +
  • [36] Efficient Privacy Preserving Protocols for Similarity Join
    Hawashin, Bilal
    Fotouhi, Farshad
    Truta, Traian Marius
    Grosky, William
    [J]. TRANSACTIONS ON DATA PRIVACY, 2012, 5 (01) : 297 - 330
  • [37] Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints
    Wang, Jiannan
    Feng, Jianhua
    Li, Guoliang
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 1219 - 1230
  • [38] I/O-Efficient Similarity Join
    Paghl, Rasmus
    Phaml, Ninh
    Silvestril, Francesco
    Stockel, Morten
    [J]. ALGORITHMS - ESA 2015, 2015, 9294 : 941 - 952
  • [39] I/O-Efficient Similarity Join
    Rasmus Pagh
    Ninh Pham
    Francesco Silvestri
    Morten Stöckel
    [J]. Algorithmica, 2017, 78 : 1263 - 1283
  • [40] I/O-Efficient Similarity Join
    Pagh, Rasmus
    Pham, Ninh
    Silvestri, Francesco
    Stockel, Morten
    [J]. ALGORITHMICA, 2017, 78 (04) : 1263 - 1283