Supporting similarity operations based on approximate string matching on the web

被引:0
|
作者
Schallehn, E
Geist, I
Sattler, KU
机构
[1] Univ Magdeburg, Dept Comp Sci, D-39106 Magdeburg, Germany
[2] Tech Univ Ilmenau, Dept Comp Sci & Automat, D-98684 Ilmenau, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Querying and integrating sources of structured data from the Web in most cases requires similarity-based concepts to deal with data level conflicts. This is due to the often erroneous and imprecise nature of the data and diverging conventions for their representation. On the other hand, Web databases offer only limited interfaces and almost no support for similarity queries. The approach presented in this paper maps string similarity predicates to standard predicates like substring and keyword search as offered by many of the mentioned systems. To minimize the local processing costs and the required network traffic, the mapping uses materialized information on the selectivity of string samples such as q-samples, substrings, and keywords. Based on the predicate mapping similarity selections and joins are described and the quality and required effort of the operations is evaluated experimentally.
引用
收藏
页码:227 / 244
页数:18
相关论文
共 50 条
  • [1] Approximate string matching based on bit operations
    Hantinei, E.
    Yu, Yunqing
    Baba, Kensuke
    Murakami, Kazuaki
    [J]. RECENT PROGRESS IN COMPUTATIONAL SCIENCES AND ENGINEERING, VOLS 7A AND 7B, 2006, 7A-B : 1473 - 1476
  • [2] Approximate string matching based on bit operations
    Hanmei, E.
    Yu, Yunqing
    Baba, Kensuke
    Murakami, Kazuaki
    [J]. RECENT PROGRESS IN COMPUTATIONAL SCIENCES AND ENGINEERING, VOLS 7A AND 7B, 2006, 7A-B : 195 - 198
  • [3] APPROXIMATE STRING MATCHING
    HALL, PAV
    DOWLING, GR
    [J]. COMPUTING SURVEYS, 1980, 12 (04) : 381 - 402
  • [4] Supporting Uncertain Predicates in DBMS Using Approximate String Matching and Probabilistic Databases
    Jumde, Amol S.
    Keskar, Ravindra B.
    [J]. IEEE ACCESS, 2020, 8 : 169070 - 169081
  • [5] An accurate toponym-matching measure based on approximate string matching
    Kilinc, Deniz
    [J]. JOURNAL OF INFORMATION SCIENCE, 2016, 42 (02) : 138 - 149
  • [6] ALGORITHMS FOR APPROXIMATE STRING MATCHING
    UKKONEN, E
    [J]. INFORMATION AND CONTROL, 1985, 64 (1-3): : 100 - 118
  • [7] A Preprocessing for Approximate String Matching
    Baba, Kensuke
    Nakatoh, Tetsuya
    Yamada, Yasuhiro
    Ikeda, Daisuke
    [J]. INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II, 2011, 252 : 610 - +
  • [8] Spatial Approximate String Matching
    Katsumata, Akifumi
    Miura, Takao
    [J]. 2009 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 123 - 128
  • [9] Faster approximate string matching
    BaezaYates, R
    Navarro, G
    [J]. ALGORITHMICA, 1999, 23 (02) : 127 - 158
  • [10] Content based video retrieval based on approximate string matching
    Masihi, ZG
    Charkari, NM
    [J]. Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 1300 - 1303