Fuzzy Matching of Web Queries to Structured Data

被引:6
|
作者
Cheng, Tao [1 ]
Lauw, Hady W. [2 ]
Paparizos, Stelios [2 ]
机构
[1] Univ Illinois, 201 N Goodwin Ave, Urbana, IL 61801 USA
[2] Microsoft Res, Mountain View, CA 94043 USA
关键词
D O I
10.1109/ICDE.2010.5447817
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recognizing the alternative ways people use to reference an entity, is important for many Web applications that query structured data. In such applications, there is often a mismatch between how content creators describe entities and how different users try to retrieve them. In this paper, we consider the problem of determining whether a candidate query approximately matches with an entity. We propose an off-line, data-driven, bottom-up approach that mines query logs for instances where Web content creators and Web users apply a variety of strings to refer to the same Web pages. This way, given a set of strings that reference entities, we generate an expanded set of equivalent strings for each entity. The proposed method is verified with experiments on real-life data sets showing that we can dramatically increase the queries that can be matched.
引用
收藏
页码:713 / 716
页数:4
相关论文
共 50 条
  • [1] Answering Web Queries Using Structured Data Sources
    Paparizos, Stelios
    Ntoulas, Alexandros
    Shafer, John
    Agrawal, Rakesh
    [J]. ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 1127 - 1129
  • [2] Structured Queries with Generalized Pattern Matching on Encrypted Cloud Data
    Jia, Nan
    Jia, Xiaohua
    Wang, Dongsheng
    Fu, Shaojing
    Xu, Ming
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [3] Semantic matching of natural language Web queries
    Karam, N
    Benbernou, S
    Hacid, MS
    Schneider, M
    [J]. WEB ENGINEERING, PROCEEDINGS, 2004, 3140 : 416 - 429
  • [4] RETRIEVING DEEP WEB DATA THROUGH MULTI-ATTRIBUTES INTERFACES WITH STRUCTURED QUERIES
    Tian, Jian-Wei
    Qi, Wen-Hui
    Liu, Xiao-Xiao
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2011, 21 (04) : 523 - 542
  • [5] Answering Complex Structured Queries over the Deep Web
    Wang, Fan
    Agrawal, Gagan
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 115 - 123
  • [6] Structured Data on the Web
    Cafarella, Michael J.
    Halevy, Alon
    Madhavan, Jayant
    [J]. COMMUNICATIONS OF THE ACM, 2011, 54 (02) : 72 - 79
  • [7] An Efficient Mechanism for Deep Web Data Extraction Based on Tree-Structured Web Pattern Matching
    Ahamed, B. Bazeer
    Yuvaraj, D.
    Shitharth, S.
    Mirza, Olfat M.
    Alsobhi, Aisha
    Yafoz, Ayman
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [8] On Provenance of Queries on Semantic Web Data
    Theoharis, Yannis
    Fundulaki, Irini
    Karvounarakis, Grigoris
    Christophides, Vassilis
    [J]. IEEE INTERNET COMPUTING, 2011, 15 (01) : 31 - 39
  • [9] Estimating Translation Probabilities from the Web for Structured Queries on CLIR
    Saralegi, Xabier
    Lopez de Lacalle, Maddalen
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 586 - 589
  • [10] Matching Objects to User's Queries in Web of Things' Applications
    Xu, Wenyi
    Marsala, Christophe
    Christophe, Benoit
    [J]. PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR COMMUNICATION SYSTEMS AND NETWORKS (CICOMMS), 2013, : 31 - 38