Semantic Matching in Search

被引:105
|
作者
Li, Hang
Xu, Jun
机构
[1] Huawei Technologies, Hong Kong
来源
关键词
INFORMATION; RELEVANCE; MODEL; FRAMEWORK; THINKING; NOTION; RANK;
D O I
10.1561/1500000035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relevance is the most important factor to assure users' satisfaction in search and the success of a search engine heavily depends on its performance on relevance. It has been observed that most of the dissatisfaction cases in relevance are due to term mismatch between queries and documents (e.g., query "ny times" does not match well with a document only containing "New York Times"), because term matching, i.e., the bag-of-words approach, still functions as the main mechanism of modern search engines. It is not exaggerated to say, therefore, that mismatch between query and document poses the most critical challenge in search. Ideally, one would like to see query and document match with each other, if they are topically relevant. Recently, researchers have expended significant effort to address the problem. The major approach is to conduct semantic matching, i.e., to perform more query and document understanding to represent the meanings of them, and perform better matching between the enriched query and document representations. With the availability of large amounts of log data and advanced machine learning techniques, this becomes more feasible and significant progress has been made recently. This survey gives a systematic and detailed introduction to newly developed machine learning technologies for query document matching (semantic matching) in search, particularly web search. It focuses on the fundamental problems, as well as the state-of-the-art solutions of query document matching on form aspect, phrase aspect, word sense aspect, topic aspect, and structure aspect. The ideas and solutions explained may motivate industrial practitioners to turn the research results into products. The methods introduced and the discussions made may also stimulate academic researchers to find new research directions and approaches. Matching between query and document is not limited to search and similar problems can be found in question answering, online advertising, cross-language information retrieval, machine translation, recommender systems, link prediction, image annotation, drug design, and other applications, as the general task of matching between objects from two different spaces. The technologies introduced can be generalized into more general machine learning techniques, which is referred to as learning to match in this survey.
引用
收藏
页码:345 / +
页数:127
相关论文
共 50 条
  • [1] Semantic Matching in APP Search
    Zhuo, Juchao
    Huang, Zeqian
    Liu, Yunfeng
    Kang, Zhanhui
    Cao, Xun
    Li, Mingzhi
    Jin, Long
    [J]. WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 209 - 209
  • [2] CSRS: Code Search with Relevance Matching and Semantic Matching
    Cheng, Yi
    Kuang, Li
    [J]. 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2022), 2022, : 533 - 542
  • [3] Conceptual graph matching for semantic search
    Zhong, JW
    Zhu, HP
    Li, JM
    Yu, Y
    [J]. CONCEPTUAL STRUCTURES: INTEGRATION AND INTERFACES, PROCEEDINGS, 2002, 2393 : 92 - 106
  • [4] PERSONALIZED SEMANTIC MATCHING FOR WEB SEARCH
    Jing, Kunlei
    Hao, Fei
    Zhang, Xizi
    Zhou, Yu
    [J]. 2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 205 - 210
  • [5] A semantic search approach by graph matching with negations and inferences
    Tu, KW
    Lu, J
    Zhu, HP
    Liu, GW
    Yu, Y
    [J]. CONCEPTUAL STRUCTURES FOR KNOWLEDGE CREATION AND COMMUNICATION, 2003, 2746 : 378 - 391
  • [6] Incorporating Semantic Knowledge into Latent Matching Model in Search
    Wang, Shuxin
    Jiang, Xin
    Li, Hang
    Xu, Jun
    Wang, Bin
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2016, 2016, 9994 : 29 - 41
  • [7] Enhancing Semantic Code Search With Deep Graph Matching
    Bibi, Nazia
    Maqbool, Ayesha
    Rana, Tauseef
    Afzal, Farkhanda
    Akgul, Ali
    Eldin, Sayed M.
    [J]. IEEE ACCESS, 2023, 11 : 52392 - 52411
  • [8] Semantic search for matching user requests with profiled enterprises
    Formica, Anna
    Missikoff, Michele
    Pourabbas, Elaheh
    Taglino, Francesco
    [J]. COMPUTERS IN INDUSTRY, 2013, 64 (03) : 191 - 202
  • [9] Scalable Semantic Matching of Queries to Ads in Sponsored Search Advertising
    Grbovic, Mihajlo
    Djuric, Nemanja
    Radosavljevic, Vladan
    Silvestri, Fabrizio
    Baeza-Yates, Ricardo
    Feng, Andrew
    Ordentlich, Erik
    Yang, Lee
    Owens, Gavin
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 375 - 384
  • [10] Extreme Multi-label Learning for Semantic Matching in Product Search
    Chang, Wei-Cheng
    Jiang, Daniel
    Yu, Hsiang-Fu
    Teo, Choon-Hui
    Zhang, Jiong
    Zhong, Kai
    Kolluri, Kedarnath
    Hu, Qie
    Shandilya, Nikhil
    Ievgrafov, Vyacheslav
    Singh, Japinder
    Dhillon, Inderjit S.
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2643 - 2651