Context- and Content-aware Embeddings for Query Rewriting in Sponsored Search

被引:59
|
作者
Grbovic, Mihajlo [1 ]
Djuric, Nemanja [1 ]
Radosavljevic, Vladan [1 ]
Silvestri, Fabrizio [2 ]
Bhamidipati, Narayan [1 ]
机构
[1] Yahoo Labs, 701 First Ave, Sunnyvale, CA 94085 USA
[2] Yahoo Labs, London, England
关键词
Query rewriting; word embeddings;
D O I
10.1145/2766462.2767709
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Search engines represent one of the most popular web services, visited by more than 85% of internet users on a daily basis. Advertisers are interested in making use of this vast business potential, as very clear intent signal communicated through an issued query allows effective targeting of users. This idea is embodied in a sponsored search model, where each advertiser maintains a list of keywords they deem indicative of increased user response rate with regards to their business. According to this targeting model, when a query is issued all advertisers with a matching keyword are entered into an auction according to the amount they bid for the query and the winner gets to show their ad. One of the main challenges is the fact that a query may not match many keywords, resulting in lower auction value, lower ad quality, and lost revenue for advertisers and publishers. Possible solution is to expand a query into a set of related queries and use them to increase the number of matched ads, called query rewriting. To this end, we propose rewriting method based on a novel query embedding algorithm, which jointly models query content as well as its context within a search session. As a result, semantically similar queries are mapped into vectors close in the embedding space, which allows expansion of a query via simple K-nearest neighbor search. The method was trained on more than 12 billion sessions, one of the largest corpus reported thus far, and evaluated on both public TREC data set and an in-house sponsored search data set. The results show that the proposed approach significantly outperformed existing state-of-the-art, strongly indicating its benefits and monetization potential.
引用
收藏
页码:383 / 392
页数:10
相关论文
共 22 条
  • [1] Special Section on Ultrawide Context- and Content-Aware Imaging
    Bremond, Francois
    Platisa, Ljiljana
    Battiato, Sebastiano
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (06)
  • [2] CONTENT-AWARE SPEAKER EMBEDDINGS FOR SPEAKER DIARISATION
    Sun, G.
    Liu, D.
    Zhang, C.
    Woodland, P. C.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7168 - 7172
  • [3] Sponsored Advertising for IDTV: a Personalized And Content-aware Approach
    Diaz Redondo, Rebeca P.
    Fernandez Vilas, Ana
    Pazos Arias, Jose J.
    Ramos Cabrer, Manuel
    Garcia Duque, Jorge
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2009, : 205 - 206
  • [4] Content-Aware Ranking for Visual Search
    Geng, Bo
    Yang, Linjun
    Xu, Chao
    Hua, Xian-Sheng
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3400 - 3407
  • [5] Unified Generative & Dense Retrieval for Query Rewriting in Sponsored Search
    Mohankumar, Akash Kumar
    Dodla, Bhargav
    Gururaj, K.
    Singh, Amit
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4745 - 4751
  • [6] Impact of query intent and search context on clickthrough behavior in sponsored search
    Azin Ashkan
    Charles L. A. Clarke
    [J]. Knowledge and Information Systems, 2013, 34 : 425 - 452
  • [7] Impact of query intent and search context on clickthrough behavior in sponsored search
    Ashkan, Azin
    Clarke, Charles L. A.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (02) : 425 - 452
  • [9] Towards Content-Aware SPARQL Query Caching for Semantic Web Applications
    Shu, Yanfeng
    Compton, Michael
    Mueller, Heiko
    Taylor, Kerry
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT I, 2013, 8180 : 320 - 329
  • [10] Compressed Data Structures for Astronomical Content-Aware Resource Search
    Araya, Mauricio
    Arroyuelo, Diego
    Saldias, Camilo
    Solar, Mauricio
    [J]. 2019 38TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2019,