Multi-view Embedding-based Synonyms for Email Search

被引:13
|
作者
Li, Cheng [1 ]
Zhang, Mingyang [1 ]
Bendersky, Michael [1 ]
Deng, Hongbo [1 ,2 ]
Metzler, Donald [1 ]
Najork, Marc [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
[2] Alibaba Inc, Hangzhou, Zhejiang, Peoples R China
关键词
embedding; synonym expansion; personal search; email search; INFORMATION-RETRIEVAL; QUERY EXPANSION; MODELS;
D O I
10.1145/3331184.3331250
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Synonym expansion is a technique that adds related words to search queries, which may lead to more relevant documents being retrieved, thus improving recall. There is extensive prior work on synonym expansion for web search, however very few studies have tackled its application for email search. Synonym expansion for private corpora like emails poses several unique research challenges. First, the emails are not shared across users, which precludes us from directly employing query-document bipartite graphs, which are standard in web search synonym expansion. Second, user search queries are of personal nature, and may not be generalizable across users. Third, the size of the underlying corpora from which the synonyms may be mined is relatively small (i.e., user's private email inbox) compared to the size of the web corpus. Therefore, in this paper, we propose a solution tailored to the challenges of synonym expansion for email search. We formulate it as a multi-view learning problem, and propose a novel embedding-based model that joins information from multiple sources to obtain the optimal synonym candidates. To demonstrate the effectiveness of the proposed technique, we evaluate our model using both explicit human ratings as well as a live experiment using the Gmail Search service, one of the world's largest email search engines.
引用
收藏
页码:575 / 584
页数:10
相关论文
共 50 条
  • [1] Graph Embedding-Based Deep Multi-view Clustering
    Chen, Cong
    Zhou, Jin
    Han, Shiyuan
    Wang, Yingxu
    Du, Tao
    Yang, Cheng
    Liu, Bowen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14863 : 166 - 175
  • [2] Local voting based multi-view embedding
    Gao, Xinjian
    Mu, Tingting
    Wang, Meng
    NEUROCOMPUTING, 2016, 171 : 901 - 909
  • [3] Multi-view label embedding
    Zhu, Pengfei
    Hu, Qi
    Hu, Qinghua
    Zhang, Changqing
    Feng, Zhizhao
    PATTERN RECOGNITION, 2018, 84 : 126 - 135
  • [4] Embedding-based search in JetBrains IDEs
    Abramov, Evgeny
    Palchikov, Nikolai
    PROCEEDINGS OF THE 2024 FIRST IDE WORKSHOP, IDE 2024, 2024, : 62 - 65
  • [5] Embedding-based Retrieval in Facebook Search
    Huang, Jui-Ting
    Sharma, Ashish
    Sun, Shuying
    Xia, Li
    Zhang, David
    Pronin, Philip
    Padmanabhan, Janani
    Ottaviano, Giuseppe
    Yang, Linjun
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2553 - 2561
  • [6] Deep Multi-view Clustering Based on Graph Embedding
    Zhang, Chen
    Zhou, Weidong
    Zhou, Jin
    Wang, Yingxu
    Han, Shiyuan
    Du, Tao
    Yang, Cheng
    Liu, Bowen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 715 - 726
  • [7] Multi-view Heterogeneous Network Embedding
    Du, Ouxia
    Zhang, Yujia
    Li, Xinyue
    Zhu, Junyi
    Zheng, Tanghu
    Li, Ya
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, 2022, 13369 : 3 - 15
  • [8] Multi-View Collaborative Network Embedding
    Ata, Sezin Kircali
    Fang, Yuan
    Wu, Min
    Shi, Jiaqi
    Kwoh, Chee Keong
    Li, Xiaoli
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [9] Multi-View Learning of Network Embedding
    Han, Zhongming
    Zheng, Chenye
    Liu, Dan
    Duan, Dagao
    Yang, Weijie
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE (JSAI-ISAI 2018), 2019, 11717 : 90 - 98
  • [10] Embedding-based Product Retrieval in Taobao Search
    Li, Sen
    Lv, Fuyu
    Jin, Taiwei
    Lin, Guli
    Yang, Keping
    Zeng, Xiaoyi
    Wu, Xiao-Ming
    Ma, Qianli
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3181 - 3189