Multi-view Embedding-based Synonyms for Email Search

Cited by: 13
Authors
Li, Cheng [1]
Zhang, Mingyang [1]
Bendersky, Michael [1]
Deng, Hongbo [1, 2]
Metzler, Donald [1]
Najork, Marc [1]
Affiliations
[1] Google, Mountain View, CA 94043 USA
[2] Alibaba Inc, Hangzhou, Zhejiang, People's Republic of China
Keywords
embedding; synonym expansion; personal search; email search; information retrieval; query expansion; models
DOI
10.1145/3331184.3331250
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Synonym expansion is a technique that adds related words to search queries, which may lead to more relevant documents being retrieved, thus improving recall. There is extensive prior work on synonym expansion for web search; however, very few studies have tackled its application to email search. Synonym expansion for private corpora like emails poses several unique research challenges. First, emails are not shared across users, which precludes directly employing the query-document bipartite graphs that are standard in web search synonym expansion. Second, user search queries are of a personal nature and may not generalize across users. Third, the corpus from which synonyms may be mined (i.e., a user's private email inbox) is small compared to the web corpus. Therefore, in this paper, we propose a solution tailored to the challenges of synonym expansion for email search. We formulate it as a multi-view learning problem, and propose a novel embedding-based model that joins information from multiple sources to obtain the optimal synonym candidates. To demonstrate the effectiveness of the proposed technique, we evaluate our model using both explicit human ratings and a live experiment on the Gmail Search service, one of the world's largest email search engines.
Pages: 575-584
Number of pages: 10