Learning Query and Document Relevance from a Web-scale Click Graph

Cited by: 32
Authors
Jiang, Shan [1]
Hu, Yuening [2]
Kang, Changsung [2]
Daly, Tim, Jr. [2]
Yin, Dawei [2]
Chang, Yi [2]
Zhai, Chengxiang [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Yahoo Res, Sunnyvale, CA USA
Keywords
Click-through bipartite graph; vector propagation; vector generation; Web search; query-document relevance
DOI
10.1145/2911451.2911531
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Click-through logs over query-document pairs provide rich and valuable information for multiple tasks in information retrieval. This paper proposes a vector propagation algorithm on the click graph to learn vector representations for both queries and documents in the same semantic space. The proposed approach incorporates both click and content information, and the produced vector representations can directly improve ranking performance for queries and documents that have been observed in the click log. For new queries and documents that are not in the click log, we propose a two-step framework to generate the vector representation, which significantly improves the coverage of our vectors while maintaining the high quality. Experiments on Web-scale search logs from a major commercial search engine demonstrate the effectiveness and scalability of the proposed method. Evaluation results show that NDCG scores are significantly improved against multiple baselines by using the proposed method both as a ranking model and as a feature in a learning-to-rank framework.
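The abstract does not state the propagation rule, so the following is only a minimal sketch of what one round of vector propagation on a click graph might look like. It assumes bag-of-words query vectors as the starting point, click-count-weighted sums as the update, and L2 normalization after each step. The names `normalize`, `propagate`, `cosine`, and the toy click log are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from math import sqrt


def normalize(vec):
    """Scale a sparse term->weight vector to unit L2 norm."""
    norm = sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm > 0 else {}


def propagate(edges, source_vecs):
    """Push vectors across click edges (assumed rule: click-weighted sum).

    edges: iterable of (source_node, target_node, click_count).
    Returns a unit-length sparse vector for every target node.
    """
    sums = defaultdict(lambda: defaultdict(float))
    for src, tgt, clicks in edges:
        for term, weight in source_vecs.get(src, {}).items():
            sums[tgt][term] += clicks * weight
    return {tgt: normalize(vec) for tgt, vec in sums.items()}


def cosine(u, v):
    """Dot product of two unit-length sparse vectors."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


# Toy click log (hypothetical): (query, document, clicks).
click_log = [
    ("cheap flights", "d1", 8),
    ("flight deals", "d1", 2),
    ("pasta recipe", "d2", 5),
]

# Assumed initialization: each query starts as a bag of its own terms.
query_vecs = {q: normalize({t: 1.0 for t in q.split()}) for q, _, _ in click_log}

# Queries -> documents, then documents -> queries, so both sides
# end up in the same term space.
doc_vecs = propagate(click_log, query_vecs)
back_edges = [(d, q, c) for q, d, c in click_log]
query_vecs = propagate(back_edges, doc_vecs)

# Relevance score for a query-document pair is the cosine of their vectors.
score_related = cosine(query_vecs["flight deals"], doc_vecs["d1"])
score_unrelated = cosine(query_vecs["flight deals"], doc_vecs["d2"])
```

In this toy graph, "cheap flights" and "flight deals" share clicks on `d1`, so after one round trip each query also carries the other's terms, and both score higher against `d1` than against `d2`. Handling queries and documents absent from the click log, which the abstract addresses with a separate two-step framework, is outside this sketch.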
Pages: 185-194
Page count: 10