LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

被引:3
|
作者
Li, Yuchen [1 ]
Xiong, Haoyi [2 ]
Kong, Linghe [1 ]
Wang, Shuaiqiang [2 ]
Sun, Zeyi [3 ]
Chen, Hongyang [3 ]
Chen, Guihai [1 ]
Yin, Dawei [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
[3] Zhejiang Lab, Hangzhou, Peoples R China
基金
国家重点研发计划; 上海市科技启明星计划;
关键词
Learning to Rank; Graph Convolutional Networks; Web Search;
D O I
10.1007/978-3-031-43427-3_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While traditional Learning to Rank (LTR) models use query-webpage pairs to perform regression tasks to predict the ranking scores, they usually fail to capture the structure of interactions between queries and webpages over an extremely large bipartite graph. In recent years, Graph Convolutional Neural Networks (GCNs) have demonstrated their unique advantages in link prediction over bipartite graphs and have been successfully used for user-item recommendations. However, it is still difficult to scale-up GCNs for web search, due to the (1) extreme sparsity of links in query-webpage bipartite graphs caused by the expense of ranking scores annotation and (2) imbalance between queries (billions) and web-pages (trillions) for web-scale search as well as the imbalance in annotations. In this work, we introduce the Q-subgraph and W-subgraph to represent every query and webpage with the structure of interaction preserved, and then propose LtrGCN-an LTR pipeline that samples Q-subgraphs and W-subgraphs from all query-webpage pairs, learns to extract features from Q-subgraphs and W-subgraphs, and predict ranking scores in an end-to-end manner. We carried out extensive experiments to evaluate LtrGCN using two real-world datasets and online experiments based on the A/B test at a large-scale search engine. The offline results show that LtrGCN could achieve Delta NDCG(5) = 2.89%-3.97% compared to baselines. We deploy LtrGCN with realistic traffic at a large-scale search engine, where we can still observe significant improvement. LtrGCN performs consistently in both offline and online experiments.
引用
收藏
页码:635 / 651
页数:17
相关论文
共 50 条
  • [31] Graph Convolutional Networks-Based Noisy Data Imputation in Electronic Health Record
    Lee, Byeong Tak
    Kwon, O-Yeon
    Park, Hyunho
    Cho, Kyung-Jae
    Kwon, Joon-Myoung
    Lee, Yeha
    CRITICAL CARE MEDICINE, 2020, 48 (11) : E1106 - E1111
  • [32] Graph convolutional networks-based method for uncertainty quantification of building design loads
    Lu, Jie
    Zheng, Zeyu
    Zhang, Chaobo
    Zhao, Yang
    Feng, Chenxin
    Choudhary, Ruchi
    BUILDING SIMULATION, 2025, 18 (02) : 321 - 337
  • [33] Graph Convolutional Networks-Based Super-Resolution Land Cover Mapping
    Zhang, Xining
    Ge, Yong
    Ling, Feng
    Chen, Jin
    Chen, Yuehong
    Jia, Yuanxin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 7667 - 7681
  • [34] Scalable Community Search over Large-scale Graphs based on Graph Transformer
    Wang, Yuxiang
    Gou, Xiaoxuan
    Xu, Xiaoliang
    Geng, Yuxia
    Ke, Xiangyu
    Wu, Tianxing
    Yu, Zhiyuan
    Chen, Runhuai
    Wu, Xiangying
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1680 - 1690
  • [35] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [36] Large-scale duplicate detection for web image search
    Wang, Bin
    Li, Zhiwei
    Li, Mingjing
    Ma, Wei-Ying
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 353 - +
  • [37] Machine Learning Based Graph Mining of Large-scale Network and Optimization
    Liu, Mingyue
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [38] Extreme Learning Machine for Large-Scale Graph Classification Based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Wang, Guoren
    PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 93 - 105
  • [39] Extreme Learning Machine for large-scale graph classification based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    NEUROCOMPUTING, 2017, 261 : 106 - 114
  • [40] Large-scale Video Classification with Convolutional Neural Networks
    Karpathy, Andrej
    Toderici, George
    Shetty, Sanketh
    Leung, Thomas
    Sukthankar, Rahul
    Fei-Fei, Li
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1725 - 1732