LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

被引：3

作者：

Li, Yuchen ^{[1
]}

Xiong, Haoyi ^{[2
]}

Kong, Linghe ^{[1
]}

Wang, Shuaiqiang ^{[2
]}

Sun, Zeyi ^{[3
]}

Chen, Hongyang ^{[3
]}

Chen, Guihai ^{[1
]}

Yin, Dawei ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Baidu Inc, Beijing, Peoples R China

[3] Zhejiang Lab, Hangzhou, Peoples R China

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI | 2023年 / 14174卷

基金：

国家重点研发计划; 上海市科技启明星计划;

关键词：

Learning to Rank; Graph Convolutional Networks; Web Search;

D O I：

10.1007/978-3-031-43427-3_38

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While traditional Learning to Rank (LTR) models use query-webpage pairs to perform regression tasks to predict the ranking scores, they usually fail to capture the structure of interactions between queries and webpages over an extremely large bipartite graph. In recent years, Graph Convolutional Neural Networks (GCNs) have demonstrated their unique advantages in link prediction over bipartite graphs and have been successfully used for user-item recommendations. However, it is still difficult to scale-up GCNs for web search, due to the (1) extreme sparsity of links in query-webpage bipartite graphs caused by the expense of ranking scores annotation and (2) imbalance between queries (billions) and web-pages (trillions) for web-scale search as well as the imbalance in annotations. In this work, we introduce the Q-subgraph and W-subgraph to represent every query and webpage with the structure of interaction preserved, and then propose LtrGCN-an LTR pipeline that samples Q-subgraphs and W-subgraphs from all query-webpage pairs, learns to extract features from Q-subgraphs and W-subgraphs, and predict ranking scores in an end-to-end manner. We carried out extensive experiments to evaluate LtrGCN using two real-world datasets and online experiments based on the A/B test at a large-scale search engine. The offline results show that LtrGCN could achieve Delta NDCG(5) = 2.89%-3.97% compared to baselines. We deploy LtrGCN with realistic traffic at a large-scale search engine, where we can still observe significant improvement. LtrGCN performs consistently in both offline and online experiments.

引用

页码：635 / 651

页数：17

共 50 条

[31] Graph Convolutional Networks-Based Noisy Data Imputation in Electronic Health Record
Lee, Byeong Tak
Kwon, O-Yeon
Park, Hyunho
Cho, Kyung-Jae
Kwon, Joon-Myoung
Lee, Yeha
CRITICAL CARE MEDICINE, 2020, 48 (11) : E1106 - E1111
[32] Graph convolutional networks-based method for uncertainty quantification of building design loads
Lu, Jie
Zheng, Zeyu
Zhang, Chaobo
Zhao, Yang
Feng, Chenxin
Choudhary, Ruchi
BUILDING SIMULATION, 2025, 18 (02) : 321 - 337
[33] Graph Convolutional Networks-Based Super-Resolution Land Cover Mapping
Zhang, Xining
Ge, Yong
Ling, Feng
Chen, Jin
Chen, Yuehong
Jia, Yuanxin
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 7667 - 7681
[34] Scalable Community Search over Large-scale Graphs based on Graph Transformer
Wang, Yuxiang
Gou, Xiaoxuan
Xu, Xiaoliang
Geng, Yuxia
Ke, Xiangyu
Wu, Tianxing
Yu, Zhiyuan
Chen, Runhuai
Wu, Xiangying
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1680 - 1690
[35] The anatomy of a large-scale hypertextual Web search engine
Brin, S
Page, L
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
[36] Large-scale duplicate detection for web image search
Wang, Bin
Li, Zhiwei
Li, Mingjing
Ma, Wei-Ying
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 353 - +
[37] Machine Learning Based Graph Mining of Large-scale Network and Optimization
Liu, Mingyue
PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
[38] Extreme Learning Machine for Large-Scale Graph Classification Based on MapReduce
Wang, Zhanghui
Zhao, Yuhai
Wang, Guoren
PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 93 - 105
[39] Extreme Learning Machine for large-scale graph classification based on MapReduce
Wang, Zhanghui
Zhao, Yuhai
Yuan, Ye
Wang, Guoren
Chen, Lei
NEUROCOMPUTING, 2017, 261 : 106 - 114
[40] Large-scale Video Classification with Convolutional Neural Networks
Karpathy, Andrej
Toderici, George
Shetty, Sanketh
Leung, Thomas
Sukthankar, Rahul
Fei-Fei, Li
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1725 - 1732

← 1 2 3 4 5 →