Pre-trained Language Model-based Retrieval and Ranking for Web Search

Cited by: 4
Authors
Zou, Lixin [1 ]
Lu, Weixue [1 ]
Liu, Yiding [1 ]
Cai, Hengyi [1 ]
Chu, Xiaokai [1 ]
Ma, Dehong [1 ]
Shi, Daiting [1 ]
Sun, Yu [1 ]
Cheng, Zhicong [1 ]
Gu, Simiu [1 ]
Wang, Shuaiqiang [1 ]
Yin, Dawei [1 ]
Affiliations
[1] Baidu Inc, Beijing, Peoples R China
Keywords
Pre-trained language model; web retrieval; ranking; ALGORITHM;
DOI
10.1145/3568681
CLC Classification Number
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812 ;
Abstract
Pre-trained language representation models (PLMs) such as BERT and Enhanced Representation through kNowledge IntEgration (ERNIE) have been integral to achieving recent improvements on various downstream tasks, including information retrieval. However, it is nontrivial to directly utilize these models for large-scale web search due to the following challenging issues: (1) the prohibitively expensive computation of massive neural PLMs, especially for the long texts in web documents, prohibits their deployment in web search systems that demand extremely low latency; (2) the discrepancy between existing task-agnostic pre-training objectives and the ad hoc retrieval scenarios that demand comprehensive relevance modeling is another main barrier to improving online retrieval and ranking effectiveness; and (3) to create a significant impact on real-world applications, practical solutions are also needed to seamlessly interweave the resultant PLM and other components into a cooperative system that serves web-scale data. Accordingly, in this work we contribute a series of successfully applied techniques that tackle these issues when deploying the state-of-the-art Chinese pre-trained language model, i.e., ERNIE, in an online search engine system. We first present novel practices for performing expressive PLM-based semantic retrieval with a flexible poly-interaction scheme, and for cost-efficiently contextualizing and ranking web documents with a cheap yet powerful Pyramid-ERNIE architecture. We then design innovative pre-training and fine-tuning paradigms that explicitly incentivize query-document relevance modeling in PLM-based retrieval and ranking using large-scale noisy and biased post-click behavioral data. We also introduce a series of effective strategies to seamlessly interweave the designed PLM-based models with other conventional components into a cooperative system.
Extensive offline and online experimental results show that our proposed techniques are crucial to achieving more effective search performance. We also provide a thorough analysis of our methodology and experimental results.
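The poly-interaction retrieval scheme mentioned in the abstract can be sketched roughly as follows. This is a minimal illustrative assumption of how such a scheme scores query-document pairs, not the paper's actual implementation: the function names, the use of multiple query-side "poly code" vectors against a single document vector, and the softmax attention pooling are all assumptions based on the general poly-encoder family of models.

```python
import numpy as np

def poly_interaction_score(query_vecs: np.ndarray, doc_vec: np.ndarray) -> float:
    """Score one (query, document) pair with a poly-interaction:
    m query-side vectors attend over their similarities to the single
    document vector, then the attention-pooled query vector is matched
    against the document vector."""
    logits = query_vecs @ doc_vec            # (m,) per-code similarities
    weights = np.exp(logits - logits.max())  # numerically stable softmax
    weights /= weights.sum()
    pooled = weights @ query_vecs            # (d,) attention-pooled query vector
    return float(pooled @ doc_vec)           # final relevance score

def retrieve(query_vecs: np.ndarray, doc_matrix: np.ndarray, k: int = 2) -> list:
    """Rank pre-computed document embeddings by poly-interaction score
    and return the indices of the top-k documents."""
    scores = [poly_interaction_score(query_vecs, d) for d in doc_matrix]
    return list(np.argsort(scores)[::-1][:k])
```

The design rationale such a scheme reflects is consistent with the latency constraint the abstract raises: each document is reduced to a single vector, so document embeddings can be pre-computed and indexed offline, while the richer multi-vector interaction lives entirely on the cheap query side and is computed once per query at serving time.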
Pages: 36