H-ERNIE: A Multi-Granularity Pre-Trained Language Model for Web Search

Cited by: 7
Authors
Chu, Xiaokai [1 ,3 ]
Zhao, Jiashu [2 ]
Zou, Lixin [3 ]
Yin, Dawei [3 ]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Wilfrid Laurier Univ, Waterloo, ON, Canada
[3] Baidu Inc, Beijing, Peoples R China
Keywords
Information Retrieval; Web Search; Pre-trained Language Models
DOI
10.1145/3477495.3531986
Chinese Library Classification (CLC)
TP [Automation Technology; Computer Technology]
Subject Classification Code
0812
Abstract
Pre-trained language models (PLMs), such as BERT and ERNIE, have achieved outstanding performance on many natural language understanding tasks. Recently, PLM-based information retrieval models such as MORES, PROP, and ColBERT have also been investigated and have shown state-of-the-art effectiveness. However, most PLM-based rankers focus on relevance matching at a single level (e.g., the character level) while ignoring other granularities (e.g., words and phrases), which easily leads to ambiguous query understanding and inaccurate matching in web search. In this paper, we aim to improve the state-of-the-art PLM ERNIE for web search by modeling multi-granularity context information with awareness of word importance in queries and documents. In particular, we propose a novel H-ERNIE framework, which includes a query-document analysis component and a hierarchical ranking component. The query-document analysis component contains several individual modules that generate the necessary variables, such as word segmentation, word importance analysis, and word tightness analysis. Based on these variables, importance-aware multi-level correspondences are sent to the ranking model. The hierarchical ranking model includes a multi-layer transformer module that learns character-level representations, a word-level matching module, and a phrase-level matching module with word importance. Each of these modules models query-document matching from a different perspective, and the levels communicate with each other to achieve overall accurate matching. We discuss the time complexity of the proposed framework and show that it can be efficiently implemented in real applications. Offline and online experiments on both public datasets and a commercial search engine demonstrate the effectiveness of the proposed H-ERNIE framework.
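The abstract describes a multi-granularity matching architecture: a character-level transformer encoder, plus word-level and phrase-level matching modules driven by externally computed word importance. The sketch below illustrates that general idea in PyTorch; the module names, dimensions, importance-weighted pooling, and the simple averaging of the three granularity scores are illustrative assumptions, not the paper's actual H-ERNIE implementation.

```python
# A minimal, hypothetical sketch of multi-granularity relevance matching
# (character-, word-, and phrase-level) with word-importance weighting.
# It is NOT the H-ERNIE implementation; all design choices here are assumptions.
import torch
import torch.nn as nn


class MultiGranularityRanker(nn.Module):
    def __init__(self, vocab_size=21128, d_model=256, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True
        )
        # Character-level contextual encoder (stands in for the ERNIE backbone).
        self.char_encoder = nn.TransformerEncoder(encoder_layer, n_layers)
        # One relevance head per granularity; their scores are averaged below.
        self.char_head = nn.Linear(d_model, 1)
        self.word_head = nn.Linear(d_model, 1)
        self.phrase_head = nn.Linear(d_model, 1)

    @staticmethod
    def pool_segments(char_states, seg_ids, importance, n_segments):
        """Aggregate character states into segment (word/phrase) vectors,
        weighting each character by an externally supplied importance score."""
        b, l, d = char_states.shape
        weighted = char_states * importance.unsqueeze(-1)            # (b, l, d)
        pooled = char_states.new_zeros(b, n_segments, d)
        pooled.scatter_add_(1, seg_ids.unsqueeze(-1).expand(-1, -1, d), weighted)
        return pooled

    def forward(self, char_ids, word_ids, phrase_ids, importance,
                n_words, n_phrases):
        # char_ids: (b, l) characters of "query [SEP] document"
        # word_ids / phrase_ids: (b, l) segment index of each character
        # importance: (b, l) word-importance weight of each character
        h = self.char_encoder(self.embed(char_ids))                  # (b, l, d)
        char_score = self.char_head(h[:, 0])                         # CLS-style score
        word_repr = self.pool_segments(h, word_ids, importance, n_words)
        phrase_repr = self.pool_segments(h, phrase_ids, importance, n_phrases)
        word_score = self.word_head(word_repr.mean(dim=1))
        phrase_score = self.phrase_head(phrase_repr.mean(dim=1))
        # Combine the three granularity scores by simple averaging (an assumption).
        return (char_score + word_score + phrase_score) / 3.0


# Toy usage: batch of 2, sequence length 8, 4 words, 2 phrases.
model = MultiGranularityRanker()
char_ids = torch.randint(0, 21128, (2, 8))
word_ids = torch.randint(0, 4, (2, 8))
phrase_ids = torch.randint(0, 2, (2, 8))
importance = torch.rand(2, 8)
print(model(char_ids, word_ids, phrase_ids, importance, 4, 2).shape)  # torch.Size([2, 1])
```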
Pages: 1478-1489
Number of pages: 12