H-ERNIE: A Multi-Granularity Pre-Trained Language Model for Web Search

Cited by: 7
Authors
Chu, Xiaokai [1,3]
Zhao, Jiashu [2]
Zou, Lixin [3]
Yin, Dawei [3]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Wilfrid Laurier Univ, Waterloo, ON, Canada
[3] Baidu Inc, Beijing, Peoples R China
Keywords
Information Retrieval; Web Search; Pre-trained Language Models;
DOI
10.1145/3477495.3531986
CLC number
TP [automation technology, computer technology]
Subject classification code
0812
Abstract
Pre-trained language models (PLMs), such as BERT and ERNIE, have achieved outstanding performance in many natural language understanding tasks. Recently, PLM-based information retrieval models have also been investigated and have demonstrated state-of-the-art effectiveness, e.g., MORES, PROP and ColBERT. However, most PLM-based rankers focus only on single-level relevance matching (e.g., character-level), while ignoring information at other granularities (e.g., words and phrases), which easily leads to ambiguous query understanding and inaccurate matching in web search. In this paper, we aim to improve the state-of-the-art PLM ERNIE for web search by modeling multi-granularity context information with awareness of word importance in queries and documents. In particular, we propose a novel H-ERNIE framework, which includes a query-document analysis component and a hierarchical ranking component. The query-document analysis component consists of several individual modules that generate the necessary variables, such as word segmentation, word importance analysis, and word tightness analysis. Based on these variables, the importance-aware multi-level correspondences are sent to the ranking model. The hierarchical ranking model includes a multi-layer transformer module to learn character-level representations, a word-level matching module, and a phrase-level matching module with word importance. Each of these modules models query-document matching from a different perspective. Moreover, these levels communicate with each other to achieve overall accurate matching. We discuss the time complexity of the proposed framework and show that it can be efficiently implemented in real applications. Offline and online experiments on both public datasets and a commercial search engine illustrate the effectiveness of the proposed H-ERNIE framework.
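The abstract describes a hierarchical, multi-granularity ranking architecture (character-, word-, and phrase-level matching with word importance). The sketch below is a rough, hypothetical illustration of how such a ranker could be wired up in PyTorch; the module names, dimensions, span-based pooling, and cosine-similarity matching are assumptions made purely for exposition and are not the authors' ERNIE-based implementation.

```python
# Hypothetical sketch of hierarchical (character / word / phrase) matching with
# word importance, loosely following the abstract's description. NOT the
# authors' H-ERNIE implementation; all design choices here are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


def pool_spans(char_repr, spans):
    """Mean-pool character states over [start, end) spans (word or phrase
    boundaries from an external segmenter). For simplicity, the spans are
    assumed to be shared across the batch (e.g., batch size 1)."""
    return torch.stack([char_repr[:, s:e].mean(dim=1) for s, e in spans], dim=1)


class HierarchicalRanker(nn.Module):
    def __init__(self, vocab_size=21128, dim=128, heads=4, layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.char_encoder = nn.TransformerEncoder(layer, layers)  # character level
        self.score_head = nn.Linear(3, 1)                         # fuse the 3 levels

    def match(self, q, d, q_weights=None):
        # Cosine-similarity matrix, max-pooled over document units, then
        # averaged over query units (optionally importance-weighted).
        sim = torch.einsum("bqd,bkd->bqk",
                           F.normalize(q, dim=-1), F.normalize(d, dim=-1))
        best = sim.max(dim=-1).values            # best document match per query unit
        if q_weights is not None:
            best = best * q_weights              # word-importance weighting
        return best.mean(dim=-1)

    def forward(self, q_chars, d_chars, q_word_spans, d_word_spans,
                q_phrase_spans, d_phrase_spans, q_word_importance):
        q_char = self.char_encoder(self.embed(q_chars))
        d_char = self.char_encoder(self.embed(d_chars))
        # Word- and phrase-level units are built from the same character states,
        # so the levels share (communicate through) one underlying encoder.
        q_word, d_word = pool_spans(q_char, q_word_spans), pool_spans(d_char, d_word_spans)
        q_phr, d_phr = pool_spans(q_char, q_phrase_spans), pool_spans(d_char, d_phrase_spans)
        scores = torch.stack([
            self.match(q_char, d_char),                      # character level
            self.match(q_word, d_word, q_word_importance),   # word level, importance-aware
            self.match(q_phr, d_phr),                        # phrase level
        ], dim=-1)
        return self.score_head(scores).squeeze(-1)           # final relevance score


# Toy usage (one query-document pair; spans are [start, end) character indices).
model = HierarchicalRanker()
q = torch.randint(0, 21128, (1, 6))
d = torch.randint(0, 21128, (1, 20))
score = model(q, d,
              q_word_spans=[(0, 2), (2, 6)], d_word_spans=[(0, 5), (5, 20)],
              q_phrase_spans=[(0, 6)], d_phrase_spans=[(0, 20)],
              q_word_importance=torch.tensor([[0.8, 0.2]]))
```

The key idea illustrated is that each granularity contributes its own matching signal and the word-level signal is re-weighted by word importance before the signals are fused; the actual paper additionally models word tightness and builds on the pre-trained ERNIE encoder rather than a freshly initialized transformer.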
Pages: 1478-1489
Number of pages: 12
Related papers
50 records in total
  • [21] One Size Does Not Fit All: Multi-Granularity Search of Web Forums
    Ganu, Gayatree
    Marian, Amelie
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 9 - 18
  • [22] Multi-task Learning based Pre-trained Language Model for Code Completion
    Liu, Fang
    Li, Ge
    Zhao, Yunfei
    Jin, Zhi
    [J]. 2020 35TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2020), 2020, : 473 - 485
  • [23] Enhancing Language Generation with Effective Checkpoints of Pre-trained Language Model
    Park, Jeonghyeok
    Zhao, Hai
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2686 - 2694
  • [24] ServiceBERT: A Pre-trained Model for Web Service Tagging and Recommendation
    Wang, Xin
    Zhou, Pingyi
    Wang, Yasheng
    Liu, Xiao
    Liu, Jin
    Wu, Hao
    [J]. SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 464 - 478
  • [25] Few-Shot NLG with Pre-Trained Language Model
    Chen, Zhiyu
    Eavani, Harini
    Chen, Wenhu
    Liu, Yinyin
    Wang, William Yang
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 183 - 190
  • [26] Knowledge Enhanced Pre-trained Language Model for Product Summarization
    Yin, Wenbo
    Ren, Junxiang
    Wu, Yuejiao
    Song, Ruilin
    Liu, Lang
    Cheng, Zhen
    Wang, Sibo
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 263 - 273
  • [27] SsciBERT: a pre-trained language model for social science texts
    Si Shen
    Jiangfeng Liu
    Litao Lin
    Ying Huang
    Lin Zhang
    Chang Liu
    Yutong Feng
    Dongbo Wang
    [J]. Scientometrics, 2023, 128 : 1241 - 1263
  • [28] A Pre-trained Clinical Language Model for Acute Kidney Injury
    Mao, Chengsheng
    Yao, Liang
    Luo, Yuan
    [J]. 2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020), 2020, : 531 - 532
  • [29] ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence
    Hu, Yibo
    Hosseini, MohammadSaleh
    Parolin, Erick Skorupa
    Osorio, Javier
    Khan, Latifur
    Brandt, Patrick T.
    D'Orazio, Vito J.
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5469 - 5482
  • [30] IndicBART: A Pre-trained Model for Indic Natural Language Generation
    Dabre, Raj
    Shrotriya, Himani
    Kunchukuttan, Anoop
    Puduppully, Ratish
    Khapra, Mitesh M.
    Kumar, Pratyush
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1849 - 1863