H-ERNIE: A Multi-Granularity Pre-Trained Language Model for Web Search

Cited by: 7
Authors
Chu, Xiaokai [1 ,3 ]
Zhao, Jiashu [2 ]
Zou, Lixin [3 ]
Yin, Dawei [3 ]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Wilfrid Laurier Univ, Waterloo, ON, Canada
[3] Baidu Inc, Beijing, Peoples R China
Keywords
Information Retrieval; Web Search; Pre-trained Language Models;
DOI
10.1145/3477495.3531986
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812;
Abstract
Pre-trained language models (PLMs), such as BERT and ERNIE, have achieved outstanding performance in many natural language understanding tasks. Recently, PLM-based information retrieval models, e.g., MORES, PROP, and ColBERT, have also been investigated and have demonstrated state-of-the-art effectiveness. However, most PLM-based rankers focus only on a single level of relevance matching (e.g., character-level) while ignoring information at other granularities (e.g., words and phrases), which easily leads to ambiguous query understanding and inaccurate matching in web search. In this paper, we aim to improve the state-of-the-art PLM ERNIE for web search by modeling multi-granularity context information with awareness of word importance in queries and documents. In particular, we propose a novel H-ERNIE framework, which includes a query-document analysis component and a hierarchical ranking component. The query-document analysis component contains several individual modules that generate the necessary variables, such as word segmentation, word importance analysis, and word tightness analysis. Based on these variables, importance-aware multi-level correspondences are sent to the ranking model. The hierarchical ranking model includes a multi-layer transformer module that learns character-level representations, a word-level matching module, and a phrase-level matching module with word importance. Each of these modules models query-document matching from a different perspective, and the levels communicate with each other to achieve accurate overall matching. We discuss the time complexity of the proposed framework and show that it can be implemented efficiently in real applications. Offline and online experiments on both public datasets and a commercial search engine demonstrate the effectiveness of the proposed H-ERNIE framework.
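To make the multi-granularity idea in the abstract concrete, the following is a minimal sketch in PyTorch of character-, word-, and phrase-level query-document matching with importance-weighted pooling. All names, dimensions, the pooling scheme, and the fusion head are illustrative assumptions, not the authors' H-ERNIE implementation; the word ids are assumed to come from an external word segmenter.

```python
# Hypothetical sketch of multi-granularity query-document matching (not the paper's code).
import torch
import torch.nn as nn


class MultiGranularityMatcher(nn.Module):
    def __init__(self, vocab_size=21128, dim=128, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)  # vocab size is illustrative
        # Character-level encoder, standing in for the multi-layer transformer module.
        layer = nn.TransformerEncoderLayer(dim, n_heads, dim * 4, batch_first=True)
        self.char_encoder = nn.TransformerEncoder(layer, n_layers)
        # Word-importance scorer: one scalar weight per pooled word representation.
        self.word_importance = nn.Linear(dim, 1)
        # Fuse the three level-wise similarities into a single relevance score.
        self.score_head = nn.Linear(3, 1)

    def pool_words(self, char_repr, word_ids):
        # Average the character states that share a word id (segmentation output).
        num_words = int(word_ids.max().item()) + 1
        one_hot = torch.nn.functional.one_hot(word_ids, num_words).float()  # [L, W]
        counts = one_hot.sum(dim=0).clamp(min=1).unsqueeze(-1)              # [W, 1]
        return (one_hot.t() @ char_repr) / counts                           # [W, D]

    def forward(self, q_chars, q_word_ids, d_chars, d_word_ids):
        # Character level.
        q_char = self.char_encoder(self.embed(q_chars).unsqueeze(0)).squeeze(0)
        d_char = self.char_encoder(self.embed(d_chars).unsqueeze(0)).squeeze(0)
        char_sim = torch.cosine_similarity(q_char.mean(0), d_char.mean(0), dim=0)

        # Word level: pool characters into words via segmentation ids.
        q_word = self.pool_words(q_char, q_word_ids)
        d_word = self.pool_words(d_char, d_word_ids)
        word_sim = torch.cosine_similarity(q_word.mean(0), d_word.mean(0), dim=0)

        # Phrase level: importance-weighted aggregation of word representations.
        q_w = torch.softmax(self.word_importance(q_word).squeeze(-1), dim=0)
        d_w = torch.softmax(self.word_importance(d_word).squeeze(-1), dim=0)
        q_phrase = (q_w.unsqueeze(-1) * q_word).sum(0)
        d_phrase = (d_w.unsqueeze(-1) * d_word).sum(0)
        phrase_sim = torch.cosine_similarity(q_phrase, d_phrase, dim=0)

        return self.score_head(torch.stack([char_sim, word_sim, phrase_sim]))


# Toy usage with made-up character ids and word segmentation ids.
model = MultiGranularityMatcher()
q = torch.tensor([3, 15, 27, 8])            # query character ids
qw = torch.tensor([0, 0, 1, 1])             # word id of each query character
d = torch.tensor([3, 15, 27, 8, 42, 7])     # document character ids
dw = torch.tensor([0, 0, 1, 1, 2, 2])
print(model(q, qw, d, dw))                  # scalar relevance score
```

The sketch keeps the three levels separate and fuses only their similarity scores; the paper's framework additionally lets the levels communicate and incorporates word tightness analysis, which are omitted here for brevity.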
Pages: 1478-1489
Page count: 12
Related Papers
50 records in total
  • [1] Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models
    Lai, Yuxuan
    Liu, Yijia
    Feng, Yansong
    Huang, Songfang
    Zhao, Dongyan
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1716 - 1731
  • [2] Pre-trained Language Model based Ranking in Baidu Search
    Zou, Lixin
    Zhang, Shengqiang
    Cai, Hengyi
    Ma, Dehong
    Cheng, Suqi
    Wang, Shuaiqiang
    Shi, Daiting
    Cheng, Zhicong
    Yin, Dawei
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4014 - 4022
  • [3] Hyperbolic Pre-Trained Language Model
    Chen, Weize
    Han, Xu
    Lin, Yankai
    He, Kaichen
    Xie, Ruobing
    Zhou, Jie
    Liu, Zhiyuan
    Sun, Maosong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3101 - 3112
  • [4] ERNIE-GeoL: A Geography-and-Language Pre-trained Model and its Applications in Baidu Maps
    Huang, Jizhou
    Wang, Haifeng
    Sun, Yibo
    Shi, Yunsheng
    Huang, Zhengjie
    Zhuo, An
    Feng, Shikun
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3029 - 3039
  • [5] Pre-trained Language Model Representations for Language Generation
    Edunov, Sergey
    Baevski, Alexei
    Auli, Michael
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4052 - 4059
  • [6] AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
    Zhang, Xinsong
    Li, Pengshuai
    Li, Hang
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 421 - 435
  • [7] Pre-trained Language Model for Web-scale Retrieval in Baidu Search
    Liu, Yiding
    Lu, Weixue
    Cheng, Suqi
    Shi, Daiting
    Wang, Shuaiqiang
    Cheng, Zhicong
    Yin, Dawei
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3365 - 3375
  • [8] Pre-trained Language Model-based Retrieval and Ranking for Web Search
    Zou, Lixin
    Lu, Weixue
    Liu, Yiding
    Cai, Hengyi
    Chu, Xiaokai
    Ma, Dehong
    Shi, Daiting
    Sun, Yu
    Cheng, Zhicong
    Gu, Simiu
    Wang, Shuaiqiang
    Yin, Dawei
    [J]. ACM TRANSACTIONS ON THE WEB, 2023, 17 (01)
  • [9] Adder Encoder for Pre-trained Language Model
    Ding, Jianbang
    Zhang, Suiyun
    Li, Linlin
    [J]. CHINESE COMPUTATIONAL LINGUISTICS, CCL 2023, 2023, 14232 : 339 - 347
  • [10] Surgicberta: A Pre-trained Language Model for Procedural Surgical Language
    Bombieri, Marco
    Rospocher, Marco
    Ponzetto, Simone Paolo
    Fiorini, Paolo
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 18 (01) : 69 - 81