Using Pre-trained Language Model to Enhance Active Learning for Sentence Matching

Cited by: 0
Authors
Bai, Guirong [1 ,2 ]
He, Shizhu [1 ,2 ]
Liu, Kang [1 ,2 ]
Zhao, Jun [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Sentence matching; active learning; pre-trained language model;
DOI
10.1145/3480937
CLC Number
TP18 [Theory of Artificial Intelligence]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Active learning is an effective method for substantially reducing the expensive annotation cost of data-driven models. Recently, pre-trained language models have proven powerful at learning language representations. In this article, we show that a pre-trained language model can also use its learned textual characteristics to enrich the selection criteria of active learning. Specifically, we use the pre-trained language model to provide extra textual criteria for measuring instances, namely noise, coverage, and diversity. With these extra textual criteria, we can select more informative instances for annotation and obtain better results. We conduct experiments on both English and Chinese sentence matching datasets. The experimental results show that the proposed active learning approach is enhanced by the pre-trained language model and achieves better performance.
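To make the idea concrete, below is a minimal sketch (not the authors' released code) of how a pre-trained language model's sentence embeddings could supply one such extra criterion, diversity, during active-learning selection. The encoder name (bert-base-uncased), mean pooling, the "[SEP]"-joined sentence pairs, and the greedy farthest-point heuristic are all illustrative assumptions, not details taken from the paper.

```python
# Sketch only: PLM embeddings used as a diversity criterion for active learning.
# Model choice, pooling, and the selection heuristic are assumptions for illustration.
import torch
from transformers import AutoModel, AutoTokenizer

def embed(sentences, model_name="bert-base-uncased", batch_size=32):
    """Mean-pooled sentence embeddings from a BERT-style encoder."""
    tok = AutoTokenizer.from_pretrained(model_name)
    enc = AutoModel.from_pretrained(model_name).eval()
    vecs = []
    with torch.no_grad():
        for i in range(0, len(sentences), batch_size):
            batch = tok(sentences[i:i + batch_size], padding=True,
                        truncation=True, return_tensors="pt")
            hidden = enc(**batch).last_hidden_state            # (B, T, H)
            mask = batch["attention_mask"].unsqueeze(-1)        # (B, T, 1)
            vecs.append((hidden * mask).sum(1) / mask.sum(1))   # mean pooling
    return torch.cat(vecs)

def select_diverse(pool_vecs, k):
    """Greedy farthest-point selection: pick k mutually dissimilar instances."""
    chosen = [0]                                  # start from an arbitrary instance
    min_dist = torch.cdist(pool_vecs, pool_vecs[chosen]).squeeze(1)
    for _ in range(k - 1):
        nxt = int(torch.argmax(min_dist))         # farthest from everything chosen
        chosen.append(nxt)
        new_d = torch.cdist(pool_vecs, pool_vecs[nxt:nxt + 1]).squeeze(1)
        min_dist = torch.minimum(min_dist, new_d)
    return chosen

# Usage: pick 2 sentence pairs (joined with [SEP] for illustration) to annotate.
pool = ["how do I reset my password [SEP] password reset steps",
        "weather in beijing today [SEP] beijing weather forecast",
        "how to reset a password [SEP] steps to reset password",
        "best pizza near me [SEP] pizza restaurants nearby"]
to_label = select_diverse(embed(pool), k=2)
print("indices to annotate:", to_label)
```

In practice such a diversity score would be combined with the other criteria the abstract mentions (noise and coverage) and with the task model's own uncertainty before choosing which instances to send for annotation.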
Pages: 19
Related Papers
50 items in total
  • [41] Solving ESL Sentence Completion Questions via Pre-trained Neural Language Models
    Liu, Qiongqiong
    Liu, Tianqiao
    Zhao, Jiafu
    Fang, Qiang
    Ding, Wenbiao
    Wu, Zhongqin
    Xia, Feng
    Tang, Jiliang
    Liu, Zitao
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT II, 2021, 12749 : 256 - 261
  • [42] Multi-task Learning based Pre-trained Language Model for Code Completion
    Liu, Fang
    Li, Ge
    Zhao, Yunfei
    Jin, Zhi
    2020 35TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2020), 2020, : 473 - 485
  • [43] PTMA: Pre-trained Model Adaptation for Transfer Learning
    Li, Xiao
    Yan, Junkai
    Jiang, Jianjian
    Zheng, Wei-Shi
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2024, 2024, 14884 : 176 - 188
  • [44] Improving the Performance of Pre-trained Systems in Sentence Retrieval
    Rughbeer, Yastil
    Pillay, Anban W.
    Jembere, Edgar
    2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021, : 908 - 915
  • [45] SsciBERT: a pre-trained language model for social science texts
    Shen, Si
    Liu, Jiangfeng
    Lin, Litao
    Huang, Ying
    Zhang, Lin
    Liu, Chang
    Feng, Yutong
    Wang, Dongbo
    Scientometrics, 2023, 128 : 1241 - 1263
  • [46] Knowledge Enhanced Pre-trained Language Model for Product Summarization
    Yin, Wenbo
    Ren, Junxiang
    Wu, Yuejiao
    Song, Ruilin
    Liu, Lang
    Cheng, Zhen
    Wang, Sibo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 263 - 273
  • [47] A Pre-trained Clinical Language Model for Acute Kidney Injury
    Mao, Chengsheng
    Yao, Liang
    Luo, Yuan
    2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020), 2020, : 531 - 532
  • [48] Few-Shot NLG with Pre-Trained Language Model
    Chen, Zhiyu
    Eavani, Harini
    Chen, Wenhu
    Liu, Yinyin
    Wang, William Yang
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 183 - 190
  • [49] LMPred: predicting antimicrobial peptides using pre-trained language models and deep learning
    Dee, William
    Gromiha, Michael
    BIOINFORMATICS ADVANCES, 2022, 2 (01)