Multi-task Active Learning for Pre-trained Transformer-based Models

Cited by: 6
Authors
Rotman, Guy [1]
Reichart, Roi [1]
Affiliations
[1] Faculty of Industrial Engineering & Management, Technion - Israel Institute of Technology, Haifa, Israel
DOI
10.1162/tacl_a_00515
Chinese Library Classification (CLC)
TP18 [Theory of Artificial Intelligence]
Discipline codes
081104; 0812; 0835; 1405
Abstract
Multi-task learning, in which several tasks are jointly learned by a single model, allows NLP models to share information from multiple annotations and may facilitate better predictions when the tasks are inter-related. This technique, however, requires annotating the same text with multiple annotation schemes, which may be costly and laborious. Active learning (AL) has been demonstrated to optimize annotation processes by iteratively selecting the unlabeled examples whose annotation is most valuable for the NLP model. Yet, multi-task active learning (MT-AL) has not been applied to state-of-the-art pre-trained Transformer-based NLP models. This paper aims to close this gap. We explore various multi-task selection criteria in three realistic multi-task scenarios, reflecting different relations between the participating tasks, and demonstrate the effectiveness of multi-task selection compared to single-task selection. Our results suggest that MT-AL can be effectively used to minimize annotation efforts for multi-task NLP models.
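To make the selection step concrete, the sketch below illustrates one generic MT-AL criterion (not taken from the paper): rank unlabeled examples by their mean per-task predictive entropy and send the top-k for annotation. The task names, the NumPy-based setup, and the entropy-averaging rule are assumptions for this example only; the paper itself studies several different multi-task selection criteria.

```python
import numpy as np

def entropy(probs: np.ndarray) -> np.ndarray:
    """Predictive entropy per example; probs has shape (n_examples, n_classes)."""
    return -(probs * np.log(probs + 1e-12)).sum(axis=-1)

def select_batch(per_task_probs: dict, k: int) -> np.ndarray:
    """Score each unlabeled example by its mean entropy across tasks and return the top-k indices."""
    scores = np.mean([entropy(p) for p in per_task_probs.values()], axis=0)
    return np.argsort(-scores)[:k]

# Toy usage: two hypothetical tasks, 5 unlabeled examples, softmax-style outputs.
rng = np.random.default_rng(0)
probs = {task: rng.dirichlet(np.ones(3), size=5) for task in ("ner", "parsing")}
print(select_batch(probs, k=2))  # indices of the 2 most uncertain examples
```

In a real MT-AL loop this scoring would run on the pooled model outputs after each retraining round, with the selected examples annotated for all participating tasks.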
Pages: 1209-1228 (20 pages)