Decomposed Meta-Learning for Few-Shot Sequence Labeling

Cited by: 1

Authors:

Ma, Tingting [1]

Wu, Qianhui [2]

Jiang, Huiqiang [3]

Lin, Jieru [1]

Karlsson, Borje F. [2]

Zhao, Tiejun [1]

Lin, Chin-Yew [2]

Affiliations:

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China

[2] Microsoft Res Asia, Beijing 100080, Peoples R China

[3] Microsoft Res Asia, Shanghai 200232, Peoples R China

Source:

IEEE/ACM Transactions on Audio, Speech, and Language Processing | 2024 / Vol. 32

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Labeling; Metalearning; Tagging; Detectors; Adaptation models; Speech processing; Few-shot sequence labeling; task decomposition; meta-learning; NAMED ENTITY RECOGNITION;

DOI:

10.1109/TASLP.2024.3372879

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

Few-shot sequence labeling is a general problem formulation for many natural language understanding tasks in data-scarce scenarios, which require models to generalize to new types from only a few labeled examples. Recent advances mostly adopt metric-based meta-learning and thus face two challenges: modeling the miscellaneous Other prototype, and generalizing to classes with large domain gaps. To overcome these challenges, we propose a decomposed meta-learning framework for few-shot sequence labeling that breaks the task down into few-shot mention detection and few-shot type classification, and tackles them sequentially via meta-learning. Specifically, we employ model-agnostic meta-learning (MAML) to prompt the mention detection model to learn boundary knowledge shared across types. With the detected mention spans, we further leverage a MAML-enhanced span-level prototypical network for few-shot type classification. In this way, the decomposition framework bypasses the requirement of modeling the miscellaneous Other prototype. Meanwhile, the adoption of the MAML algorithm enables us to explore the knowledge contained in support examples more efficiently, so that our model can quickly adapt to new types using only a few labeled examples. Under our framework, we explore a basic implementation that uses two separate models for the two subtasks. We further propose a joint model to reduce model size and inference time, making our framework more applicable to scenarios with limited resources. Extensive experiments on nine benchmark datasets, covering named entity recognition, slot tagging, event detection, and part-of-speech tagging, show that the proposed approach achieves state-of-the-art performance across various few-shot sequence labeling tasks.
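The two-stage decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`extract_spans`, `span_embedding`, `build_prototypes`, `classify`), the type-agnostic B/I/O tag scheme, mean-pooled span vectors, and Euclidean distance are all assumptions made for illustration, and the MAML adaptation steps are omitted entirely. The sketch only shows why classifying detected spans against per-type prototypes needs no Other prototype.

```python
from math import sqrt


def extract_spans(tags):
    """Stage 1 output: convert type-agnostic B/I/O tags into (start, end) spans, end inclusive."""
    spans, start = [], None
    for i, tag in enumerate(tags):
        if tag == "B":
            if start is not None:          # close the previous mention
                spans.append((start, i - 1))
            start = i
        elif tag == "I":
            if start is None:              # tolerate an I with no preceding B
                start = i
        else:                              # "O" ends any open mention
            if start is not None:
                spans.append((start, i - 1))
                start = None
    if start is not None:                  # mention running to the end of the sentence
        spans.append((start, len(tags) - 1))
    return spans


def mean(vectors):
    """Element-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(v[d] for v in vectors) / n for d in range(len(vectors[0]))]


def span_embedding(token_vecs, span):
    """Assumed span representation: mean of the token vectors inside the span."""
    start, end = span
    return mean(token_vecs[start:end + 1])


def build_prototypes(support):
    """One prototype per entity type from (span_vector, label) pairs; no Other class."""
    by_type = {}
    for vec, label in support:
        by_type.setdefault(label, []).append(vec)
    return {label: mean(vecs) for label, vecs in by_type.items()}


def classify(span_vec, prototypes):
    """Stage 2: assign the type whose prototype is nearest in Euclidean distance."""
    def dist(a, b):
        return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(prototypes, key=lambda label: dist(span_vec, prototypes[label]))


# Toy usage with hypothetical 2-dimensional token vectors.
tags = ["O", "B", "I", "O", "B"]
token_vecs = [[0.0, 0.0], [1.0, 0.0], [0.8, 0.2], [0.0, 0.0], [0.0, 1.0]]
prototypes = build_prototypes([
    ([1.0, 0.0], "PER"),
    ([0.8, 0.2], "PER"),
    ([0.0, 1.0], "LOC"),
])
spans = extract_spans(tags)
labels = [classify(span_embedding(token_vecs, s), prototypes) for s in spans]
```

Because only detected mention spans reach the classifier, every span is compared against real type prototypes; non-entity tokens are filtered out in the first stage rather than absorbed by a catch-all class.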

Pages: 1980-1993

Page count: 14
