Decomposed Meta-Learning for Few-Shot Sequence Labeling

被引:1
|
作者
Ma, Tingting [1 ]
Wu, Qianhui [2 ]
Jiang, Huiqiang [3 ]
Lin, Jieru [1 ]
Karlsson, Borje F. [2 ]
Zhao, Tiejun [1 ]
Lin, Chin-Yew [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Microsoft Res Asia, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Labeling; Metalearning; Tagging; Detectors; Adaptation models; Speech processing; Few-shot sequence labeling; task decomposition; meta-learning; NAMED ENTITY RECOGNITION;
D O I
10.1109/TASLP.2024.3372879
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Few-shot sequence labeling is a general problem formulation for many natural language understanding tasks in data-scarcity scenarios, which require models to generalize to new types via only a few labeled examples. Recent advances mostly adopt metric-based meta-learning and thus face the challenges of modeling the miscellaneous Other prototype and the inability to generalize to classes with large domain gaps. To overcome these challenges, we propose a decomposed meta-learning framework for few-shot sequence labeling that breaks down the task into few-shot mention detection and few-shot type classification, and sequentially tackles them via meta-learning. Specifically, we employ model-agnostic meta-learning (MAML) to prompt the mention detection model to learn boundary knowledge shared across types. With the detected mention spans, we further leverage the MAML-enhanced span-level prototypical network for few-shot type classification. In this way, the decomposition framework bypasses the requirement of modeling the miscellaneous Other prototype. Meanwhile, the adoption of the MAML algorithm enables us to explore the knowledge contained in support examples more efficiently, so that our model can quickly adapt to new types using only a few labeled examples. Under our framework, we explore a basic implementation that uses two separate models for the two subtasks. We further propose a joint model to reduce model size and inference time, making our framework more applicable for scenarios with limited resources. Extensive experiments on nine benchmark datasets, including named entity recognition, slot tagging, event detection, and part-of-speech tagging, show that the proposed approach achieves start-of-the-art performance across various few-shot sequence labeling tasks.
引用
收藏
页码:1980 / 1993
页数:14
相关论文
共 50 条
  • [41] EMO: EPISODIC MEMORY OPTIMIZATION FOR FEW-SHOT META-LEARNING
    Du, Yingjun
    Shen, Jiayi
    Zhen, Xiantong
    Snoek, Cees G. M.
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 1 - 20
  • [42] Ensemble Meta-Learning for Few-Shot Soot Density Recognition
    Gu, Ke
    Zhang, Yonghui
    Qiao, Junfei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (03) : 2261 - 2270
  • [43] Few-Shot Named Entity Recognition via Meta-Learning
    Li, Jing
    Chiu, Billy
    Feng, Shanshan
    Wang, Hao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4245 - 4256
  • [44] MetaFSCEL A Meta-Learning Approach for Few-Shot Class Incremental Learning
    Chi, Zhixiang
    Gu, Li
    Liu, Huan
    Wang, Yang
    Yu, Yuanhao
    Tang, Jin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14146 - 14155
  • [45] Few-Shot Human Motion Prediction via Meta-learning
    Gui, Liang-Yan
    Wang, Yu-Xiong
    Ramanan, Deva
    Moura, Jose M. F.
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 441 - 459
  • [46] SML: Semantic meta-learning for few-shot semantic segmentation * *
    Pambala, Ayyappa Kumar
    Dutta, Titir
    Biswas, Soma
    PATTERN RECOGNITION LETTERS, 2021, 147 : 93 - 99
  • [47] Generative Probabilistic Meta-Learning for Few-Shot Image Classification
    Fu, Meijun
    Wang, Xiaomin
    Wang, Jun
    Yi, Zhang
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [48] Meta-Learning With Relation Embedding for Few-Shot Deepfake Detection
    Liu, Xiaoyong
    Song, Pengcheng
    Lu, Pei
    Wang, Yanjun
    IEEE ACCESS, 2024, 12 : 180135 - 180145
  • [49] Few-shot personalized saliency prediction using meta-learning
    Luo, Xinhui
    Liu, Zhi
    Wei, Weijie
    Ye, Linwei
    Zhang, Tianhong
    Xu, Lihua
    Wang, Jijun
    IMAGE AND VISION COMPUTING, 2022, 124
  • [50] Effective Few-Shot Named Entity Linking by Meta-Learning
    Li, Xiuxing
    Li, Zhenyu
    Zhang, Zhengyan
    Liu, Ning
    Yuan, Haitao
    Zhang, Wei
    Liu, Zhiyuan
    Wang, Jianyong
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 178 - 191