Decomposed Meta-Learning for Few-Shot Sequence Labeling

被引:1
|
作者
Ma, Tingting [1 ]
Wu, Qianhui [2 ]
Jiang, Huiqiang [3 ]
Lin, Jieru [1 ]
Karlsson, Borje F. [2 ]
Zhao, Tiejun [1 ]
Lin, Chin-Yew [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Microsoft Res Asia, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Labeling; Metalearning; Tagging; Detectors; Adaptation models; Speech processing; Few-shot sequence labeling; task decomposition; meta-learning; NAMED ENTITY RECOGNITION;
D O I
10.1109/TASLP.2024.3372879
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Few-shot sequence labeling is a general problem formulation for many natural language understanding tasks in data-scarcity scenarios, which require models to generalize to new types via only a few labeled examples. Recent advances mostly adopt metric-based meta-learning and thus face the challenges of modeling the miscellaneous Other prototype and the inability to generalize to classes with large domain gaps. To overcome these challenges, we propose a decomposed meta-learning framework for few-shot sequence labeling that breaks down the task into few-shot mention detection and few-shot type classification, and sequentially tackles them via meta-learning. Specifically, we employ model-agnostic meta-learning (MAML) to prompt the mention detection model to learn boundary knowledge shared across types. With the detected mention spans, we further leverage the MAML-enhanced span-level prototypical network for few-shot type classification. In this way, the decomposition framework bypasses the requirement of modeling the miscellaneous Other prototype. Meanwhile, the adoption of the MAML algorithm enables us to explore the knowledge contained in support examples more efficiently, so that our model can quickly adapt to new types using only a few labeled examples. Under our framework, we explore a basic implementation that uses two separate models for the two subtasks. We further propose a joint model to reduce model size and inference time, making our framework more applicable for scenarios with limited resources. Extensive experiments on nine benchmark datasets, including named entity recognition, slot tagging, event detection, and part-of-speech tagging, show that the proposed approach achieves start-of-the-art performance across various few-shot sequence labeling tasks.
引用
收藏
页码:1980 / 1993
页数:14
相关论文
共 50 条
  • [1] Decomposed Meta-Learning for Few-Shot Named Entity Recognition
    Ma, Tingting
    Jiang, Huiqiang
    Wu, Qianhui
    Zhao, Tiejun
    Lin, Chin-Yew
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1584 - 1596
  • [2] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    PATTERN RECOGNITION, 2021, 116
  • [3] Meta-Learning for Few-Shot NMT Adaptation
    Sharaf, Amr
    Hassan, Hany
    Daume, Hal, III
    NEURAL GENERATION AND TRANSLATION, 2020, : 43 - 53
  • [4] Fair Meta-Learning For Few-Shot Classification
    Zhao, Chen
    Li, Changbin
    Li, Jincheng
    Chen, Feng
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 275 - 282
  • [5] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [6] META-LEARNING WITH ATTENTION FOR IMPROVED FEW-SHOT LEARNING
    Hou, Zejiang
    Walid, Anwar
    Kung, Sun-Yuan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2725 - 2729
  • [7] Few-shot driver identification via meta-learning
    Lu, Lin
    Xiong, Shengwu
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [8] Few-Shot Directed Meta-Learning for Image Classification
    Ouyang, Jihong
    Duan, Ganghai
    Liu, Siguang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [9] Meta-learning for few-shot time series forecasting
    Xiao, Feng
    Liu, Lu
    Han, Jiayu
    Guo, Degui
    Wang, Shang
    Cui, Hai
    Peng, Tao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 325 - 341
  • [10] Few-shot driver identification via meta-learning
    Lu, Lin
    Xiong, Shengwu
    Expert Systems with Applications, 2022, 203