Decomposed Meta-Learning for Few-Shot Sequence Labeling

被引:1
|
作者
Ma, Tingting [1 ]
Wu, Qianhui [2 ]
Jiang, Huiqiang [3 ]
Lin, Jieru [1 ]
Karlsson, Borje F. [2 ]
Zhao, Tiejun [1 ]
Lin, Chin-Yew [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Microsoft Res Asia, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Labeling; Metalearning; Tagging; Detectors; Adaptation models; Speech processing; Few-shot sequence labeling; task decomposition; meta-learning; NAMED ENTITY RECOGNITION;
D O I
10.1109/TASLP.2024.3372879
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Few-shot sequence labeling is a general problem formulation for many natural language understanding tasks in data-scarcity scenarios, which require models to generalize to new types via only a few labeled examples. Recent advances mostly adopt metric-based meta-learning and thus face the challenges of modeling the miscellaneous Other prototype and the inability to generalize to classes with large domain gaps. To overcome these challenges, we propose a decomposed meta-learning framework for few-shot sequence labeling that breaks down the task into few-shot mention detection and few-shot type classification, and sequentially tackles them via meta-learning. Specifically, we employ model-agnostic meta-learning (MAML) to prompt the mention detection model to learn boundary knowledge shared across types. With the detected mention spans, we further leverage the MAML-enhanced span-level prototypical network for few-shot type classification. In this way, the decomposition framework bypasses the requirement of modeling the miscellaneous Other prototype. Meanwhile, the adoption of the MAML algorithm enables us to explore the knowledge contained in support examples more efficiently, so that our model can quickly adapt to new types using only a few labeled examples. Under our framework, we explore a basic implementation that uses two separate models for the two subtasks. We further propose a joint model to reduce model size and inference time, making our framework more applicable for scenarios with limited resources. Extensive experiments on nine benchmark datasets, including named entity recognition, slot tagging, event detection, and part-of-speech tagging, show that the proposed approach achieves start-of-the-art performance across various few-shot sequence labeling tasks.
引用
收藏
页码:1980 / 1993
页数:14
相关论文
共 50 条
  • [21] MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
    Zhang, Baoquan
    Luo, Chuyao
    Yu, Demin
    Li, Xutao
    Lin, Huiwei
    Ye, Yunming
    Zhang, Bowen
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16687 - 16695
  • [22] Fast Few-Shot Classification by Few-Iteration Meta-Learning
    Tripathi, Ardhendu Shekhar
    Danelljan, Martin
    Van Gool, Luc
    Timofte, Radu
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9522 - 9528
  • [23] Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning
    Chen, Yinbo
    Liu, Zhuang
    Xu, Huijuan
    Darrell, Trevor
    Wang, Xiaolong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9042 - 9051
  • [24] Hierarchical Meta-Learning with Hyper-Tasks for Few-Shot Learning
    Guan, Yunchuan
    Liu, Yu
    Zhou, Ke
    Huang, Junyuan
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 587 - 596
  • [25] Decentralized federated meta-learning framework for few-shot multitask learning
    Li, Xiaoli
    Li, Yuzheng
    Wang, Jining
    Chen, Chuan
    Yang, Liu
    Zheng, Zibin
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8490 - 8522
  • [26] PERSONALIZED FACE AUTHENTICATION BASED ON FEW-SHOT META-LEARNING
    Shin, Chaehun
    Lee, Jangho
    Na, Byunggook
    Yoon, Sungroh
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3897 - 3901
  • [27] Few-Shot Classification Based on Sparse Dictionary Meta-Learning
    Jiang, Zuo
    Wang, Yuan
    Tang, Yi
    MATHEMATICS, 2024, 12 (19)
  • [28] Prototype Bayesian Meta-Learning for Few-Shot Image Classification
    Fu, Meijun
    Wang, Xiaomin
    Wang, Jun
    Yi, Zhang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [29] MetaDelta: A Meta-Learning System for Few-shot Image Classification
    Chen, Yudong
    Guan, Chaoyu
    Wei, Zhikun
    Wang, Xin
    Zhu, Wenwu
    AAAI WORKSHOP ON META-LEARNING AND METADL CHALLENGE, VOL 140, 2021, 140 : 17 - 28
  • [30] MetaMedSeg: Volumetric Meta-learning for Few-Shot Organ Segmentation
    Farshad, Azade
    Makarevich, Anastasia
    Belagiannis, Vasileios
    Navab, Nassir
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER (DART 2022), 2022, 13542 : 45 - 55