A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER

被引:7
|
作者
Dong, Guanting [1 ]
Wang, Zechen [1 ]
Zhao, Jinxu [1 ]
Zhao, Gang [1 ]
Guo, Daichi [1 ]
Fu, Dayuan [1 ]
Hui, Tingfeng [1 ]
Zeng, Chen [1 ]
He, Keqing [2 ]
Li, Xuefeng [1 ]
Wang, Liwen [1 ]
Cui, Xinyue [1 ]
Xu, Weiran [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan Grp Beijing, Beijing, Peoples R China
关键词
Few-shot NER; Multi-Task; Semantic Decomposition; Pre-training;
D O I
10.1145/3583780.3614766
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of few-shot named entity recognition is to identify named entities with limited labeled instances. Previous works have primarily focused on optimizing the traditional token-wise classification framework, while neglecting the exploration of information based on NER data characteristics. To address this issue, we propose a Multi-Task Semantic Decomposition Framework via Joint Task-specific Pre-training (MSDP) for few-shot NER. Drawing inspiration from demonstration-based and contrastive learning, we introduce two novel pre-training tasks: Demonstration-based Masked Language Modeling (MLM) and Class Contrastive Discrimination. These tasks effectively incorporate entity boundary information and enhance entity representation in Pre-trained Language Models (PLMs). In the downstream main task, we introduce a multitask joint optimization framework with the semantic decomposing method, which facilitates the model to integrate two different semantic information for entity classification. Experimental results of two few-shot NER benchmarks demonstrate that MSDP consistently outperforms strong baselines by a large margin. Extensive analyses validate the effectiveness and generalization of MSDP.
引用
收藏
页码:430 / 440
页数:11
相关论文
共 50 条
  • [21] Task-Specific Data Augmentation for Zero-shot and Few-shot Stance Detection
    Zhang, Jiarui
    Wu, Shaojuan
    Zhang, Xiaowang
    Feng, Zhiyong
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 160 - 163
  • [22] Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
    Mi, Fei
    Zhou, Wanhao
    Cai, Fengyu
    Kong, Lingjing
    Huang, Minlie
    Faltings, Boi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1887 - 1898
  • [23] FSMT: Few-shot object detection via Multi-Task Decoupled
    Qin, Jiahui
    Xu, Yang
    Fu, Yifan
    Wu, Zebin
    Wei, Zhihui
    PATTERN RECOGNITION LETTERS, 2025, 192 : 8 - 14
  • [24] Multi-Task Few-Shot Text Steganalysis Based on Capsule Network
    Yang Y.
    Zhang Z.-W.
    Wen J.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (12): : 2592 - 2604
  • [25] Multi-task Based Few-Shot Learning for Disease Similarity Measurement
    Gao, Jianliang
    Tian, Ling
    Liu, Yuxin
    Wang, Jianxin
    Li, Zhao
    Hu, Xiaohua
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 263 - 268
  • [26] Few-Shot Learning of Compact Models via Task-Specific Meta Distillation
    Wu, Yong
    Chanda, Shekhor
    Hosseinzadeh, Mehrdad
    Liu, Zhi
    Wang, Yang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6254 - 6263
  • [27] Task-specific Part Discovery for Fine-grained Few-shot Classification
    Wei, Yongxian
    Wei, Xiu-Shen
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (05) : 954 - 965
  • [28] Effectiveness of Pre-training for Few-shot Intent Classification
    Zhang, Haode
    Zhang, Yuwei
    Zhan, Li-Ming
    Chen, Jiaxin
    Shi, Guangyuan
    Wu, Xiao-Ming
    Lam, Albert Y. S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1114 - 1120
  • [29] Named Entity Recognition for Few-Shot Power Dispatch Based on Multi-Task
    Tan, Zhixiang
    Chen, Yan
    Liang, Zengfu
    Meng, Qi
    Lin, Dezhao
    ELECTRONICS, 2023, 12 (16)
  • [30] Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues
    Cordier, Thibault
    Urvoy, Tanguy
    Lefevre, Fabrice
    Rojas-Barahona, Lina M.
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 432 - 441