A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER

被引:7
|
作者
Dong, Guanting [1 ]
Wang, Zechen [1 ]
Zhao, Jinxu [1 ]
Zhao, Gang [1 ]
Guo, Daichi [1 ]
Fu, Dayuan [1 ]
Hui, Tingfeng [1 ]
Zeng, Chen [1 ]
He, Keqing [2 ]
Li, Xuefeng [1 ]
Wang, Liwen [1 ]
Cui, Xinyue [1 ]
Xu, Weiran [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan Grp Beijing, Beijing, Peoples R China
关键词
Few-shot NER; Multi-Task; Semantic Decomposition; Pre-training;
D O I
10.1145/3583780.3614766
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of few-shot named entity recognition is to identify named entities with limited labeled instances. Previous works have primarily focused on optimizing the traditional token-wise classification framework, while neglecting the exploration of information based on NER data characteristics. To address this issue, we propose a Multi-Task Semantic Decomposition Framework via Joint Task-specific Pre-training (MSDP) for few-shot NER. Drawing inspiration from demonstration-based and contrastive learning, we introduce two novel pre-training tasks: Demonstration-based Masked Language Modeling (MLM) and Class Contrastive Discrimination. These tasks effectively incorporate entity boundary information and enhance entity representation in Pre-trained Language Models (PLMs). In the downstream main task, we introduce a multitask joint optimization framework with the semantic decomposing method, which facilitates the model to integrate two different semantic information for entity classification. Experimental results of two few-shot NER benchmarks demonstrate that MSDP consistently outperforms strong baselines by a large margin. Extensive analyses validate the effectiveness and generalization of MSDP.
引用
收藏
页码:430 / 440
页数:11
相关论文
共 50 条
  • [1] Multi-Task Supervised Alignment Pre-Training for Few-Shot Multimodal Sentiment Analysis
    Yang, Junyang
    Cao, Jiuxin
    Duan, Chengge
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [2] Graph Few-shot Learning with Task-specific Structures
    Wang, Song
    Chen, Chen
    Li, Jundong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [3] Multi-task classification network for few-shot learning
    Ji, Zhong
    Liu, Yuanheng
    Wang, Xuan
    Liu, Jingren
    Cao, Jiale
    Yu, Yunlong
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
  • [4] Multi-task Pre-training Language Model for Semantic Network Completion
    Li, Da
    Zhu, Boqing
    Yang, Sen
    Xu, Kele
    Yi, Ming
    He, Yukai
    Wang, Huaimin
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [5] A Task-Specific Meta-Learning Framework for Few-Shot Sound Event Detection
    Zhang, Tianyang
    Yang, Liping
    Gu, Xiaohua
    Wang, Yuyang
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [6] CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text
    Nandy, Abhilash
    Kapadnis, Manav Nitin
    Goyal, Pawan
    Ganguly, Niloy
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8793 - 8806
  • [7] Multi-task learning for few-shot biomedical relation extraction
    Moscato, Vincenzo
    Napolano, Giuseppe
    Postiglione, Marco
    Sperli, Giancarlo
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) : 13743 - 13763
  • [8] Adaptive Multi-task Learning for Few-Shot Object Detection
    Ren, Yan
    Li, Yanling
    Kong, Adams Wai-Kin
    COMPUTER VISION-ECCV 2024, PT VII, 2025, 15065 : 297 - 314
  • [9] Hierarchical Prompt Tuning for Few-Shot Multi-Task Learning
    Liu, Jingping
    Chen, Tao
    Liang, Zujie
    Jiang, Haiyun
    Xiao, Yanghua
    Wei, Feng
    Qian, Yuxi
    Hao, Zhenghong
    Han, Bing
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1556 - 1565
  • [10] Multi-task Self-supervised Few-Shot Detection
    Zhang, Guangyong
    Duan, Lijuan
    Wang, Wenjian
    Gong, Zhi
    Ma, Bian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 107 - 119