Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

Cited by: 0
Authors
Di Zhao [1]
Wenxuan Mu [2]
Xiangxing Jia [3]
Shuang Liu [1]
Yonghe Chu [1]
Jiana Meng [1]
Hongfei Lin [4]
Affiliations
[1] Dalian Minzu University, School of Computer Science and Engineering
[2] Dalian University of Technology, School of Computer Science and Technology
[3] Postdoctoral Workstation of Dalian Yongia Electronic Technology Co., Ltd
[4] Nantong University
Keywords
Few-shot learning; ChatGPT; Data augmentation; Named entity recognition
DOI
10.1186/s13040-025-00443-y
Abstract
Named Entity Recognition (NER) is a fundamental task in biomedical text processing. Because labeled data are scarce, researchers have turned to few-shot learning, but matching the performance of fully supervised methods remains difficult in few-shot scenarios. This paper addresses two main issues. First, existing data augmentation methods primarily replace content in the original text, which can distort its semantics. Second, current approaches often neglect sentence features at multiple scales. To overcome these challenges, we use ChatGPT to generate enriched data with distinct semantics for the same entities, thereby reducing noisy data. We also employ dynamic convolution to capture multi-scale semantic information in sentences and enhance the feature representation based on PubMedBERT. We evaluated the method on four biomedical NER datasets (BC5CDR-Disease, NCBI, BioNLP11EPI, BioNLP13GE), and the results exceeded those of the current state-of-the-art models in most few-shot scenarios, including mainstream large language models such as ChatGPT. The results confirm the effectiveness of the proposed method in data augmentation and model generalization.
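The augmentation idea in the abstract (new sentences that reuse the same entities) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the prompt wording, the `tokenize`, `build_prompt`, and `bio_tags` helpers, and the rule of discarding a generated sentence when a mention is missing are all assumptions made for illustration. No LLM is called; the sketch covers only prompt construction and label projection.

```python
import re
from typing import List, Optional, Tuple

Entity = Tuple[str, str]  # (mention text, entity type); hypothetical representation


def tokenize(text: str) -> List[str]:
    """Split into word tokens and single punctuation marks (illustrative only)."""
    return re.findall(r"\w+|[^\w\s]", text)


def build_prompt(sentence: str, entities: List[Entity]) -> str:
    """Hypothetical prompt asking an LLM for a new sentence with the same mentions."""
    mentions = ", ".join(f'"{text}" ({etype})' for text, etype in entities)
    return (
        "Write a new biomedical sentence with a different meaning from the one "
        f"below, keeping these entity mentions verbatim: {mentions}.\n"
        f"Sentence: {sentence}"
    )


def bio_tags(tokens: List[str], entities: List[Entity]) -> Optional[List[str]]:
    """Project entity mentions onto tokens as BIO tags.

    Returns None if any mention is absent, so the caller can drop the
    generated sentence rather than train on a mislabeled example.
    """
    tags = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    for text, etype in entities:
        span = [t.lower() for t in tokenize(text)]
        if not span:
            return None
        found = False
        for start in range(len(tokens) - len(span) + 1):
            end = start + len(span)
            untagged = all(tag == "O" for tag in tags[start:end])
            if lowered[start:end] == span and untagged:
                tags[start] = f"B-{etype}"
                for i in range(start + 1, end):
                    tags[i] = f"I-{etype}"
                found = True
        if not found:
            return None
    return tags
```

In use, a generated sentence would be tokenized and passed to `bio_tags` with the original entity list; a `None` result marks a candidate to discard, which is one simple way to keep noisy examples out of the few-shot training set.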
Related papers
50 in total
  • [31] A multi-step loss meta-learning method based on multi-scale feature extraction for few-shot fault diagnosis
    Xu, Zhenheng
    Liu, Zhong
    Tian, Bing
    Lv, Qiancheng
    Liu, Hu
    INSIGHT, 2024, 66 (05) : 294 - 304
  • [32] Few-shot Partial Multi-label Learning with Data Augmentation
    Sun, Yifan
    Zhao, Yunfeng
    Yu, Guoxian
    Yan, Zhongmin
    Domeniconi, Carlotta
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 478 - 487
  • [33] Few-Shot Image Classification Based on Multi-Scale Label Propagation
    Wang H.
    Tian S.
    Tang Q.
    Chen D.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (07): : 1486 - 1495
  • [34] Adaptive multi-scale transductive information propagation for few-shot learning
    Fu, Sichao
    Liu, Baodi
    Liu, Weifeng
    Zou, Bin
    You, Xinhua
    Peng, Qinmu
    Jing, Xiao-Yuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [35] Multi-scale Few-Shot Classification Model Based on Attention Mechanism
    Xu, Yi
    Zhu, Qisheng
    Pan, ZhengYue
    Liu, Yin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 476 - 487
  • [36] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771
  • [37] Multi-scale fusion for few-shot remote sensing image classification
    Qiao, Xujian
    Xing, Lei
    Han, Anxun
    Liu, Weifeng
    Liu, Baodi
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 6012 - 6032
  • [38] A Progressive Multi-Scale Relation Network for Few-Shot Image Classification
    Tong, Le
    Zhu, Renchaoli
    Li, Tianjiu
    Li, Xinran
    Zhou, Xiaoping
    IEEE ACCESS, 2024, 12 : 157039 - 157049
  • [39] Joint data augmentation and knowledge distillation for few-shot continual relation extraction
    Wei, Zhongcheng
    Zhang, Yunping
    Lian, Bin
    Fan, Yongjian
    Zhao, Jijun
    APPLIED INTELLIGENCE, 2024, 54 (04) : 3516 - 3528