Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

被引:0
|
作者
Di Zhao [1 ]
Wenxuan Mu [2 ]
Xiangxing Jia [3 ]
Shuang Liu [1 ]
Yonghe Chu [1 ]
Jiana Meng [1 ]
Hongfei Lin [4 ]
机构
[1] Dalian Minzu University,School of Computer Science and Engineering
[2] Dalian University of Technology,School of Computer Science and Technology
[3] Postdoctoral Workstation of Dalian Yongia Electronic Technology Co.,undefined
[4] Ltd,undefined
[5] Nantong University,undefined
关键词
Few-shot learning; ChatGPT; Data augmentation; Named entity recognition;
D O I
10.1186/s13040-025-00443-y
中图分类号
学科分类号
摘要
Named Entity Recognition (NER) is a fundamental task in processing biomedical text. Due to the limited availability of labeled data, researchers have investigated few-shot learning methods to tackle this challenge. However, replicating the performance of fully supervised methods remains difficult in few-shot scenarios. This paper addresses two main issues. In terms of data augmentation, existing methods primarily focus on replacing content in the original text, which can potentially distort the semantics. Furthermore, current approaches often neglect sentence features at multiple scales. To overcome these challenges, we utilize ChatGPT to generate enriched data with distinct semantics for the same entities, thereby reducing noisy data. Simultaneously, we employ dynamic convolution to capture multi-scale semantic information in sentences and enhance feature representation based on PubMedBERT. We evaluated the experiments on four biomedical NER datasets (BC5CDR-Disease, NCBI, BioNLP11EPI, BioNLP13GE), and the results exceeded the current state-of-the-art models in most few-shot scenarios, including mainstream large language models like ChatGPT. The results confirm the effectiveness of the proposed method in data augmentation and model generalization.
引用
收藏
相关论文
共 50 条
  • [1] Multi-scale feature network for few-shot learning
    Mengya Han
    Ronggui Wang
    Juan Yang
    Lixia Xue
    Min Hu
    Multimedia Tools and Applications, 2020, 79 : 11617 - 11637
  • [2] Multi-scale feature network for few-shot learning
    Han, Mengya
    Wang, Ronggui
    Yang, Juan
    Xue, Lixia
    Hu, Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 11617 - 11637
  • [3] Few-Shot Learning With Enhancements to Data Augmentation and Feature Extraction
    Zhang, Yourun
    Gong, Maoguo
    Li, Jianzhao
    Feng, Kaiyuan
    Zhang, Mingyang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [4] Few-shot pulse wave contour classification based on multi-scale feature extraction
    Peng Lu
    Chao Liu
    Xiaobo Mao
    Yvping Zhao
    Hanzhang Wang
    Hongpo Zhang
    Lili Guo
    Scientific Reports, 11
  • [5] Few-Shot Learning Method for Multi-Scale Feature Aggregation
    Zeng, Wu
    Mao, Guojun
    Computer Engineering and Applications, 2023, 59 (15) : 151 - 159
  • [6] Few-shot pulse wave contour classification based on multi-scale feature extraction
    Lu, Peng
    Liu, Chao
    Mao, Xiaobo
    Zhao, Yvping
    Wang, Hanzhang
    Zhang, Hongpo
    Guo, Lili
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [7] Few-shot biomedical relation extraction using data augmentation and domain information
    Guo, Bocheng
    Zhao, Di
    Dong, Xin
    Meng, Jiana
    Lin, Hongfei
    NEUROCOMPUTING, 2024, 595
  • [8] Few-Shot Charge Prediction with Data Augmentation and Feature Augmentation
    Wang, Peipeng
    Zhang, Xiuguo
    Cao, Zhiying
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [9] MULTI-SCALE TEMPORAL FEATURE FUSION FOR FEW-SHOT ACTION RECOGNITION
    Lee, Jun-Tae
    Yun, Sungrack
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1785 - 1789
  • [10] Few-shot wildlife detection based on multi-scale context extraction
    Liu, Ke
    Lin, Shanling
    Shi, Xinyu
    Lin, Jianpu
    Lu, Shanhong
    Lin, Zhixian
    Guo, Tailiang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2025, 40 (03) : 516 - 526