Exploring the effectiveness of instruction tuning in biomedical language processing

被引:1
|
作者
Rohanian, Omid [1 ,2 ]
Nouriborji, Mohammadmahdi [2 ,3 ]
Kouchaki, Samaneh [4 ]
Nooralahzadeh, Farhad [5 ,6 ]
Clifton, Lei [7 ]
Clifton, David A. [1 ,8 ]
机构
[1] Univ Oxford, Dept Engn Sci, Oxford, England
[2] NLPie Res, Oxford, England
[3] Sharif Univ Technol, Tehran, Iran
[4] Univ Surrey, Dept Elect & Elect Engn, Guildford, England
[5] Univ Zurich, Zurich, Switzerland
[6] Univ Hosp Zurich, Zurich, Switzerland
[7] Univ Oxford, Nuffield Dept Populat Hlth, Oxford, England
[8] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China
关键词
Instruction tuning; Biomedical NLP; Named entity recognition; Relation extraction; Medical NLI; Llama2-MedTuned;
D O I
10.1016/j.artmed.2024.103007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately 200,000 instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our instruction-based tasks. This initiative represents an important step in utilising such models to achieve results on par with specialised encoder-only models like BioBERT and BioClinicalBERT for various classical biomedical NLP tasks. Our work includes an analysis of the dataset's composition and its impact on model performance, providing insights into the intricacies of instruction tuning. By sharing our codes, models, and the distinctively assembled instruction-based dataset, we seek to encourage ongoing research and development in this area.2
引用
收藏
页数:9
相关论文
共 50 条
  • [31] LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
    Luo, Chuwei
    Shen, Yufan
    Zhu, Zhaoqing
    Zheng, Qi
    Yu, Zhi
    Yao, Cong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15630 - 15640
  • [32] Tuna: Instruction Tuning using Feedback from Large Language Models
    Li, Haoran
    Liu, Yiran
    Zhang, Xingxing
    Lu, Wei
    Wei, Furu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15146 - 15163
  • [33] Exploring input processing in the classroom: An experimental comparison of processing instruction and enriched input
    Marsden, Emma
    LANGUAGE LEARNING, 2006, 56 (03) : 507 - 566
  • [34] An Empirical Study of Instruction-tuning Large Language Models in Chinese
    Si, Qingyi
    Wang, Tong
    Lin, Zheng
    Zhang, Xu
    Cao, Yanan
    Wang, Weiping
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4086 - 4107
  • [35] IAPT: Instruction-Aware Prompt Tuning for Large Language Models
    Zhu, Wei
    Tian, Aaron Xuxiang
    Yin, Congrui
    Ni, Yuan
    Wang, Xiaoling
    Xie, Guotong
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 14285 - 14304
  • [36] Word embeddings for biomedical natural language processing: A survey
    Chiu, Billy
    Baker, Simon
    LANGUAGE AND LINGUISTICS COMPASS, 2020, 14 (12):
  • [37] Recent advances in natural language processing for biomedical applications
    Collier, Nigel
    Nazarenko, Adeline
    Baud, Robert
    Ruch, Patrick
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2006, 75 (06) : 413 - 417
  • [38] Biomedical language processing: What's beyond PubMed?
    Hunter, L
    Cohen, KB
    MOLECULAR CELL, 2006, 21 (05) : 589 - 594
  • [39] ITKBoard: A visual dataflow language for biomedical image processing
    Le, Hoang D. K.
    Li, Rongxin
    Ourselin, Sebastien
    Potter, John M.
    ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL PL/DPS/KE/MUSE, 2007, : 13 - 21