Exploring the effectiveness of instruction tuning in biomedical language processing

被引：1

作者：

Rohanian, Omid ^{[1
,2
]}

Nouriborji, Mohammadmahdi ^{[2
,3
]}

Kouchaki, Samaneh ^{[4
]}

Nooralahzadeh, Farhad ^{[5
,6
]}

Clifton, Lei ^{[7
]}

Clifton, David A. ^{[1
,8
]}

机构：

[1] Univ Oxford, Dept Engn Sci, Oxford, England

[2] NLPie Res, Oxford, England

[3] Sharif Univ Technol, Tehran, Iran

[4] Univ Surrey, Dept Elect & Elect Engn, Guildford, England

[5] Univ Zurich, Zurich, Switzerland

[6] Univ Hosp Zurich, Zurich, Switzerland

[7] Univ Oxford, Nuffield Dept Populat Hlth, Oxford, England

[8] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE IN MEDICINE | 2024年 / 158卷

关键词：

Instruction tuning; Biomedical NLP; Named entity recognition; Relation extraction; Medical NLI; Llama2-MedTuned;

D O I：

10.1016/j.artmed.2024.103007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately 200,000 instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our instruction-based tasks. This initiative represents an important step in utilising such models to achieve results on par with specialised encoder-only models like BioBERT and BioClinicalBERT for various classical biomedical NLP tasks. Our work includes an analysis of the dataset's composition and its impact on model performance, providing insights into the intricacies of instruction tuning. By sharing our codes, models, and the distinctively assembled instruction-based dataset, we seek to encourage ongoing research and development in this area.2

引用

页数：9

共 50 条

[31] LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Luo, Chuwei
Shen, Yufan
Zhu, Zhaoqing
Zheng, Qi
Yu, Zhi
Yao, Cong
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15630 - 15640
[32] Tuna: Instruction Tuning using Feedback from Large Language Models
Li, Haoran
Liu, Yiran
Zhang, Xingxing
Lu, Wei
Wei, Furu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15146 - 15163
[33] Exploring input processing in the classroom: An experimental comparison of processing instruction and enriched input
Marsden, Emma
LANGUAGE LEARNING, 2006, 56 (03) : 507 - 566
[34] An Empirical Study of Instruction-tuning Large Language Models in Chinese
Si, Qingyi
Wang, Tong
Lin, Zheng
Zhang, Xu
Cao, Yanan
Wang, Weiping
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4086 - 4107
[35] IAPT: Instruction-Aware Prompt Tuning for Large Language Models
Zhu, Wei
Tian, Aaron Xuxiang
Yin, Congrui
Ni, Yuan
Wang, Xiaoling
Xie, Guotong
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 14285 - 14304
[36] Word embeddings for biomedical natural language processing: A survey
Chiu, Billy
Baker, Simon
LANGUAGE AND LINGUISTICS COMPASS, 2020, 14 (12):
[37] Recent advances in natural language processing for biomedical applications
Collier, Nigel
Nazarenko, Adeline
Baud, Robert
Ruch, Patrick
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2006, 75 (06) : 413 - 417
[38] Biomedical language processing: What's beyond PubMed?
Hunter, L
Cohen, KB
MOLECULAR CELL, 2006, 21 (05) : 589 - 594
[39] ITKBoard: A visual dataflow language for biomedical image processing
Le, Hoang D. K.
Li, Rongxin
Ourselin, Sebastien
Potter, John M.
ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL PL/DPS/KE/MUSE, 2007, : 13 - 21
[40] NATURAL-LANGUAGE PROCESSING IN BIOMEDICAL LABORATORY COMPUTING
SAGER, N
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1985, 32 (10) : 884 - 884

← 1 2 3 4 5 →