InstructEdit: Instruction-Based Knowledge Editing for Large Language Models

被引：0

作者：

Zhang, Ningyu ^{[1
]}

Tian, Bozhong ^{[1
]}

Cheng, Siyuan ^{[2
]}

Liang, Xiaozhuan ^{[2
]}

Hu, Yi ^{[2
]}

Xue, Kouying ^{[2
]}

Gou, Yanjie ^{[2
]}

Chen, Xi ^{[2
]}

Chen, Huajun ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China

[2] Tencent, Shenzhen, Guangdong, Peoples R China

来源：

PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Knowledge editing for large language models can offer an efficient solution to alter a model's behavior without negatively impacting the overall performance. However, the current approaches encounter issues with limited generalizability across tasks, necessitating one distinct editor for each task, significantly hindering the broader applications. To address this, we take the first step to analyze the multi-task generalization issue in knowledge editing. Specifically, we develop an instruction-based editing technique, termed InstructEdit, which facilitates the editor's adaptation to various task performances simultaneously using simple instructions. With only one unified editor for each LLM, we empirically demonstrate that InstructEdit can improve the editor's control, leading to an average 14.86% increase in Reliability in multi-task editing setting. Furthermore, experiments involving holdout unseen task illustrate that InstructEdit consistently surpass previous strong baselines. To further investigate the underlying mechanisms of instruction-based knowledge editing, we analyze the principal components of the editing gradient directions, which unveils that instructions can help control optimization direction with stronger OOD generalization.

引用

页码：6633 / 6641

页数：9

共 50 条

[41] Rapid Instruction-Based Task Learning (RITL) in Schizophrenia
Sheffield, Julia M.
Ruge, Hannes
Kandala, Sridhar
Barch, Deanna M.
JOURNAL OF ABNORMAL PSYCHOLOGY, 2018, 127 (05) : 513 - 528
[42] Gestalt compositionality and dynamic instruction-based meaning construction
Col, Gilles
Aptekman, Jeanne
Girault, Stephanie
Victorri, Bernard
COGNITEXTES, 2010, 5
[43] A descriptive assessment of instruction-based interactions in the preschool classroom
Ndoro, VW
Hanley, GP
Tiger, JH
Heal, NA
JOURNAL OF APPLIED BEHAVIOR ANALYSIS, 2006, 39 (01) : 79 - 90
[44] Quantifying Domain Knowledge in Large Language Models
Sayenju, Sudhashree
Aygun, Ramazan
Franks, Bill
Johnston, Sereres
Lee, George
Choi, Hansook
Modgil, Girish
2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 193 - 194
[45] Knowledge management in organization and the large language models
Zelenkov, Yu. A.
ROSSIISKII ZHURNAL MENEDZHMENTA-RUSSIAN MANAGEMENT JOURNAL, 2024, 22 (03): : 573 - 601
[46] Large language models encode clinical knowledge
Singhal, Karan
Azizi, Shekoofeh
Tu, Tao
Mahdavi, S. Sara
Wei, Jason
Chung, Hyung Won
Scales, Nathan
Tanwani, Ajay
Cole-Lewis, Heather
Pfohl, Stephen
Payne, Perry
Seneviratne, Martin
Gamble, Paul
Kelly, Chris
Babiker, Abubakr
Schaerli, Nathanael
Chowdhery, Aakanksha
Mansfield, Philip
Demner-Fushman, Dina
Arcas, Blaise Aguera y
Webster, Dale
Corrado, Greg S.
Matias, Yossi
Chou, Katherine
Gottweis, Juraj
Tomasev, Nenad
Liu, Yun
Rajkomar, Alvin
Barral, Joelle
Semturs, Christopher
Karthikesalingam, Alan
Natarajan, Vivek
NATURE, 2023, 620 (7972) : 172 - +
[47] Large language models encode clinical knowledge
Karan Singhal
Shekoofeh Azizi
Tao Tu
S. Sara Mahdavi
Jason Wei
Hyung Won Chung
Nathan Scales
Ajay Tanwani
Heather Cole-Lewis
Stephen Pfohl
Perry Payne
Martin Seneviratne
Paul Gamble
Chris Kelly
Abubakr Babiker
Nathanael Schärli
Aakanksha Chowdhery
Philip Mansfield
Dina Demner-Fushman
Blaise Agüera y Arcas
Dale Webster
Greg S. Corrado
Yossi Matias
Katherine Chou
Juraj Gottweis
Nenad Tomasev
Yun Liu
Alvin Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
Nature, 2023, 620 : 172 - 180
[48] Do large language models "understand" their knowledge?
Venkatasubramanian, Venkat
AICHE JOURNAL, 2025, 71 (03)
[49] Debiasing Large Language Models with Structured Knowledge
Ma, Congda
Zhao, Tianyu
Okumura, Manabu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 10274 - 10287
[50] Evaluating Intelligence and Knowledge in Large Language Models
Bianchini, Francesco
TOPOI-AN INTERNATIONAL REVIEW OF PHILOSOPHY, 2025, 44 (01): : 163 - 173

← 1 2 3 4 5 →