InstructEdit: Instruction-Based Knowledge Editing for Large Language Models

被引:0
|
作者
Zhang, Ningyu [1 ]
Tian, Bozhong [1 ]
Cheng, Siyuan [2 ]
Liang, Xiaozhuan [2 ]
Hu, Yi [2 ]
Xue, Kouying [2 ]
Gou, Yanjie [2 ]
Chen, Xi [2 ]
Chen, Huajun [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] Tencent, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge editing for large language models can offer an efficient solution to alter a model's behavior without negatively impacting the overall performance. However, the current approaches encounter issues with limited generalizability across tasks, necessitating one distinct editor for each task, significantly hindering the broader applications. To address this, we take the first step to analyze the multi-task generalization issue in knowledge editing. Specifically, we develop an instruction-based editing technique, termed InstructEdit, which facilitates the editor's adaptation to various task performances simultaneously using simple instructions. With only one unified editor for each LLM, we empirically demonstrate that InstructEdit can improve the editor's control, leading to an average 14.86% increase in Reliability in multi-task editing setting. Furthermore, experiments involving holdout unseen task illustrate that InstructEdit consistently surpass previous strong baselines. To further investigate the underlying mechanisms of instruction-based knowledge editing, we analyze the principal components of the editing gradient directions, which unveils that instructions can help control optimization direction with stronger OOD generalization.
引用
收藏
页码:6633 / 6641
页数:9
相关论文
共 50 条
  • [21] Gestalt compositionality and instruction-based meaning construction
    Gilles Col
    Jeanne Aptekman
    Stéphanie Girault
    Thierry Poibeau
    Cognitive Processing, 2012, 13 : 151 - 170
  • [22] Incorporating Instruction-Based Sampling into AMD CodeAnalyst
    Drongowski, Paul
    Yu, Lei
    Swehosky, Frank
    Suthikulpanit, Suravee
    Richter, Robert
    2010 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2010), 2010, : 119 - 120
  • [23] Editing Large Language Models: Problems, Methods, and Opportunities
    Yao, Yunzhi
    Wang, Peng
    Tian, Bozhong
    Chen, Siyuan
    Li, Zhoubo
    Deng, Shumin
    Chen, Huajun
    Zhang, Ningyu
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 10222 - 10240
  • [24] Editing Graph Visualizations by Prompting Large Language Models
    Argyriou, Evmorfia
    Boehm, Jens
    Eberle, Anne
    Gonser, Julius
    Lumpp, Anna-Lena
    Niedermann, Benjamin
    Schwarzkopf, Fabian
    GRAPH DRAWING AND NETWORK VISUALIZATION, GD 2023, PT II, 2023, 14466 : 253 - 254
  • [25] Gestalt compositionality and instruction-based meaning construction
    Col, Gilles
    Aptekman, Jeanne
    Girault, Stephanie
    Poibeau, Thierry
    COGNITIVE PROCESSING, 2012, 13 (02) : 151 - 170
  • [27] OCTOPACK: INSTRUCTION TUNING CODE LARGE LANGUAGE MODELS
    Muennighoff, Niklas
    Liu, Qian
    Zebaze, Armel
    Zheng, Qinkai
    Hui, Binyuan
    Zhuo, Terry Yue
    Singh, Swayam
    Tang, Xiangru
    von Werra, Leandro
    Longpre, Shayne
    arXiv, 2023,
  • [28] GraphGPT: Graph Instruction Tuning for Large Language Models
    Tang, Jiabin
    Yang, Yuhao
    Wei, Wei
    Shi, Lei
    Su, Lixin
    Cheng, Suqi
    Yin, Dawei
    Huang, Chao
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 491 - 500
  • [29] History Matters: Temporal Knowledge Editing in Large Language Model
    Yin, Xunjian
    Jiang, Jin
    Yang, Liming
    Wan, Xiaojun
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19413 - 19421
  • [30] Construction of Legal Knowledge Graph Based on Knowledge-Enhanced Large Language Models
    Li, Jun
    Qian, Lu
    Liu, Peifeng
    Liu, Taoxiong
    INFORMATION, 2024, 15 (11)