Black-Box Prompt Tuning With Subspace Learning

Cited by: 0
Authors
Zheng, Yuanhang [1]
Tan, Zhixing [2]
Li, Peng [3, 4]
Liu, Yang [1, 3, 4, 5]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Zhongguancun Lab, Beijing 100086, Peoples R China
[3] Tsinghua Univ, Inst AI Ind Res AIR, Beijing 100084, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai 200030, Peoples R China
[5] Jiangsu Collaborat Innovat Ctr Language Competence, Xuzhou 221116, Jiangsu, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Task analysis; Tuning; Closed box; Speech processing; Metalearning; Sun; Optimization; Black-box; large language models (LLMs); meta-learning; prompt tuning; subspace learning; ADAPTATION;
DOI
10.1109/TASLP.2024.3407519
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Black-box prompt tuning employs derivative-free optimization algorithms to learn prompts within low-dimensional subspaces rather than back-propagating through the network of Large Language Models (LLMs). Recent studies reveal that black-box prompt tuning lacks versatility across tasks and LLMs, which we believe is related to the suboptimal choice of subspaces. In this paper, we introduce Black-box prompt tuning with Subspace Learning (BSL) to enhance the versatility of black-box prompt tuning. Based on the assumption that nearly optimal prompts for similar tasks reside in a common subspace, we propose identifying such subspaces through meta-learning on a collection of similar source tasks. Consequently, for a target task that shares similarities with the source tasks, we expect that optimizing within the identified subspace can yield a prompt that performs well on the target task. Experimental results confirm that our BSL framework consistently achieves competitive performance across various downstream tasks and LLMs.
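The core loop described in the abstract, derivative-free search over a low-dimensional vector that is projected up to the prompt space, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the black box is a toy quadratic loss standing in for LLM queries, the projection matrix is random (BSL would instead identify the subspace by meta-learning on source tasks, which is not shown here), and the optimizer is a simple (1+1) evolution strategy. All function and variable names are hypothetical.

```python
import random


def make_projection(prompt_dim, subspace_dim, rng):
    """Build a fixed random matrix A (prompt_dim x subspace_dim).

    Assumption: a random projection stands in for the learned subspace.
    """
    scale = 1.0 / subspace_dim ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(subspace_dim)]
            for _ in range(prompt_dim)]


def project(A, z):
    """Map a low-dimensional vector z to a prompt vector A z."""
    return [sum(a * zi for a, zi in zip(row, z)) for row in A]


def tune(loss_fn, A, subspace_dim, steps, sigma, rng):
    """(1+1) evolution strategy: keep a perturbation only if the loss drops.

    Only scalar loss values are used; no gradients are required.
    """
    z = [0.0] * subspace_dim
    best = loss_fn(project(A, z))
    for _ in range(steps):
        candidate = [zi + rng.gauss(0.0, sigma) for zi in z]
        value = loss_fn(project(A, candidate))
        if value < best:
            z, best = candidate, value
    return z, best


rng = random.Random(0)
prompt_dim, subspace_dim = 32, 4
A = make_projection(prompt_dim, subspace_dim, rng)

# Toy target prompt that lies inside the subspace (hypothetical setup).
z_star = [rng.gauss(0.0, 1.0) for _ in range(subspace_dim)]
target = project(A, z_star)


def black_box_loss(prompt):
    """Stand-in for an LLM query: returns a scalar loss and nothing else."""
    return sum((p - t) ** 2 for p, t in zip(prompt, target))


initial_loss = black_box_loss(project(A, [0.0] * subspace_dim))
z_hat, final_loss = tune(black_box_loss, A, subspace_dim,
                         steps=500, sigma=0.1, rng=rng)
print(initial_loss, final_loss)
```

In this toy setup the target lies inside the searched subspace, so the search can drive the loss down. If the target lay outside the span of A, no choice of z could reach it, which mirrors the abstract's point that the choice of subspace limits what black-box tuning can find.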
Pages: 3002-3013
Page count: 12