Black-Box Prompt Tuning With Subspace Learning

被引:0
|
作者
Zheng, Yuanhang [1 ]
Tan, Zhixing [2 ]
Li, Peng [3 ,4 ]
Liu, Yang [1 ,3 ,4 ,5 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Zhongguancun Lab, Beijing 100086, Peoples R China
[3] Tsinghua Univ, Inst AI Ind Res AIR, Beijing 100084, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai 200030, Peoples R China
[5] Jiangsu Collaborat Innovat Ctr Language Competence, Xuzhou 221116, Jiangsu, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Tuning; Closed box; Speech processing; Metalearning; Sun; Optimization; Black-box; large language models (LLMs); meta-learning; prompt tuning; subspace learning; ADAPTATION;
D O I
10.1109/TASLP.2024.3407519
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Black-box prompt tuning employs derivative-free optimization algorithms to learn prompts within low-dimensional subspaces rather than back-propagating through the network of Large Language Models (LLMs). Recent studies reveal that black-box prompt tuning lacks versatility across tasks and LLMs, which we believe is related to the suboptimal choice of subspaces. In this paper, we introduce Black-box prompt tuning with Subspace Learning (BSL) to enhance the versatility of black-box prompt tuning. Based on the assumption that nearly optimal prompts for similar tasks reside in a common subspace, we propose identifying such subspaces through meta-learning on a collection of similar source tasks. Consequently, for a target task that shares similarities with the source tasks, we expect that optimizing within the identified subspace can yield a prompt that performs well on the target task. Experimental results confirm that our BSL framework consistently achieves competitive performance across various downstream tasks and LLMs.
引用
收藏
页码:3002 / 3013
页数:12
相关论文
共 50 条
  • [21] Automatic Hyper-Parameter Tuning for Black-box LiDAR Odometry
    Koide, Kenji
    Yokozuka, Masashi
    Oishi, Shuji
    Banno, Atsuhiko
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5069 - 5074
  • [22] MetricOpt: Learning to Optimize Black-Box Evaluation Metrics
    Huang, Chen
    Zhai, Shuangfei
    Guo, Pengsheng
    Susskind, Josh
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 174 - 183
  • [23] Boosting Black-Box Adversarial Attacks with Meta Learning
    Fu, Junjie
    Sun, Jian
    Wang, Gang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7308 - 7313
  • [24] Deep Black-Box Reinforcement Learning with Movement Primitives
    Otto, Fabian
    Celik, Onur
    Zhou, Hongyi
    Ziesche, Hanna
    Ngo Anh Vien
    Neumann, Gerhard
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1244 - 1265
  • [25] Drift Detection for Black-Box Deep Learning Models
    Piano, Luca
    Garcea, Fabio
    Cavallone, Andrea
    Vazquez, Ignacio Aparicio
    Morra, Lia
    Lamberti, Fabrizio
    IT PROFESSIONAL, 2024, 26 (02) : 24 - 31
  • [26] ROCK☆ - Efficient Black-box Optimization for Policy Learning
    Hwangbo, Jemin
    Gehring, Christian
    Sommer, Hannes
    Siegwart, Roland
    Buchli, Jonas
    2014 14TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2014, : 535 - 540
  • [27] Policy Learning with an Effcient Black-Box Optimization Algorithm
    Hwangbo, Jemin
    Gehring, Christian
    Sommer, Hannes
    Siegwart, Roland
    Buchli, Jonas
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2015, 12 (03)
  • [28] Generalizable Black-Box Adversarial Attack With Meta Learning
    Yin, Fei
    Zhang, Yong
    Wu, Baoyuan
    Feng, Yan
    Zhang, Jingyi
    Fan, Yanbo
    Yang, Yujiu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1804 - 1818
  • [29] Learning outside the Black-Box: The pursuit of interpretable models
    Crabbe, Jonathan
    Zhang, Yao
    Zame, William R.
    van der Schaar, Mihaela
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] Practical Black-Box Attacks against Machine Learning
    Papernot, Nicolas
    McDaniel, Patrick
    Goodfellow, Ian
    Jha, Somesh
    Celik, Z. Berkay
    Swami, Ananthram
    PROCEEDINGS OF THE 2017 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIA CCS'17), 2017, : 506 - 519