Stealing the Decoding Algorithms of Language Models

Cited by: 0
Authors
Naseh, Ali [1]
Krishna, Kalpesh [1]
Iyyer, Mohit [1]
Houmansadr, Amir [1]
Affiliations
[1] Univ Massachusetts Amherst, Amherst, MA 01003 USA
Keywords
Hyperparameter stealing; language models; decoding algorithms
DOI
10.1145/3576915.3616652
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A key component of generating text from modern language models (LMs) is the selection and tuning of decoding algorithms. These algorithms determine how text is generated from the internal probability distribution produced by the LM. Choosing a decoding algorithm and tuning its hyperparameters takes significant time, manual effort, and computation, and also requires extensive human evaluation. The identity and hyperparameters of such decoding algorithms are therefore considered extremely valuable to their owners. In this work, we show, for the first time, that an adversary with typical API access to an LM can steal the type and hyperparameters of its decoding algorithms at very low monetary cost. Our attack is effective against popular LMs used in text generation APIs, including GPT-2, GPT-3, and GPT-Neo. We demonstrate the feasibility of stealing such information for only a few dollars, e.g., $0.8, $1, $4, and $40 for the four versions of GPT-3.
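As a rough illustration of what "decoding algorithm" and "hyperparameters" can mean here, the sketch below applies three widely used strategies (temperature scaling, top-k truncation, and nucleus/top-p truncation) to a toy next-token distribution. The abstract does not name specific algorithms, so this choice, the function names, and the example logits are all assumptions made for illustration; the code does not reproduce the paper's attack.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw logits into probabilities; temperature is a decoding hyperparameter."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def renormalize(probs, keep):
    """Zero out every token not in `keep` and rescale the rest to sum to 1."""
    kept = set(keep)
    total = sum(probs[i] for i in kept)
    return [probs[i] / total if i in kept else 0.0 for i in range(len(probs))]


def top_k_filter(probs, k):
    """Top-k: keep only the k most probable tokens."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return renormalize(probs, order[:k])


def top_p_filter(probs, p):
    """Nucleus (top-p): keep the smallest prefix of tokens whose mass reaches p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, mass = [], 0.0
    for i in order:
        keep.append(i)
        mass += probs[i]
        if mass >= p:
            break
    return renormalize(probs, keep)


def sample(probs, rng):
    """Draw one token index from the (possibly truncated) distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    # Hypothetical logits for a 4-token vocabulary.
    probs = softmax([2.0, 1.0, 0.0, -1.0], temperature=1.0)
    rng = random.Random(0)
    print("top-k (k=2):", top_k_filter(probs, 2))
    print("top-p (p=0.9):", top_p_filter(probs, 0.9))
    print("sampled token:", sample(top_k_filter(probs, 2), rng))
```

In this sketch the same model distribution yields different sampling behavior depending on `temperature`, `k`, and `p`, which is the kind of configuration the abstract describes as costly to tune.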
Pages: 1835-1849
Page count: 15