Stealing the Decoding Algorithms of Language Models

被引：0

作者：

Naseh, Ali ^{[1
]}

Krishna, Kalpesh ^{[1
]}

Iyyer, Mohit ^{[1
]}

Houmansadr, Amir ^{[1
]}

机构：

[1] Univ Massachusetts Amherst, Amherst, MA 01003 USA

来源：

PROCEEDINGS OF THE 2023 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2023 | 2023年

关键词：

Hyperparameter stealing; language models; decoding algorithms;

D O I：

10.1145/3576915.3616652

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A key component of generating text from modern language models (LM) is the selection and tuning of decoding algorithms. These algorithms determine how to generate text from the internal probability distribution generated by the LM. The process of choosing a decoding algorithm and tuning its hyperparameters takes significant time, manual effort, and computation, and it also requires extensive human evaluation. Therefore, the identity and hyperparameters of such decoding algorithms are considered to be extremely valuable to their owners. In this work, we show, for the first time, that an adversary with typical API access to an LM can steal the type and hyperparameters of its decoding algorithms at very low monetary costs. Our attack is effective against popular LMs used in text generation APIs, including GPT-2, GPT-3 and GPT-Neo. We demonstrate the feasibility of stealing such information with only a few dollars, e.g., $0.8, $1, $4, and $40 for the four versions of GPT-3.

引用

页码：1835 / 1849

页数：15

共 50 条

[21] Algorithms for decoding and interpolation
Kuijper, M
CODES, SYSTEMS, AND GRAPHICAL MODELS, 2001, 123 : 265 - 282
[22] Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Ghazvininejad, Marjan
Levy, Omer
Liu, Yinhan
Zettlemoyer, Luke
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6112 - 6121
[23] Towards better decoding and language model integration in sequence to sequence models
Chorowski, Jan
Jaitly, Navdeep
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 523 - 527
[24] Energy of Decoding Algorithms
Blake, Christopher
Kschischang, Frank R.
2013 13TH CANADIAN WORKSHOP ON INFORMATION THEORY (CWIT), 2013, : 1 - 5
[25] DECODING THE ACLS ALGORITHMS
SAVER, CL
AMERICAN JOURNAL OF NURSING, 1994, 94 (01) : 27 - 35
[26] New decoding algorithms for Hidden Markov Models using distance measures on labellings
Brown, Daniel G.
Truszkowski, Jakub
BMC BIOINFORMATICS, 2010, 11
[27] New decoding algorithms for Hidden Markov Models using distance measures on labellings
Daniel G Brown
Jakub Truszkowski
BMC Bioinformatics, 11
[28] DECODING LANGUAGE OF BEE
VONFRISCH, K
SCIENCE, 1974, 185 (4152) : 663 - 668
[29] Decoding the Language of Success
Rocklage, Matthew D.
Melumad, Shiri
ADVANCES IN CONSUMER RESEARCH, VOL L, ACR 2022, 2022, : 603 - 608
[30] Decoding the language of immunity
Alvarez, Raymond A.
James, Louisa K.
SCIENCE, 2024, 383 (6679) : 146 - 147

← 1 2 3 4 5 →