Stealing the Decoding Algorithms of Language Models

被引：0

作者：

Naseh, Ali ^{[1
]}

Krishna, Kalpesh ^{[1
]}

Iyyer, Mohit ^{[1
]}

Houmansadr, Amir ^{[1
]}

机构：

[1] Univ Massachusetts Amherst, Amherst, MA 01003 USA

来源：

PROCEEDINGS OF THE 2023 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2023 | 2023年

关键词：

Hyperparameter stealing; language models; decoding algorithms;

D O I：

10.1145/3576915.3616652

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A key component of generating text from modern language models (LM) is the selection and tuning of decoding algorithms. These algorithms determine how to generate text from the internal probability distribution generated by the LM. The process of choosing a decoding algorithm and tuning its hyperparameters takes significant time, manual effort, and computation, and it also requires extensive human evaluation. Therefore, the identity and hyperparameters of such decoding algorithms are considered to be extremely valuable to their owners. In this work, we show, for the first time, that an adversary with typical API access to an LM can steal the type and hyperparameters of its decoding algorithms at very low monetary costs. Our attack is effective against popular LMs used in text generation APIs, including GPT-2, GPT-3 and GPT-Neo. We demonstrate the feasibility of stealing such information with only a few dollars, e.g., $0.8, $1, $4, and $40 for the four versions of GPT-3.

引用

页码：1835 / 1849

页数：15

共 50 条

[1] Decoding Symbolism in Language Models
Guo, Meiqi
Hwa, Rebecca
Kovashka, Adriana
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 3311 - 3324
[2] STEALING THE LANGUAGE - OSTRIKER,AS
ALDAN, D
WORLD LITERATURE TODAY, 1987, 61 (02) : 291 - 292
[3] Data Stealing Attacks against Large Language Models via Backdooring
He, Jiaming
Hou, Guanyu
Jia, Xinyue
Chen, Yangyang
Liao, Wenqi
Zhou, Yinhang
Zhou, Rang
ELECTRONICS, 2024, 13 (14)
[4] Decoding with Shrinkage-Based Language Models
Emami, Ahmad
Chen, Stanley
Ittycheriah, Abraham
Soltau, Hagen
Zhao, Bing
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1033 - 1036
[5] Efficient Machine Translation Decoding with Slow Language Models
Emami, Ahmad
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2376 - 2379
[6] SYNDROME-DECODING ALGORITHMS FOR STATIC-DIAGNOSIS MODELS
ISHIDA, Y
TOKUMARU, H
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1987, 18 (07) : 1291 - 1304
[7] Decoding algorithms
Lomborg, Stine
Kapsch, Patrick Heiberg
MEDIA CULTURE & SOCIETY, 2020, 42 (05) : 745 - 761
[8] N-gram language models for document image decoding
Kopec, GE
Said, MR
Popat, K
DOCUMENT RECOGNITION AND RETRIEVAL IX, 2002, 4670 : 191 - 202
[9] Comparison of Diverse Decoding Methods from Conditional Language Models
Ippolito, Daphne
Kriz, Reno
Kustikova, Maria
Sedoc, Joao
Callison-Burch, Chris
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3752 - 3762
[10] Joint Discriminative Learning of Acoustic and Language Models on Decoding Graphs
Abdelhamid, Abdelaziz A.
Abdulla, Waleed H.
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,

← 1 2 3 4 5 →