Stealing the Decoding Algorithms of Language Models

Cited by: 0
Authors
Naseh, Ali [1]
Krishna, Kalpesh [1]
Iyyer, Mohit [1]
Houmansadr, Amir [1]
Affiliations
[1] University of Massachusetts Amherst, Amherst, MA 01003 USA
Keywords
Hyperparameter stealing; language models; decoding algorithms
DOI
10.1145/3576915.3616652
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A key component of generating text from modern language models (LMs) is the selection and tuning of decoding algorithms. These algorithms determine how to generate text from the internal probability distribution generated by the LM. The process of choosing a decoding algorithm and tuning its hyperparameters takes significant time, manual effort, and computation, and it also requires extensive human evaluation. Therefore, the identity and hyperparameters of such decoding algorithms are considered to be extremely valuable to their owners. In this work, we show, for the first time, that an adversary with typical API access to an LM can steal the type and hyperparameters of its decoding algorithms at very low monetary cost. Our attack is effective against popular LMs used in text generation APIs, including GPT-2, GPT-3, and GPT-Neo. We demonstrate the feasibility of stealing such information with only a few dollars, e.g., $0.8, $1, $4, and $40 for the four versions of GPT-3.
Pages: 1835-1849
Page count: 15