TraceGuard: Fine-Tuning Pre-Trained Model by Using Stego Images to Trace Its User

Cited by: 1
Authors
Zhou, Limengnan [1 ]
Ren, Xingdong [2 ]
Qian, Cheng [2 ]
Sun, Guangling [2 ]
Affiliations
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Sch Elect & Informat Engn, Zhongshan 528402, Peoples R China
[2] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
intellectual property protection; tracing pre-trained model; fine-tuning pre-trained model; steganography network; stego image; fingerprint removal attack;
DOI
10.3390/math12213333
CLC Number
O1 [Mathematics];
Discipline Code
0701; 070101;
Abstract
Owing to the rapid maturation and popularization of machine learning as a service (MLaaS), a significant number of pre-trained models are now published online to provide services to users. However, some malicious users illegally obtain pre-trained models, redeploy them, and profit from them, while most current methods focus on verifying the copyright of a model rather than tracing responsibility for a suspect model. In this study, we propose TraceGuard, the first steganography-based framework for tracing a suspect self-supervised learning (SSL) pre-trained model, which ascertains which authorized user illegally released the suspect model or whether the suspect model is independent. Concretely, the framework consists of an encoder-decoder pair and the SSL pre-trained model. Initially, the base pre-trained model is frozen, and the encoder and decoder are jointly trained so that the encoder can embed a secret key into a cover image and the decoder can extract the secret key from the embedding output by the base pre-trained model. Subsequently, the encoder and decoder are frozen, and the base pre-trained model is fine-tuned on stego images to implant a fingerprint. To ensure the effectiveness and robustness of the fingerprint and the utility of the fingerprinted pre-trained models, three alternating steps are designed: model stealing simulation, fine-tuning for uniqueness, and fine-tuning for utility. Finally, a suspect pre-trained model is traced to its user by querying it with stego images. Experimental results demonstrate that TraceGuard can reliably trace suspect models and is robust against common fingerprint removal attacks such as fine-tuning, pruning, and model stealing. In future work, we will further improve robustness against model stealing attacks.
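The abstract outlines a two-stage pipeline: first learn an encoder/decoder pair around the frozen backbone, then freeze the pair and fine-tune the backbone on stego images. The sketch below illustrates that control flow in PyTorch. It is not the authors' code: StegoEncoder, KeyDecoder, the 0.1 residual scale, the 64-bit key, the 512-dimensional embedding, and all training hyperparameters are assumptions made for illustration, and the alternating uniqueness/utility/stealing-simulation steps are omitted.

# Minimal sketch (not the authors' implementation) of the two-stage training
# the abstract describes. Assumes a PyTorch SSL backbone mapping images
# (B, 3, H, W) to embeddings (B, 512), and a key of shape (1, 64) with
# 0/1 float entries. All names and hyperparameters here are hypothetical.
import torch
import torch.nn as nn
import torch.nn.functional as F


class StegoEncoder(nn.Module):
    """Hypothetical encoder: hides a binary secret key inside a cover image."""

    def __init__(self, key_bits: int = 64):
        super().__init__()
        self.key_fc = nn.Linear(key_bits, 32 * 32)
        self.conv = nn.Sequential(
            nn.Conv2d(4, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, cover: torch.Tensor, key: torch.Tensor) -> torch.Tensor:
        # Spread the key over a spatial map, append it as a fourth channel,
        # and predict a small residual so the stego image stays close to the
        # cover image (the 0.1 scale is an assumed imperceptibility budget).
        key_map = self.key_fc(key).view(-1, 1, 32, 32)
        key_map = F.interpolate(key_map, size=cover.shape[-2:])
        residual = self.conv(torch.cat([cover, key_map], dim=1))
        return (cover + 0.1 * residual).clamp(0.0, 1.0)


class KeyDecoder(nn.Module):
    """Hypothetical decoder: recovers the key from the backbone's embedding."""

    def __init__(self, emb_dim: int = 512, key_bits: int = 64):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(emb_dim, 256), nn.ReLU(), nn.Linear(256, key_bits)
        )

    def forward(self, emb: torch.Tensor) -> torch.Tensor:
        return self.fc(emb)  # one logit per key bit


def train_codec(base, enc, dec, loader, key, steps=1000):
    """Stage 1: freeze the base pre-trained model; jointly train the encoder
    and decoder so the key can be embedded into covers and extracted from the
    frozen backbone's embeddings of the resulting stego images."""
    for p in base.parameters():
        p.requires_grad_(False)
    opt = torch.optim.Adam(
        list(enc.parameters()) + list(dec.parameters()), lr=1e-4
    )
    for _, (cover, _) in zip(range(steps), loader):
        k = key.expand(cover.size(0), -1)  # same key for every batch item
        stego = enc(cover, k)
        # Key must be recoverable, and the stego image must stay imperceptible.
        loss = (F.binary_cross_entropy_with_logits(dec(base(stego)), k)
                + F.mse_loss(stego, cover))
        opt.zero_grad()
        loss.backward()
        opt.step()


def fingerprint(base, enc, dec, loader, user_key, steps=1000):
    """Stage 2: freeze the encoder/decoder; fine-tune a per-user copy of the
    model on stego images so it responds to that user's key (the fingerprint).
    The alternating uniqueness/utility/stealing steps are omitted here."""
    for p in list(enc.parameters()) + list(dec.parameters()):
        p.requires_grad_(False)
    for p in base.parameters():
        p.requires_grad_(True)
    opt = torch.optim.Adam(base.parameters(), lr=1e-5)
    for _, (cover, _) in zip(range(steps), loader):
        k = user_key.expand(cover.size(0), -1)
        with torch.no_grad():
            stego = enc(cover, k)  # encoder is fixed in this stage
        loss = F.binary_cross_entropy_with_logits(dec(base(stego)), k)
        opt.zero_grad()
        loss.backward()
        opt.step()

At trace time, per the abstract, the owner would query the suspect model with stego images and check which authorized user's key the frozen decoder recovers from its embeddings.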
Pages: 17
Related Papers
50 records in total
  • [1] Pruning Pre-trained Language Models Without Fine-Tuning
    Jiang, Ting
    Wang, Deqing
    Zhuang, Fuzhen
    Xie, Ruobing
    Xia, Feng
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 594 - 605
  • [2] Span Fine-tuning for Pre-trained Language Models
    Bao, Rongzhou
    Zhang, Zhuosheng
    Zhao, Hai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1970 - 1979
  • [3] Sentiment Analysis Using Pre-Trained Language Model With No Fine-Tuning and Less Resource
    Kit, Yuheng
    Mokji, Musa Mohd
    IEEE ACCESS, 2022, 10 : 107056 - 107065
  • [4] Overcoming Catastrophic Forgetting for Fine-Tuning Pre-trained GANs
    Zhang, Zeren
    Li, Xingjian
    Hong, Tao
    Wang, Tianyang
    Ma, Jinwen
    Xiong, Haoyi
    Xu, Cheng-Zhong
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT V, 2023, 14173 : 293 - 308
  • [5] Waste Classification by Fine-Tuning Pre-trained CNN and GAN
    Alsabei, Amani
    Alsayed, Ashwaq
    Alzahrani, Manar
    Al-Shareef, Sarah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (08): 65 - 70
  • [6] Fine-Tuning Pre-Trained Language Models with Gaze Supervision
    Deng, Shuwen
    Prasse, Paul
    Reich, David R.
    Scheffer, Tobias
    Jäger, Lena A.
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 217 - 224
  • [7] Fine-Tuning Pre-Trained Model to Extract Undesired Behaviors from App Reviews
    Zhang, Wenyu
    Wang, Xiaojuan
    Lai, Shanyan
    Ye, Chunyang
    Zhou, Hui
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 1125 - 1134
  • [8] Fine-Tuning Pre-Trained Model for Consumer Fraud Detection from Consumer Reviews
    Tang, Xingli
    Li, Keqi
    Huang, Liting
    Zhou, Hui
    Ye, Chunyang
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT II, 2023, 14147 : 451 - 456
  • [9] Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
    Liao, Baohao
    Tan, Shaomu
    Monz, Christof
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation
    Yuan, Hongyi
    Yuan, Zheng
    Tan, Chuanqi
    Huang, Fei
    Huang, Songfang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 3246 - 3264