How to Protect Copyright Data in Optimization of Large Language Models?

被引:0
|
作者
Chu, Timothy [1 ]
Song, Zhao [2 ]
Yang, Chiwun [3 ]
机构
[1] Google, Mountain View, CA 94043 USA
[2] Adobe Res, San Jose, CA USA
[3] Sun Yat Sen Univ, Guangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) and generative AI have played a transformative role in computer research and applications. Controversy has arisen as to whether these models output copyrighted data, which can occur if the data the models are trained on is copyrighted. LLMs are built on the transformer neural network architecture, which in turn relies on a mathematical computation called Attention that uses the softmax function. In this paper, we observe that large language model training and optimization can be seen as a softmax regression problem. We then establish a method of efficiently performing softmax regression, in a way that prevents the regression function from generating copyright data. This establishes a theoretical method of training large language models in a way that avoids generating copyright data.
引用
收藏
页码:17871 / 17879
页数:9
相关论文
共 50 条
  • [1] Copyright Violations and Large Language Models
    Karamolegkou, Antonia
    Li, Jiaang
    Zhou, Li
    Sogaard, Anders
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7403 - 7412
  • [2] How Large Language Models Will Disrupt Data Management
    Fernandez, Raul Castro
    Elmore, Aaron J.
    Franklin, Michael J.
    Krishnan, Sanjay
    Tan, Chenhao
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 3302 - 3309
  • [3] Prompt Optimization in Large Language Models
    Sabbatella, Antonio
    Ponti, Andrea
    Giordani, Ilaria
    Candelieri, Antonio
    Archetti, Francesco
    MATHEMATICS, 2024, 12 (06)
  • [4] Generating Data for Symbolic Language with Large Language Models
    Ye, Jiacheng
    Li, Chengzu
    Kong, Lingpeng
    Yu, Tao
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 8418 - 8443
  • [5] Periodic watermarking for copyright protection of large language models in cloud computing security
    Ye, Pei-Gen
    Li, Zhengdao
    Yang, Zuopeng
    Chen, Pengyu
    Zhang, Zhenxin
    Li, Ning
    Zheng, Jun
    COMPUTER STANDARDS & INTERFACES, 2025, 94
  • [6] Using Large Language Models to protect information search in Multidomain Operations
    Verma, Dinesh C.
    Beymer, David
    Chowdhary, Pawan
    Kadhe, Swanand Ravindra
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [7] Demystifying Data Management for Large Language Models
    Miao, Xupeng
    Jia, Zhihao
    Cui, Bin
    COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
  • [8] How true is the role of large language models in nursing?
    Ray, Partha Pratim
    EUROPEAN JOURNAL OF CARDIOVASCULAR NURSING, 2024, 23 (05) : e79 - e80
  • [9] How to write effective prompts for large language models
    Lin, Zhicheng
    NATURE HUMAN BEHAVIOUR, 2024, 8 (4) : 611 - 615
  • [10] How to write effective prompts for large language models
    Zhicheng Lin
    Nature Human Behaviour, 2024, 8 : 611 - 615