How to Protect Copyright Data in Optimization of Large Language Models?

被引:0
|
作者
Chu, Timothy [1 ]
Song, Zhao [2 ]
Yang, Chiwun [3 ]
机构
[1] Google, Mountain View, CA 94043 USA
[2] Adobe Res, San Jose, CA USA
[3] Sun Yat Sen Univ, Guangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) and generative AI have played a transformative role in computer research and applications. Controversy has arisen as to whether these models output copyrighted data, which can occur if the data the models are trained on is copyrighted. LLMs are built on the transformer neural network architecture, which in turn relies on a mathematical computation called Attention that uses the softmax function. In this paper, we observe that large language model training and optimization can be seen as a softmax regression problem. We then establish a method of efficiently performing softmax regression, in a way that prevents the regression function from generating copyright data. This establishes a theoretical method of training large language models in a way that avoids generating copyright data.
引用
收藏
页码:17871 / 17879
页数:9
相关论文
共 50 条
  • [21] How large language models can reshape collective intelligence
    Burton, Jason W.
    Lopez-Lopez, Ezequiel
    Hechtlinger, Shahar
    Rahwan, Zoe
    Aeschbach, Samuel
    Bakker, Michiel A.
    Becker, Joshua A.
    Berditchevskaia, Aleks
    Berger, Julian
    Brinkmann, Levin
    Flek, Lucie
    Herzog, Stefan M.
    Huang, Saffron
    Kapoor, Sayash
    Narayanan, Arvind
    Nussberger, Anne-Marie
    Yasseri, Taha
    Nickl, Pietro
    Almaatouq, Abdullah
    Hahn, Ulrike
    Kurvers, Ralf H. J. M.
    Leavy, Susan
    Rahwan, Iyad
    Siddarth, Divya
    Siu, Alice
    Woolley, Anita W.
    Wulff, Dirk U.
    Hertwig, Ralph
    NATURE HUMAN BEHAVIOUR, 2024, 8 (09): : 1643 - 1655
  • [22] SEMbeddings: how to evaluate model misfit before data collection using large-language models
    Feraco, Tommaso
    Toffalini, Enrico
    FRONTIERS IN PSYCHOLOGY, 2025, 15
  • [23] How to Safely Integrate Large Language Models Into Health Care
    Gottlieb, Scott
    Silvis, Lauren
    JAMA HEALTH FORUM, 2023, 4 (09):
  • [24] How good are large language models at product risk assessment?
    Collier, Zachary A.
    Gruss, Richard J.
    Abrahams, Alan S.
    RISK ANALYSIS, 2024,
  • [25] How to Optimize Prompting for Large Language Models in Clinical Research
    Lee, Jeong Hyun
    Shin, Jaeseung
    KOREAN JOURNAL OF RADIOLOGY, 2024, 25 (10) : 869 - 873
  • [26] How artificial intelligence and large language models are revolutionizing dermatology
    Busik, Valentina
    DERMATOLOGIE, 2024, 75 (09): : 743 - 746
  • [27] How can large language models assist with a FRAM analysis?
    Sujan, M.
    Slater, D.
    Crumpton, E.
    SAFETY SCIENCE, 2025, 181
  • [28] How Useful Are Educational Questions Generated by Large Language Models?
    Elkins, Sabina
    Kochmar, Ekaterina
    Serban, Iulian
    Cheung, Jackie C. K.
    ARTIFICIAL INTELLIGENCE IN EDUCATION. POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2023, 2023, 1831 : 536 - 542
  • [29] Diagnosing infeasible optimization problems using large language models
    Chen, Hao
    Constante-Flores, Gonzalo E.
    Li, Can
    INFOR, 2024, 62 (04) : 573 - 587
  • [30] Watermark-based scheme to protect copyright of SVG data
    Zhou, Xu
    Pan, Xuezeng
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1199 - 1202