Tampering with Generative Artificial Intelligence by Jailbreaking

Cited: 0
Author
Claverini, Corrado [1 ]
Affiliation
[1] Univ Salento, Lecce, Italy
Source
TEORIA-RIVISTA DI FILOSOFIA | 2024 / Vol. 44 / No. 01
Keywords
ChatGPT; ethics of artificial intelligence; generative artificial intelligence; jailbreaking; regulation of artificial intelligence
DOI
10.4454/mg6wax06
Chinese Library Classification
B [Philosophy, Religion]
Subject Classification Code
01; 0101
Abstract
In this paper, I will analyse the risks linked to the use of generative artificial intelligence systems and the related risk-reduction strategies, concentrating in particular on the possibility of tampering with the chatbot ChatGPT by jailbreaking. After examining how a user can tamper with this generative AI through a series of prompts, bypassing its ethical and legal restrictions, I will turn my focus to the ethical issues raised by the malicious use of this technology: are the transparency requirements imposed on generative AI sufficient, or should there be tighter restrictions that do not hinder the innovation and development of these technologies? How can the risk of tampering with these AI tools be lowered? And, should a breach take place, who is responsible: the AI developer or the jailbreaker? To what extent could the changes needed to prevent jailbreaking involuntarily generate or strengthen certain biases? In conclusion, I will uphold the necessity of ethical reflection for the sustainable and "human-centric" development of AI.
Pages: 172