The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models

被引：0

作者：

School of Computer Science and Technology, Xidian University, China ^{[1
]}

机构：

来源：

arXiv |

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

共 50 条

[21] LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
Kahng, Minsuk
Tenney, Ian
Pushkarna, Mahima
Liu, Michael Xieyang
Wexler, James
Reif, Emily
Kallarackal, Krystal
Chang, Minsuk
Terry, Michael
Dixon, Lucas
[J]. EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
[22] The Dark Side of Language Models: Exploring the Potential of LLMs in Multimedia Disinformation Generation and Dissemination
Barman, Dipto
Guo, Ziyi
Conlan, Owen
[J]. MACHINE LEARNING WITH APPLICATIONS, 2024, 16
[23] Does the dark side of a calling exist? Examining potential negative effects
Duffy, Ryan D.
Douglass, Richard P.
Autin, Kelsey L.
England, Jessica
Dik, Bryan J.
[J]. JOURNAL OF POSITIVE PSYCHOLOGY, 2016, 11 (06): : 634 - 646
[24] Generative AI and large language models in health care: pathways to implementation
Marium M. Raza
Kaushik P. Venkatesh
Joseph C. Kvedar
[J]. npj Digital Medicine, 7
[25] Generative AI and large language models in health care: pathways to implementation
Raza, Marium M.
Venkatesh, Kaushik P.
Kvedar, Joseph C.
[J]. NPJ DIGITAL MEDICINE, 2024, 7 (01)
[26] DarkBERT: A Language Model for the Dark Side of the Internet
Jin, Youngjin
Jang, Eugene
Cui, Jian
Chung, Jin-Woo
Lee, Yongjae
Shin, Seungwon
[J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7515 - 7533
[27] Hate speech. The dark side of language
Del Bo, Corrado
[J]. RIVISTA DI FILOSOFIA, 2021, 112 (02) : 312 - 313
[28] Large Language Models are Not Models of Natural Language: They are Corpus Models
Veres, Csaba
[J]. IEEE ACCESS, 2022, 10 : 61970 - 61979
[29] Large Language Models
Vargas, Diego Collarana
Katsamanis, Nassos
[J]. ERCIM NEWS, 2024, (136): : 12 - 13
[30] Large Language Models
Cerf, Vinton G.
[J]. COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 7 - 7

← 1 2 3 4 5 →