The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models

被引:0
|
作者
School of Computer Science and Technology, Xidian University, China [1 ]
机构
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [21] LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
    Kahng, Minsuk
    Tenney, Ian
    Pushkarna, Mahima
    Liu, Michael Xieyang
    Wexler, James
    Reif, Emily
    Kallarackal, Krystal
    Chang, Minsuk
    Terry, Michael
    Dixon, Lucas
    [J]. EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [22] The Dark Side of Language Models: Exploring the Potential of LLMs in Multimedia Disinformation Generation and Dissemination
    Barman, Dipto
    Guo, Ziyi
    Conlan, Owen
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2024, 16
  • [23] Does the dark side of a calling exist? Examining potential negative effects
    Duffy, Ryan D.
    Douglass, Richard P.
    Autin, Kelsey L.
    England, Jessica
    Dik, Bryan J.
    [J]. JOURNAL OF POSITIVE PSYCHOLOGY, 2016, 11 (06): : 634 - 646
  • [24] Generative AI and large language models in health care: pathways to implementation
    Marium M. Raza
    Kaushik P. Venkatesh
    Joseph C. Kvedar
    [J]. npj Digital Medicine, 7
  • [25] Generative AI and large language models in health care: pathways to implementation
    Raza, Marium M.
    Venkatesh, Kaushik P.
    Kvedar, Joseph C.
    [J]. NPJ DIGITAL MEDICINE, 2024, 7 (01)
  • [26] DarkBERT: A Language Model for the Dark Side of the Internet
    Jin, Youngjin
    Jang, Eugene
    Cui, Jian
    Chung, Jin-Woo
    Lee, Yongjae
    Shin, Seungwon
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7515 - 7533
  • [27] Hate speech. The dark side of language
    Del Bo, Corrado
    [J]. RIVISTA DI FILOSOFIA, 2021, 112 (02) : 312 - 313
  • [28] Large Language Models are Not Models of Natural Language: They are Corpus Models
    Veres, Csaba
    [J]. IEEE ACCESS, 2022, 10 : 61970 - 61979
  • [29] Large Language Models
    Vargas, Diego Collarana
    Katsamanis, Nassos
    [J]. ERCIM NEWS, 2024, (136): : 12 - 13
  • [30] Large Language Models
    Cerf, Vinton G.
    [J]. COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 7 - 7