Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge

被引:0
|
作者
Maloyan, Narek
Verma, Ekansh
Nutfullin, Bulat
Ashinov, Bislan
机构
来源
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Computational linguistics - Malware
引用
下载
收藏
相关论文
共 50 条
  • [1] The Trojan Detection Challenge
    Mazeika, Mantas
    Hendrycks, Dan
    Li, Huichen
    Xu, Xiaojun
    Hough, Sidney
    Zou, Andy
    Rajabi, Arezoo
    Yao, Qi
    Wang, Zihao
    Tian, Jian
    Tang, Yao
    Tang, Di
    Smirnov, Roman
    Pleskov, Pavel
    Benkovich, Nikita
    Song, Dawn
    Poovendran, Radha
    Li, Bo
    Forsyth, David
    Proceedings of Machine Learning Research, 2023, 220 : 279 - 291
  • [2] Occlusion-Based Detection of Trojan-Triggering Inputs in Large Language Models of Code
    Hussain, Aftab
    Rabin, Md Rafiqul Islam
    Ahmed, Tofique
    Alipour, Mohammad Amin
    Xu, Bowen
    SSRN,
  • [3] Insights into the Mind of a Trojan Designer The Challenge to Integrate a Trojan into the Bitstream
    Ender, Maik
    Swierczynski, Pawel
    Wallat, Sebastian
    Wilhelm, Matthias
    Knopp, Paul Martin
    Paar, Christof
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 112 - 119
  • [4] Odyssey: Creation, Analysis and Detection of Trojan Models
    Edraki, Marzieh
    Karim, Nazmul
    Rahnavard, Nazanin
    Mian, Ajmal
    Shah, Mubarak
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4521 - 4533
  • [5] Hardware Trojan Detection
    Alluhaib, Ghalia
    Aldissi, Hanan
    Alqarni, Rasha
    Banafee, Shoroq
    Nagro, Wafaa
    Aljandali, Asia
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (03): : 141 - 147
  • [6] Hardware trojan detection
    Case Western Reserve University, Cleveland, United States
    Introduction to Hardw. Secty. and Trust, (339-364):
  • [7] Advances in Hardware Trojan Detection
    Vaidya, Jaideep
    COMPUTER, 2019, 52 (06) : 4 - 5
  • [8] On Reliability Trojan Injection and Detection
    Sreedhar, Aswin
    Kundu, Sandip
    Koren, Israel
    JOURNAL OF LOW POWER ELECTRONICS, 2012, 8 (05) : 674 - 683
  • [9] Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
    Sarihi, Amin
    Patooghy, Ahmad
    Jamieson, Peter
    Badawy, Abdel-Hameed A.
    arXiv, 2023,
  • [10] Trojan playground: a reinforcement learning framework for hardware Trojan insertion and detection
    Sarihi, Amin
    Patooghy, Ahmad
    Jamieson, Peter
    Badawy, Abdel-Hameed A.
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (10): : 14295 - 14329