Towards Robust Models of Code via Energy-Based Learning on Auxiliary Datasets

被引:0
|
作者
Bui, Nghi D. Q. [1 ,2 ]
Yu, Yijun [2 ]
机构
[1] Singapore Management Univ, Singapore, Singapore
[2] Huawei Ireland Res Ctr, Dublin, Ireland
关键词
D O I
10.1145/3551349.3561171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing approaches to improving the robustness of source code models concentrate on recognizing adversarial samples rather than valid samples that fall outside of a given distribution, which we refer to as out-of-distribution (OOD) samples. To this end, we propose to use an auxiliary dataset (out-of-distribution) such that, when trained together with the main dataset, they will enhance the model's robustness. We adapt energy-bounded learning objective function to assign a higher score to in-distribution samples and a lower score to out-of-distribution samples in order to incorporate such out-of-distribution samples into the training process of source code models. In terms of OOD detection and adversarial samples detection, our evaluation results demonstrate a greater robustness for existing source code models to become more accurate at recognizing OOD data while being more resistant to adversarial attacks at the same time.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
    Dai, Hanjun
    Singh, Rishabh
    Dai, Bo
    Sutton, Charles
    Schuurmans, Dale
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Towards understanding retrosynthesis by energy-based models
    Sun, Ruoxi
    Dai, Hanjun
    Li, Li
    Kearnes, Steven
    Dai, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [3] Rate -Distortion via Energy-Based Models
    Li, Qing
    Kim, Yongjune
    Guyot, Cyril
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 351 - 351
  • [4] Learning Energy-Based Models with Adversarial Training
    Yin, Xuwang
    Li, Shiying
    Rohde, Gustavo K.
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 209 - 226
  • [5] Auxiliary Training: Towards Accurate and Robust Models
    Zhang, Linfeng
    Yu, Muzhou
    Chen, Tong
    Shi, Zuoqiang
    Bao, Chenglong
    Ma, Kaisheng
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 369 - 378
  • [6] Towards Bridging the Performance Gaps of Joint Energy-based Models
    Yang, Xiulong
    Su, Qing
    Ji, Shihao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15732 - 15741
  • [7] Learning Kernel Stein Discrepancy for Training Energy-Based Models
    Niu, Lu
    Li, Shaobo
    Li, Zhenping
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [8] Unsupervised Similarity Learning for Image Registration with Energy-Based Models
    Grzech, Daniel
    Le Folgoc, Loic
    Azampour, Mohammad Farid
    Vlontzos, Athanasios
    Glocker, Ben
    Navab, Nassir
    Schnabel, Julia
    Kainz, Bernhard
    BIOMEDICAL IMAGE REGISTRATION, WBIR 2024, 2025, 15249 : 229 - 240
  • [9] Learning Multimodal Latent Generative Models with Energy-Based Prior
    Yuan, Shiyu
    Cui, Jiali
    Li, Hanao
    Han, Tian
    COMPUTER VISION - ECCV 2024, PT LXXXIX, 2025, 15147 : 86 - 100
  • [10] Shared Prior Learning of Energy-Based Models for Image Reconstruction
    Pinetz, Thomas
    Kobler, Erich
    Pock, Thomas
    Effland, Alexander
    SIAM JOURNAL ON IMAGING SCIENCES, 2021, 14 (04): : 1706 - 1748