The Role of ViT Design and Training in Robustness to Common Corruptions

被引:0
|
作者
Tian, Rui [1 ,2 ]
Wu, Zuxuan [1 ,2 ]
Dai, Qi [3 ]
Goldblum, Micah [4 ]
Hu, Han
Jiang, Yu-Gang [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 201203, Peoples R China
[2] Shanghai Collaborat Innovat Ctr Intelligent Visual, Shanghai 201203, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
[4] NYU, Ctr Data Sci, New York, NY 10012 USA
关键词
Robustness; Training; Transformers; Data augmentation; Benchmark testing; Accuracy; Noise; Computer vision; Standards; Resilience; Common corruptions; robustness; vision transformer;
D O I
10.1109/TMM.2024.3521721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vision transformer (ViT) variants have made rapid advances on a variety of computer vision tasks. However, their performance on corrupted inputs, which are inevitable in realistic use cases due to variations in lighting and weather, has not been explored comprehensively. In this paper, we probe the robustness gap among ViT variants and ask how these modern architectural developments affect performance under common types of corruption. Through extensive and rigorous benchmarking, we demonstrate that simple architectural designs such as overlapping patch embedding and convolutional feed-forward networks can promote the robustness of ViTs. Moreover, since the de facto training of ViTs relies heavily on data augmentation, exactly which augmentation strategies make ViTs more robust is worth investigating. We survey the efficacy of previous methods and verify that adversarial noise training is powerful. In addition, we introduce a novel conditional method for generating dynamic augmentation parameters conditioned on input images, which offers state-of-the-art robustness to common corruptions.
引用
收藏
页码:1374 / 1385
页数:12
相关论文
共 50 条
  • [21] An analysis of ConformalLayers' robustness to corruptions in natural images
    Sousa, Eduardo Vera
    Vasconcelos, Cristina Nader
    Fernandes, Leandro A. F.
    PATTERN RECOGNITION LETTERS, 2023, 166 : 190 - 197
  • [22] Exploring the Role of Feedback Inhibition for the Robustness Against Corruptions on Event-Based Data
    Larisch, Rene
    Berger, Lucien
    Hamker, Fred H.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 197 - 208
  • [23] On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness
    Mintun, Eric
    Kirillov, Alexander
    Xie, Saining
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [24] Robustness of ConvNet to High-Frequency Image Corruptions
    Banerjee, Arnab
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 1 - 12
  • [25] Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions
    Guo, Yong
    Stutz, David
    Schiele, Bernt
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4108 - 4118
  • [26] Benchmarking Object Detection Robustness against Real-World Corruptions
    Liu, Jiawei
    Wang, Zhijie
    Ma, Lei
    Fang, Chunrong
    Bai, Tongtong
    Zhang, Xufan
    Liu, Jia
    Chen, Zhenyu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4398 - 4416
  • [27] 3D Common Corruptions and Data Augmentation
    Kar, Oguzhan Fatih
    Yeo, Teresa
    Atanov, Andrei
    Zamir, Amir
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18941 - 18952
  • [28] Effective and Robust Adversarial Training Against Data and Label Corruptions
    Zhang, Peng-Fei
    Huang, Zi
    Xu, Xin-Shun
    Bai, Guangdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9477 - 9488
  • [29] Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions
    Tian, Zhiqiang
    Li, Weigang
    Hu, Junwei
    Deng, Chunhua
    INFORMATION SCIENCES, 2023, 648
  • [30] Training a Lightweight ViT Network for Image Retrieval
    Zhang, Hanqi
    Yu, Yunlong
    Li, Yingming
    Zhang, Zhongfei
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 240 - 250