The Role of ViT Design and Training in Robustness to Common Corruptions

被引:0
|
作者
Tian, Rui [1 ,2 ]
Wu, Zuxuan [1 ,2 ]
Dai, Qi [3 ]
Goldblum, Micah [4 ]
Hu, Han
Jiang, Yu-Gang [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 201203, Peoples R China
[2] Shanghai Collaborat Innovat Ctr Intelligent Visual, Shanghai 201203, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
[4] NYU, Ctr Data Sci, New York, NY 10012 USA
关键词
Robustness; Training; Transformers; Data augmentation; Benchmark testing; Accuracy; Noise; Computer vision; Standards; Resilience; Common corruptions; robustness; vision transformer;
D O I
10.1109/TMM.2024.3521721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vision transformer (ViT) variants have made rapid advances on a variety of computer vision tasks. However, their performance on corrupted inputs, which are inevitable in realistic use cases due to variations in lighting and weather, has not been explored comprehensively. In this paper, we probe the robustness gap among ViT variants and ask how these modern architectural developments affect performance under common types of corruption. Through extensive and rigorous benchmarking, we demonstrate that simple architectural designs such as overlapping patch embedding and convolutional feed-forward networks can promote the robustness of ViTs. Moreover, since the de facto training of ViTs relies heavily on data augmentation, exactly which augmentation strategies make ViTs more robust is worth investigating. We survey the efficacy of previous methods and verify that adversarial noise training is powerful. In addition, we introduce a novel conditional method for generating dynamic augmentation parameters conditioned on input images, which offers state-of-the-art robustness to common corruptions.
引用
收藏
页码:1374 / 1385
页数:12
相关论文
共 50 条
  • [41] Informational robustness of common belief in rationality
    Ziegler, Gabriel
    GAMES AND ECONOMIC BEHAVIOR, 2022, 132 : 592 - 597
  • [42] THE CONDUCTING SYSTEM IN THE HEART OF THE COMMON SEAL (PHOCA-VITULINA-VIT)
    VANNIE, CJ
    JOURNAL OF ANATOMY, 1983, 137 (SEP) : 439 - 439
  • [43] Design for reliability and robustness
    Cheng, Tim
    IEEE Design and Test of Computers, 2009, 26 (06): : 2 - 3
  • [44] Robustness in analog design
    De Mey, M
    ANALOG CIRCUIT DESIGN: FRACTIONAL-N SYNTHESIZERS, DESIGN FOR ROBUSTNESS, LINE AND BUS DRIVERS, 2003, : 243 - 253
  • [45] Robustness in SOC design
    Waldschmidt, Klaus
    Damm, Markus
    DSD 2006: 9TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN: ARCHITECTURES, METHODS AND TOOLS, PROCEEDINGS, 2006, : 27 - +
  • [46] Robustness in Experiment Design
    Rojas, Cristian R.
    Agueero, Juan-Carlos
    Welsh, James S.
    Goodwin, Graham C.
    Feuer, Arie
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (04) : 860 - 874
  • [47] Mechanical design for robustness
    Zhu, Xue-jun
    Wang, An-lin
    Huang, Hong-zhong
    Jixie Kexue Yu Jishu/Mechanical Science and Technology, 2000, 19 (02): : 230 - 233
  • [48] Design for reliability and robustness
    Cheng, Tim
    IEEE DESIGN & TEST OF COMPUTERS, 2009, 26 (06): : 2 - 3
  • [49] Robust Design for Robustness of Design Variables
    Kobayashi, Takahisa
    Arakawa, Masao
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 979 - 983
  • [50] A training framework for stack and Boolean filtering - Fast optimal design procedures and robustness case study
    Tabus, I
    Petrescu, D
    Gabbouj, M
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1996, 5 (06) : 809 - 826