The Role of ViT Design and Training in Robustness to Common Corruptions

被引:0
|
作者
Tian, Rui [1 ,2 ]
Wu, Zuxuan [1 ,2 ]
Dai, Qi [3 ]
Goldblum, Micah [4 ]
Hu, Han
Jiang, Yu-Gang [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 201203, Peoples R China
[2] Shanghai Collaborat Innovat Ctr Intelligent Visual, Shanghai 201203, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
[4] NYU, Ctr Data Sci, New York, NY 10012 USA
关键词
Robustness; Training; Transformers; Data augmentation; Benchmark testing; Accuracy; Noise; Computer vision; Standards; Resilience; Common corruptions; robustness; vision transformer;
D O I
10.1109/TMM.2024.3521721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vision transformer (ViT) variants have made rapid advances on a variety of computer vision tasks. However, their performance on corrupted inputs, which are inevitable in realistic use cases due to variations in lighting and weather, has not been explored comprehensively. In this paper, we probe the robustness gap among ViT variants and ask how these modern architectural developments affect performance under common types of corruption. Through extensive and rigorous benchmarking, we demonstrate that simple architectural designs such as overlapping patch embedding and convolutional feed-forward networks can promote the robustness of ViTs. Moreover, since the de facto training of ViTs relies heavily on data augmentation, exactly which augmentation strategies make ViTs more robust is worth investigating. We survey the efficacy of previous methods and verify that adversarial noise training is powerful. In addition, we introduce a novel conditional method for generating dynamic augmentation parameters conditioned on input images, which offers state-of-the-art robustness to common corruptions.
引用
收藏
页码:1374 / 1385
页数:12
相关论文
共 50 条
  • [31] Pyramid Adversarial Training Improves ViT Performance
    Herrmann, Charles
    Sargent, Kyle
    Jiang, Lu
    Zabih, Ramin
    Chang, Huiwen
    Liu, Ce
    Krishnan, Dilip
    Sun, Deqing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13409 - 13419
  • [32] Are Adversarial Robustness and Common Perturbation Robustness Independant Attributes ?
    Laugros, Alfred
    Caplier, Alice
    Ospici, Matthieu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1045 - 1054
  • [33] Common Factors' Role in Accredited MFT Training Programs
    D'Aniello, Carissa
    Fife, Stephen T.
    JOURNAL OF MARITAL AND FAMILY THERAPY, 2017, 43 (04) : 591 - 604
  • [34] DESIGN FOR ROBUSTNESS
    Takacs, Peter F.
    PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-BRIDGE ENGINEERING, 2010, 163 (03) : 156 - 156
  • [35] Robustness by Design
    Braubach, Lars
    Schulz, Theresa
    Jander, Kai
    INTELLIGENT DISTRIBUTED COMPUTING XVI, IDC 2023, 2024, 1138 : 267 - 284
  • [36] Role of Connectors in Corporate Fraud and Corruptions in Era of Circular Economy
    Nagnonhou, Salomon Ricardo Bignon
    Imoniana, Joshua Onome
    Reginato, Luciane
    Silva, Washington Lopes
    SOCIAL SCIENCES-BASEL, 2023, 12 (03):
  • [37] Common Genetic Variant in VIT Is Associated with Human Brain Asymmetry
    Tadayon, Sayed H.
    Vaziri-Pashkam, Maryam
    Kahali, Pegah
    Dezfouli, Mitra Ansari
    Abbassian, Abdolhossein
    FRONTIERS IN HUMAN NEUROSCIENCE, 2016, 10
  • [38] The protective role of Vit C and Vit C-containing foods in head and neck cancer
    Li, Gaofeng
    Liu, Dingsheng
    INTERNATIONAL JOURNAL OF CLINICAL AND EXPERIMENTAL MEDICINE, 2018, 11 (01): : 347 - 353
  • [39] On the Convergence and Robustness of Adversarial Training
    Wang, Yisen
    Ma, Xingjun
    Bailey, James
    Yi, Jinfeng
    Zhou, Bowen
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [40] Classification robustness to common optical aberrations
    Muller, Patrick
    Braun, Alexander
    Keuper, Margret
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3634 - 3645