Exploring the Robustness of Human Parsers Toward Common Corruptions

被引:0
|
作者
Zhang, Sanyi [1 ,2 ]
Cao, Xiaochun [3 ]
Wang, Rui [1 ,2 ]
Qi, Guo-Jun [4 ,5 ]
Zhou, Jie [6 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[4] Westlake Univ, Sch Engn, Hangzhou 310030, Peoples R China
[5] OPPO US Res Ctr, Bellevue, WA 98004 USA
[6] Tsinghua Univ, Beijing Res Ctr Informat Sci & Technol BNRist, Dept Automat, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Robustness; Data models; Task analysis; Computational modeling; Benchmark testing; Semantics; Data augmentation; Human parsing; model robustness; heterogeneous augmentation; common corruptions;
D O I
10.1109/TIP.2023.3313493
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human parsing aims to segment each pixel of the human image with fine-grained semantic categories. However, current human parsers trained with clean data are easily confused by numerous image corruptions such as blur and noise. To improve the robustness of human parsers, in this paper, we construct three corruption robustness benchmarks, termed LIP-C, ATR-C, and Pascal-Person-Part-C, to assist us in evaluating the risk tolerance of human parsing models. Inspired by the data augmentation strategy, we propose a novel heterogeneous augmentation-enhanced mechanism to bolster robustness under commonly corrupted conditions. Specifically, two types of data augmentations from different views, i.e., image-aware augmentation and model-aware image-to-image transformation, are integrated in a sequential manner for adapting to unforeseen image corruptions. The image-aware augmentation can enrich the high diversity of training images with the help of common image operations. The model-aware augmentation strategy that improves the diversity of input data by considering the model's randomness. The proposed method is model-agnostic, and it can plug and play into arbitrary state-of-the-art human parsing frameworks. The experimental results show that the proposed method demonstrates good universality which can improve the robustness of the human parsing models and even the semantic segmentation models when facing various image common corruptions. Meanwhile, it can still obtain approximate performance on clean data.
引用
收藏
页码:5394 / 5407
页数:14
相关论文
共 50 条
  • [21] An analysis of ConformalLayers' robustness to corruptions in natural images
    Sousa, Eduardo Vera
    Vasconcelos, Cristina Nader
    Fernandes, Leandro A. F.
    PATTERN RECOGNITION LETTERS, 2023, 166 : 190 - 197
  • [22] On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness
    Mintun, Eric
    Kirillov, Alexander
    Xie, Saining
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Robustness of ConvNet to High-Frequency Image Corruptions
    Banerjee, Arnab
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 1 - 12
  • [24] Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions
    Guo, Yong
    Stutz, David
    Schiele, Bernt
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4108 - 4118
  • [25] On the Effectiveness of Adversarial Training Against Common Corruptions
    Kireev, Klim
    Andriushchenko, Maksym
    Flammarion, Nicolas
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1012 - 1021
  • [26] Benchmarking Object Detection Robustness against Real-World Corruptions
    Liu, Jiawei
    Wang, Zhijie
    Ma, Lei
    Fang, Chunrong
    Bai, Tongtong
    Zhang, Xufan
    Liu, Jia
    Chen, Zhenyu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4398 - 4416
  • [27] 3D Common Corruptions and Data Augmentation
    Kar, Oguzhan Fatih
    Yeo, Teresa
    Atanov, Andrei
    Zamir, Amir
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18941 - 18952
  • [28] Comparing Italian parsers on a common treebank: the Evalita experience
    Bosco, C.
    Mazzei, A.
    Lombardo, V.
    Attardi, G.
    Corazza, A.
    Lavelli, A.
    Lesmo, L.
    Satta, G.
    Simi, M.
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2066 - 2073
  • [29] Exploring CNN features in the context of adversarial robustness and human perception
    Casamassima, Enzo
    Herbert, Andrew
    Merkel, Cory
    APPLICATIONS OF MACHINE LEARNING 2021, 2021, 11843
  • [30] A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial Examples
    Wang, Yuxuan
    Che, Wanxiang
    Titov, Ivan
    Cohen, Shay B.
    Lei, Zhilin
    Liu, Ting
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2344 - 2354