Improving Adversarial Robustness via Unlabeled Out-of-Domain Data

被引:0
|
作者
Deng, Zhun [1 ]
Zhang, Linjun [2 ]
Ghorbani, Amirata [3 ]
Zou, James [3 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] Rutgers State Univ, New Brunswick, NJ USA
[3] Stanford Univ, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation by incorporating cheap unlabeled data from multiple domains is a powerful way to improve prediction especially when there is limited labeled data. In this work, we investigate how adversarial robustness can be enhanced by leveraging out-of-domain unlabeled data. We demonstrate that for broad classes of distributions and classifiers, there exists a sample complexity gap between standard and robust classification. We quantify the extent to which this gap can be bridged by leveraging unlabeled samples from a shifted domain by providing both upper and lower bounds. Moreover, we show settings where we achieve better adversarial robustness when the unlabeled data come from a shifted domain rather than the same domain as the labeled data. We also investigate how to leverage out-of-domain data when some structural information, such as sparsity, is shared between labeled and unlabeled domains. Experimentally, we augment object recognition datasets (CIFAR10, CINIC-10, and SVHN) with easy-to-obtain and unlabeled out-of-domain data and demonstrate substantial improvement in the model's robustness against `1 adversarial attacks on the original domain.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] In and Out-of-Domain Text Adversarial Robustness via Label Smoothing
    Yang, Yahan
    Dan, Soham
    Roth, Dan
    Lee, Insup
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 657 - 669
  • [2] Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
    Gokhale, Tejas
    Mishra, Swaroop
    Luo, Man
    Sachdeva, Bhavdeep Singh
    Baral, Chitta
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2705 - 2718
  • [3] OodGAN: Generative Adversarial Network for Out-of-Domain Data Generation
    Marek, Petr
    Naik, Vishal Ishwar
    Auvray, Vincent
    Goyal, Anuj
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 238 - 245
  • [4] SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness
    Ng, Nathan
    Cho, Kyunghyun
    Ghassemi, Marzyeh
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1268 - 1283
  • [5] Unlabeled Data Improves Adversarial Robustness
    Carmon, Yair
    Raghunathan, Aditi
    Schmidt, Ludwig
    Liang, Percy
    Duchi, John C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Practical and Efficient Out-of-Domain Detection with Adversarial Learning
    Wang, Bo
    Mine, Tsunenori
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 853 - 862
  • [7] Out-of-domain Detection based on Generative Adversarial Network
    Ryu, Seonghan
    Koo, Sangjun
    Yu, Hwanjo
    Lee, Gary Geunbae
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 714 - 718
  • [8] Pretraining boosts out-of-domain robustness for pose estimation
    Mathis, Alexander
    Biasi, Thomas
    Schneider, Steffen
    Yuksekgonul, Mert
    Rogers, Byron
    Bethge, Matthias
    Mathis, Mackenzie W.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1858 - 1867
  • [9] Cross-domain Paraphrasing For Improving Language Modelling Using Out-of-domain Data
    Liu, X.
    Gales, M. J. F.
    Woodland, P. C.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3391 - 3395
  • [10] Adversarial Self-Supervised Learning for Out-of-Domain Detection
    Zeng, Zhiyuan
    He, Keqing
    Yan, Yuanmeng
    Xu, Hong
    Xu, Weiran
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5631 - 5639