MixStyle Neural Networks for Domain Generalization and Adaptation

Cited by: 7
Authors
Zhou, Kaiyang [1]
Yang, Yongxin [2]
Qiao, Yu [3, 4]
Xiang, Tao [5]
Affiliations
[1] Hong Kong Baptist Univ, Hong Kong, Peoples R China
[2] Queen Mary Univ London, London, England
[3] Chinese Acad Sci, Shanghai AI Lab, Shenzhen, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[5] Univ Surrey, Guildford, England
DOI
10.1007/s11263-023-01913-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Neural networks do not generalize well to unseen data with domain shifts, a longstanding problem in machine learning and AI. To overcome the problem, we propose MixStyle, a simple plug-and-play, parameter-free module that can improve domain generalization performance without the need to collect more data or increase model capacity. The design of MixStyle is simple: it mixes the feature statistics of two random instances in a single forward pass during training. The idea is grounded in the finding from recent style transfer research that feature statistics capture image style information, which essentially defines visual domains. Therefore, mixing feature statistics can be seen as an efficient way to synthesize new domains in the feature space, thus achieving data augmentation. MixStyle is easy to implement with a few lines of code, does not require modification to training objectives, and can fit a variety of learning paradigms, including supervised domain generalization, semi-supervised domain generalization, and unsupervised domain adaptation. Our experiments show that MixStyle can significantly boost out-of-distribution generalization performance across a wide range of tasks, including image recognition, instance retrieval, and reinforcement learning. The source code is released at https://github.com/KaiyangZhou/mixstyle-release.
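The mixing step described in the abstract can be sketched roughly as follows. This is an illustrative, dependency-free sketch and not the authors' released code (see the repository linked above for that). Several details are assumptions that the abstract does not state: the "feature statistics" are taken to be the per-channel mean and standard deviation of each instance, the partner instance is chosen by shuffling the batch, and the mixing weight is drawn from a Beta(alpha, alpha) distribution. The names mixstyle, channel_stats, alpha and eps are hypothetical.

```python
import random
from statistics import fmean, pvariance


def channel_stats(channel, eps=1e-6):
    """Return (mean, std) over one channel's spatial activations."""
    mu = fmean(channel)
    # eps keeps the standard deviation strictly positive (assumed value).
    sigma = (pvariance(channel, mu) + eps) ** 0.5
    return mu, sigma


def mixstyle(batch, alpha=0.1, rng=None):
    """Mix per-channel feature statistics between random pairs of instances.

    batch: list of instances; each instance is a list of channels; each
    channel is a flat list of spatial activations (floats).
    alpha: Beta distribution parameter (assumed, not given in the abstract).
    Intended for training only; at test time features pass through unchanged.
    """
    rng = rng or random.Random()
    # Pair every instance with a random partner by shuffling batch indices.
    perm = list(range(len(batch)))
    rng.shuffle(perm)

    mixed = []
    for i, instance in enumerate(batch):
        partner = batch[perm[i]]
        lam = rng.betavariate(alpha, alpha)  # per-instance mixing weight
        new_instance = []
        for chan, partner_chan in zip(instance, partner):
            mu, sigma = channel_stats(chan)
            mu_p, sigma_p = channel_stats(partner_chan)
            # Interpolate the two instances' statistics.
            mu_mix = lam * mu + (1 - lam) * mu_p
            sigma_mix = lam * sigma + (1 - lam) * sigma_p
            # Normalize with the instance's own statistics, then apply
            # the mixed statistics.
            new_instance.append(
                [sigma_mix * (v - mu) / sigma + mu_mix for v in chan]
            )
        mixed.append(new_instance)
    return mixed


# Toy batch: two instances, one channel each, four spatial positions.
features = [
    [[1.0, 2.0, 3.0, 4.0]],
    [[10.0, 20.0, 30.0, 40.0]],
]
augmented = mixstyle(features, rng=random.Random(0))
```

Because the sketch only rescales and shifts activations, it adds no learnable parameters, which is consistent with the abstract's "parameter-free" description. In a real network the same operation would presumably run on batched tensors between convolutional layers.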
Pages: 822 - 836
Page count: 15
Related papers
  • [1] MixStyle Neural Networks for Domain Generalization and Adaptation
    Kaiyang Zhou
    Yongxin Yang
    Yu Qiao
    Tao Xiang
    International Journal of Computer Vision, 2024, 132 : 822 - 836
  • [2] Alleviating the generalization issue in adversarial domain adaptation networks
    Zhe, Xiao
    Du, Zhekai
    Lou, Chunwei
    Li, Jingjing
    IMAGE AND VISION COMPUTING, 2023, 135
  • [3] DOMAIN GENERALIZATION METHOD FOR PERSON RE-ID USING METABIN AND MIXSTYLE
    Park, Sungyeon
    Shin, Hyunhak
    Yun, Sangbin
    Yang, Seongyeop
    Lim, Jeongeun
    Noh, Seungin
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023 : 860 - 864
  • [4] Binary Domain Generalization for Sparsifying Binary Neural Networks
    Schiavone, Riccardo
    Galati, Francesco
    Zuluaga, Maria A.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 123 - 140
  • [5] Meta Convolutional Neural Networks for Single Domain Generalization
    Wan, Chaoqun
    Shen, Xu
    Zhang, Yonggang
    Yin, Zhiheng
    Tian, Xinmei
    Gao, Feng
    Huang, Jianqiang
    Hua, Xian-Sheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 4672 - 4681
  • [6] Gated Convolutional Neural Networks for Domain Adaptation
    Madasu, Avinash
    Rao, Vijjini Anvesh
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 118 - 130
  • [7] Generalization, Adaptation and Low-Rank Representation in Neural Networks
    Oymak, Samet
    Fabian, Zalan
    Li, Mingchen
    Soltanolkotabi, Mandi
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019 : 581 - 585
  • [8] Domain adversarial neural networks for domain generalization: when it works and how to improve
    Anthony Sicilia
    Xingchen Zhao
    Seong Jae Hwang
    Machine Learning, 2023, 112 : 2685 - 2721
  • [9] Domain adversarial neural networks for domain generalization: when it works and how to improve
    Sicilia, Anthony
    Zhao, Xingchen
    Hwang, Seong Jae
    MACHINE LEARNING, 2023, 112 (07) : 2685 - 2721
  • [10] Event Detection and Domain Adaptation with Convolutional Neural Networks
    Thien Huu Nguyen
    Grishman, Ralph
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015 : 365 - 371