MixStyle Neural Networks for Domain Generalization and Adaptation

Cited by: 7
Authors
Zhou, Kaiyang [1]
Yang, Yongxin [2]
Qiao, Yu [3, 4]
Xiang, Tao [5]
Affiliations
[1] Hong Kong Baptist Univ, Hong Kong, Peoples R China
[2] Queen Mary Univ London, London, England
[3] Chinese Acad Sci, Shanghai AI Lab, Shenzhen, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[5] Univ Surrey, Guildford, England
DOI
10.1007/s11263-023-01913-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Neural networks do not generalize well to unseen data with domain shifts, a longstanding problem in machine learning and AI. To overcome the problem, we propose MixStyle, a simple, plug-and-play, parameter-free module that can improve domain generalization performance without the need to collect more data or increase model capacity. The design of MixStyle is simple: it mixes the feature statistics of two random instances in a single forward pass during training. The idea is grounded in the finding from recent style transfer research that feature statistics capture image style information, which essentially defines visual domains. Therefore, mixing feature statistics can be seen as an efficient way to synthesize new domains in the feature space, thus achieving data augmentation. MixStyle is easy to implement with a few lines of code, does not require modification to training objectives, and can fit a variety of learning paradigms including supervised domain generalization, semi-supervised domain generalization, and unsupervised domain adaptation. Our experiments show that MixStyle can significantly boost out-of-distribution generalization performance across a wide range of tasks including image recognition, instance retrieval, and reinforcement learning. The source code is released at https://github.com/KaiyangZhou/mixstyle-release.
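The statistic-mixing step described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' released implementation. It assumes that "feature statistics" means the per-channel mean and standard deviation of each instance, and that the mixing weight is drawn from a Beta(alpha, alpha) distribution; neither detail is stated in the abstract. The names `channel_stats`, `mix_style`, `alpha`, and `rng` are hypothetical.

```python
import math
import random


def channel_stats(channel, eps=1e-6):
    """Return (mean, std) of one flattened feature channel."""
    mean = sum(channel) / len(channel)
    var = sum((v - mean) ** 2 for v in channel) / len(channel)
    return mean, math.sqrt(var + eps)


def mix_style(batch, alpha=0.1, rng=None):
    """Mix per-channel feature statistics between random pairs of instances.

    batch: list of instances; each instance is a list of channels;
           each channel is a flat list of floats (spatial positions).
    alpha: Beta(alpha, alpha) parameter for the mixing weight (an assumption).
    """
    rng = rng or random.Random()
    partners = list(range(len(batch)))
    rng.shuffle(partners)  # pick a random partner for every instance

    mixed = []
    for i, instance in enumerate(batch):
        lam = rng.betavariate(alpha, alpha)  # one mixing weight per instance
        partner = batch[partners[i]]
        out_instance = []
        for c, channel in enumerate(instance):
            mu, sigma = channel_stats(channel)
            mu2, sigma2 = channel_stats(partner[c])
            # Convex combination of the two instances' statistics.
            mu_mix = lam * mu + (1 - lam) * mu2
            sigma_mix = lam * sigma + (1 - lam) * sigma2
            # Normalize with the instance's own statistics, then apply
            # the mixed statistics as the new scale and shift.
            out_instance.append(
                [sigma_mix * (v - mu) / sigma + mu_mix for v in channel]
            )
        mixed.append(out_instance)
    return mixed


if __name__ == "__main__":
    # Two instances, one channel each, with very different statistics.
    features = [[[0.0, 1.0, 2.0, 3.0]], [[10.0, 20.0, 30.0, 40.0]]]
    print(mix_style(features, rng=random.Random(0)))
```

Each output channel keeps the relative ordering of its input values but takes a mean and standard deviation that lie between those of the two paired instances. In a real network this step would be applied to intermediate feature maps during training only and skipped at inference.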
Pages: 822-836
Number of pages: 15