Mixture-of-Experts Learner for Single Long-Tailed Domain Generalization

Cited by: 6
Authors
Wang, Mengzhu [1 ]
Yuan, Jianlong [1 ]
Wang, Zhibin [1 ]
Affiliations
[1] Alibaba Group, Beijing, People's Republic of China
Keywords
Domain Generalization; Mixture-of-Experts Learner; Saliency Map; Mutual Learning
DOI
10.1145/3581783.3611871
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Domain generalization (DG) refers to the task of training a model on multiple source domains and testing it on a target domain whose distribution differs from theirs. In this paper, we address a more challenging and realistic scenario, Single Long-Tailed Domain Generalization, in which only one source domain is available and the classes that are in the minority in this domain have abundant instances in other domains. To tackle this task, we propose a novel approach called Mixture-of-Experts Learner for Single Long-Tailed Domain Generalization (MoEL), which comprises two key strategies. The first is a simple yet effective data augmentation technique that uses saliency maps to identify important regions of the original images and preserves those regions during augmentation. The second is a new skill-diverse expert learning approach that trains multiple experts on a single long-tailed source domain and uses mutual learning to aggregate their learned knowledge for the unknown target domain. We evaluate our method on several benchmark datasets, including Digits-DG, CIFAR-10-C, PACS, and DomainNet, and show that it outperforms previous single domain generalization methods. We also conduct an ablation study to illustrate the inner workings of our approach.
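The abstract names the two strategies but gives no implementation details, so the following is only an illustrative sketch and not the authors' method. It assumes a gradient-magnitude map as a stand-in for a real saliency model, Gaussian noise as a stand-in for the augmentation, and a pairwise KL divergence between expert predictions as one common form of mutual learning. All function names and parameters (`gradient_saliency`, `saliency_preserving_augment`, `mutual_learning_loss`, `keep_fraction`, `noise_std`) are hypothetical.

```python
import numpy as np


def gradient_saliency(image):
    """Crude saliency proxy (assumption): gradient magnitude scaled to [0, 1].

    image: float array of shape (H, W, C) with values in [0, 1].
    """
    gray = image.mean(axis=2)
    grad_y, grad_x = np.gradient(gray)
    magnitude = np.sqrt(grad_x ** 2 + grad_y ** 2)
    peak = magnitude.max()
    return magnitude / peak if peak > 0 else magnitude


def saliency_preserving_augment(image, saliency, rng,
                                keep_fraction=0.3, noise_std=0.2):
    """Add Gaussian noise everywhere except on the most salient pixels.

    Returns the augmented image and the boolean (H, W) mask of kept pixels.
    """
    threshold = np.quantile(saliency, 1.0 - keep_fraction)
    keep = saliency >= threshold
    noise = rng.normal(0.0, noise_std, size=image.shape)
    noisy = np.clip(image + noise, 0.0, 1.0)
    # Salient pixels are copied from the original; the rest are perturbed.
    augmented = np.where(keep[..., None], image, noisy)
    return augmented, keep


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def mutual_learning_loss(expert_logits):
    """Mean KL(p_i || p_j) over all ordered pairs of distinct experts.

    expert_logits: array of shape (num_experts, batch, num_classes).
    """
    num_experts = expert_logits.shape[0]
    if num_experts < 2:
        raise ValueError("mutual learning needs at least two experts")
    probs = softmax(expert_logits)
    log_probs = np.log(probs + 1e-12)
    total, pairs = 0.0, 0
    for i in range(num_experts):
        for j in range(num_experts):
            if i != j:
                kl = (probs[i] * (log_probs[i] - log_probs[j])).sum(axis=-1)
                total += kl.mean()
                pairs += 1
    return total / pairs
```

In this sketch, the augmented image would be fed to each expert, and the mutual learning term would be added to each expert's own classification loss so that the experts' predictions are pulled toward one another. How MoEL actually weights or combines these terms is not stated in the abstract.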
Pages: 290-299
Page count: 10