Category-Stitch Learning for Union Domain Generalization

被引:4
|
作者
Liu, Yajing [1 ]
Xiong, Zhiwei [1 ]
Li, Ya [2 ]
Lu, Yuning [1 ]
Tian, Xinmei [1 ]
Zha, Zheng-Jun [1 ]
机构
[1] Univ Sci & Technol China, 443 Huangshan Rd, Hefei 230027, Peoples R China
[2] iFLYTEK Res, 666 Wangjiang West Rd, Hefei 230088, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Domain generalization; variational autoencoder; generator;
D O I
10.1145/3524136
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain generalization aims at generalizing the network trained on multiple domains to unknown but related domains. Under the assumption that different domains share the same classes, previous works can build relationships across domains. However, in realistic scenarios, the change of domains is always followed by the change of categories, which raises a difficulty for collecting sufficient aligned categories across domains. Bearing this in mind, this article introduces union domain generalization (UDG) as a new domain generalization scenario, in which the label space varies across domains, and the categories in unknown domains belong to the union of all given domain categories. The absence of categories in given domains is the main obstacle to aligning different domain distributions and obtaining domain-invariant information. To address this problem, we propose category-stitch learning (CSL), which aims at jointly learning the domain-invariant information and completing missing categories in all domains through an improved variational autoencoder and generators. The domain-invariant information extraction and sample generation cross-promote each other to better generalizability. Additionally, we decouple category and domain information and propose explicitly regularizing the semantic information by the classification loss with transferred samples. Thus our method can breakthrough the category limit and generate samples of missing categories in each domain. Extensive experiments and visualizations are conducted on MNIST, VLCS, PACS, Office-Home, and DomainNet datasets to demonstrate the effectiveness of our proposed method.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Generalization in category learning: One and two category problems
    Wills, AJ
    McLaren, IPL
    PROCEEDINGS OF THE NINETEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1997, : 1088 - 1088
  • [2] Curriculum learning-based domain generalization for cross-domain fault diagnosis with category shift
    Wang, Yu
    Gao, Jie
    Wang, Wei
    Yang, Xu
    Du, Jinsong
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 212
  • [3] EFFECTS OF CATEGORY SIZE ON CATEGORIZATION LEARNING AND GENERALIZATION
    BHATT, RS
    WASSERMAN, EA
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 508 - 508
  • [4] Effects of category diversity on learning, memory, and generalization
    Ulrike Hahn
    Todd M. Bailey
    Lucy B. C. Elvin
    Memory & Cognition, 2005, 33 : 289 - 302
  • [5] Effects of category diversity on learning, memory, and generalization
    Hahn, U
    Bailey, TM
    Elvin, LBC
    MEMORY & COGNITION, 2005, 33 (02) : 289 - 302
  • [6] Age effects on category learning, categorical perception, and generalization
    Bowman, Caitlin R.
    Ashby, Stefania R.
    Zeithamova, Dagmar
    MEMORY, 2022, 30 (03) : 230 - 247
  • [7] The distributional properties of exemplars affect category learning and generalization
    Paulo F. Carvalho
    Chi-hsin Chen
    Chen Yu
    Scientific Reports, 11
  • [8] Generalization in Category Learning: The Roles of Representational and Decisional Uncertainty
    Seger, Carol A.
    Braunlich, Kurt
    Wehe, Hillary S.
    Liu, Zhiya
    JOURNAL OF NEUROSCIENCE, 2015, 35 (23): : 8802 - 8812
  • [9] Amnesic Patients Show Superior Generalization in Category Learning
    O'Connell, Garret
    Myers, Catherine E.
    Hopkins, Ramona O.
    McLaren, R. P.
    Gluck, Mark A.
    Wills, Andy J.
    NEUROPSYCHOLOGY, 2016, 30 (08) : 915 - 919
  • [10] Generalization and exclusive allocation of credit in unsupervised category learning
    Marshall, JA
    Gupta, VS
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1998, 9 (02) : 279 - 302