Visual representations with texts domain generalization for semantic segmentation

被引:0
|
作者
Wanlin Yue
Zhiheng Zhou
Yinglie Cao
Weikang Wu
机构
[1] South China University of Technology,School of Electronics and Information
[2] Guangzhou City University of Technology,School of Electronic and Information Engineering
[3] The 54th Research Institute of China Electronics Technology Group Corporation,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Domain generalization; Semantic segmentation; Cross-modal;
D O I
暂无
中图分类号
学科分类号
摘要
At present, Domain generalization for semantic segmentation relying on deep neural networks has made little progress. Most of the current methods are mainly divided into domain randomization, standardization, and whitening. We propose a novel approach to achieve domain generalization for semantic segmentation: leveraging cross-modal information to supervise the model training and improve the generalization ability of the network. We align visual features with textual features in a subspace and enhance the contrast between categories. Our method enables the network to learn rich semantic knowledge from text features and clearer category boundaries. Our experiments also prove that our method can effectively improve the generalization ability of the network. We are the first to exploit multi-modal information for domain-generalized semantic segmentation.
引用
收藏
页码:30069 / 30079
页数:10
相关论文
共 50 条
  • [1] Visual representations with texts domain generalization for semantic segmentation
    Yue, Wanlin
    Zhou, Zhiheng
    Cao, Yinglie
    Wu, Weikang
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30069 - 30079
  • [2] Grounding Visual Representations with Texts for Domain Generalization
    Min, Seonwoo
    Park, Nokyung
    Kim, Siwon
    Park, Seunghyun
    Kim, Jinkyu
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 37 - 53
  • [3] Learning generalized visual relations for domain generalization semantic segmentation
    Li, Zijun
    Liao, Muxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [4] Domain generalization for semantic segmentation: a survey
    Rafi, Taki Hasan
    Mahjabin, Ratul
    Ghosh, Emon
    Ko, Young-Woong
    Lee, Jeong-Gun
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [5] Single Domain Generalization for LiDAR Semantic Segmentation
    Kim, Hyeonseong
    Kang, Yoonsu
    Oh, Changgyoon
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17587 - 17598
  • [6] Augmentation-based Domain Generalization for Semantic Segmentation
    Schwonberg, Manuel
    El Bouazati, Fadoua
    Schmidt, Nico M.
    Gottschalk, Hanno
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [7] Class-discriminative domain generalization for semantic segmentation
    Liao, Muxin
    Tian, Shishun
    Zhang, Yuhang
    Hua, Guoguang
    You, Rong
    Zou, Wenbin
    Li, Xia
    IMAGE AND VISION COMPUTING, 2025, 154
  • [8] A Study of RobustNet, a Domain Generalization Method for Semantic Segmentation
    Bou, Xavier
    IMAGE PROCESSING ON LINE, 2022, 12 : 469 - 479
  • [9] Concept-guided domain generalization for semantic segmentation
    Liao, Muxin
    Li, Wei
    Yin, Chengle
    Jin, Yuling
    Peng, Yingqiong
    PATTERN RECOGNITION, 2025, 164
  • [10] Domain-invariant information aggregation for domain generalization semantic segmentation
    Liao, Muxin
    Tian, Shishun
    Zhang, Yuhang
    Hua, Guoguang
    Zou, Wenbin
    Li, Xia
    NEUROCOMPUTING, 2023, 546