Visual representations with texts domain generalization for semantic segmentation

被引:0
|
作者
Wanlin Yue
Zhiheng Zhou
Yinglie Cao
Weikang Wu
机构
[1] South China University of Technology,School of Electronics and Information
[2] Guangzhou City University of Technology,School of Electronic and Information Engineering
[3] The 54th Research Institute of China Electronics Technology Group Corporation,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Domain generalization; Semantic segmentation; Cross-modal;
D O I
暂无
中图分类号
学科分类号
摘要
At present, Domain generalization for semantic segmentation relying on deep neural networks has made little progress. Most of the current methods are mainly divided into domain randomization, standardization, and whitening. We propose a novel approach to achieve domain generalization for semantic segmentation: leveraging cross-modal information to supervise the model training and improve the generalization ability of the network. We align visual features with textual features in a subspace and enhance the contrast between categories. Our method enables the network to learn rich semantic knowledge from text features and clearer category boundaries. Our experiments also prove that our method can effectively improve the generalization ability of the network. We are the first to exploit multi-modal information for domain-generalized semantic segmentation.
引用
收藏
页码:30069 / 30079
页数:10
相关论文
共 50 条
  • [21] Transportable Representations for Domain Generalization
    Jalaldoust, Kasra
    Bareinboim, Elias
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12790 - 12800
  • [22] Identity representations in visual texts
    Hayik, Rawia
    INTERNATIONAL JOURNAL OF RESEARCH & METHOD IN EDUCATION, 2012, 35 (03) : 293 - 309
  • [23] Robust internal representations for domain generalization
    Rostami, Mohammad
    AI MAGAZINE, 2023, 44 (04) : 467 - 481
  • [24] ClusterFit: Improving Generalization of Visual Representations
    Yan, Xueting
    Misra, Ishan
    Gupta, Abhinav
    Ghadiyaram, Deepti
    Mahajan, Dhruv
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6508 - 6517
  • [25] Source-free domain adaptation for semantic image segmentation using internal representations
    Stan, Serban
    Rostami, Mohammad
    FRONTIERS IN BIG DATA, 2024, 7
  • [26] Segmentation of Argumentative Texts with Contextualised Word Representations
    Petasis, Georgios
    6TH WORKSHOP ON ARGUMENT MINING (ARGMINING 2019), 2019, : 1 - 10
  • [27] Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation
    Liao, Muxin
    Tian, Shishun
    Zhang, Yuhang
    Hua, Guoguang
    Zou, Wenbin
    Li, Xia
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2199 - 2210
  • [28] Kill Two Birds with One Stone: Domain Generalization for Semantic Segmentation via Network Pruning
    Luo, Yawei
    Liu, Ping
    Yang, Yi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 335 - 352
  • [29] ImDeeplabV3plus with instance selective whitening loss in domain generalization semantic segmentation
    Zhang, You
    Chen, Houjin
    Li, Yanfeng
    Zhou, Junqi
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (01) : 180 - 192
  • [30] Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving
    Schwonberg, Manuel
    Niemeijer, Joshua
    Termohlen, Jan-Aike
    Schafer, Jorg P.
    Schmidt, Nico M.
    Gottschalk, Hanno
    Fingscheidt, Tim
    IEEE ACCESS, 2023, 11 : 54296 - 54336