Visual representations with texts domain generalization for semantic segmentation

被引:1
|
作者
Yue, Wanlin [1 ]
Zhou, Zhiheng [1 ]
Cao, Yinglie [2 ]
Wu, Weikang [3 ]
机构
[1] South China Univ Technol, Sch Elect & Informat, Guangzhou 510640, Peoples R China
[2] Guangzhou City Univ Technol, Sch Elect & Informat Engn, Guangzhou 510850, Peoples R China
[3] 54th Res Inst China Elect Technol Grp Corp, Shijiazhuang 050050, Peoples R China
关键词
Domain generalization; Semantic segmentation; Cross-modal;
D O I
10.1007/s10489-023-05125-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, Domain generalization for semantic segmentation relying on deep neural networks has made little progress. Most of the current methods are mainly divided into domain randomization, standardization, and whitening. We propose a novel approach to achieve domain generalization for semantic segmentation: leveraging cross-modal information to supervise the model training and improve the generalization ability of the network. We align visual features with textual features in a subspace and enhance the contrast between categories. Our method enables the network to learn rich semantic knowledge from text features and clearer category boundaries. Our experiments also prove that our method can effectively improve the generalization ability of the network. We are the first to exploit multi-modal information for domain-generalized semantic segmentation.
引用
收藏
页码:30069 / 30079
页数:11
相关论文
共 50 条
  • [1] Visual representations with texts domain generalization for semantic segmentation
    Wanlin Yue
    Zhiheng Zhou
    Yinglie Cao
    Weikang Wu
    [J]. Applied Intelligence, 2023, 53 : 30069 - 30079
  • [2] Grounding Visual Representations with Texts for Domain Generalization
    Min, Seonwoo
    Park, Nokyung
    Kim, Siwon
    Park, Seunghyun
    Kim, Jinkyu
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 37 - 53
  • [4] Domain generalization for semantic segmentation: a survey
    Rafi, Taki Hasan
    Mahjabin, Ratul
    Ghosh, Emon
    Ko, Young-Woong
    Lee, Jeong-Gun
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [5] Single Domain Generalization for LiDAR Semantic Segmentation
    Kim, Hyeonseong
    Kang, Yoonsu
    Oh, Changgyoon
    Yoon, Kuk-Jin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17587 - 17598
  • [6] Augmentation-based Domain Generalization for Semantic Segmentation
    Schwonberg, Manuel
    El Bouazati, Fadoua
    Schmidt, Nico M.
    Gottschalk, Hanno
    [J]. 2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [7] A Study of RobustNet, a Domain Generalization Method for Semantic Segmentation
    Bou, Xavier
    [J]. IMAGE PROCESSING ON LINE, 2022, 12 : 469 - 479
  • [8] Domain-invariant information aggregation for domain generalization semantic segmentation
    Liao, Muxin
    Tian, Shishun
    Zhang, Yuhang
    Hua, Guoguang
    Zou, Wenbin
    Li, Xia
    [J]. NEUROCOMPUTING, 2023, 546
  • [9] March on Data Imperfections: Domain Division and Domain Generalization for Semantic Segmentation
    Xu, Hai
    Xie, Hongtao
    Zha, Zheng-Jun
    Liu, Sun-ao
    Zhang, Yongdong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3044 - 3053
  • [10] Unseen Domain Generalization for Prostate MRI Segmentation via Disentangled Representations
    Lu, Ye
    Xing, Xiaohan
    Meng, Max Q. -H.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1986 - 1991