Visual representations with texts domain generalization for semantic segmentation

被引：0

作者：

Wanlin Yue

Zhiheng Zhou

Yinglie Cao

Weikang Wu

机构：

[1] South China University of Technology,School of Electronics and Information

[2] Guangzhou City University of Technology,School of Electronic and Information Engineering

[3] The 54th Research Institute of China Electronics Technology Group Corporation,undefined

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Domain generalization; Semantic segmentation; Cross-modal;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

At present, Domain generalization for semantic segmentation relying on deep neural networks has made little progress. Most of the current methods are mainly divided into domain randomization, standardization, and whitening. We propose a novel approach to achieve domain generalization for semantic segmentation: leveraging cross-modal information to supervise the model training and improve the generalization ability of the network. We align visual features with textual features in a subspace and enhance the contrast between categories. Our method enables the network to learn rich semantic knowledge from text features and clearer category boundaries. Our experiments also prove that our method can effectively improve the generalization ability of the network. We are the first to exploit multi-modal information for domain-generalized semantic segmentation.

引用

页码：30069 / 30079

页数：10

共 50 条

[1] Visual representations with texts domain generalization for semantic segmentation
Yue, Wanlin
Zhou, Zhiheng
Cao, Yinglie
Wu, Weikang
APPLIED INTELLIGENCE, 2023, 53 (24) : 30069 - 30079
[2] Grounding Visual Representations with Texts for Domain Generalization
Min, Seonwoo
Park, Nokyung
Kim, Siwon
Park, Seunghyun
Kim, Jinkyu
COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 37 - 53
[3] Learning generalized visual relations for domain generalization semantic segmentation
Li, Zijun
Liao, Muxin
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
[4] Domain generalization for semantic segmentation: a survey
Rafi, Taki Hasan
Mahjabin, Ratul
Ghosh, Emon
Ko, Young-Woong
Lee, Jeong-Gun
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
[5] Single Domain Generalization for LiDAR Semantic Segmentation
Kim, Hyeonseong
Kang, Yoonsu
Oh, Changgyoon
Yoon, Kuk-Jin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17587 - 17598
[6] Augmentation-based Domain Generalization for Semantic Segmentation
Schwonberg, Manuel
El Bouazati, Fadoua
Schmidt, Nico M.
Gottschalk, Hanno
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
[7] Class-discriminative domain generalization for semantic segmentation
Liao, Muxin
Tian, Shishun
Zhang, Yuhang
Hua, Guoguang
You, Rong
Zou, Wenbin
Li, Xia
IMAGE AND VISION COMPUTING, 2025, 154
[8] A Study of RobustNet, a Domain Generalization Method for Semantic Segmentation
Bou, Xavier
IMAGE PROCESSING ON LINE, 2022, 12 : 469 - 479
[9] Concept-guided domain generalization for semantic segmentation
Liao, Muxin
Li, Wei
Yin, Chengle
Jin, Yuling
Peng, Yingqiong
PATTERN RECOGNITION, 2025, 164
[10] Domain-invariant information aggregation for domain generalization semantic segmentation
Liao, Muxin
Tian, Shishun
Zhang, Yuhang
Hua, Guoguang
Zou, Wenbin
Li, Xia
NEUROCOMPUTING, 2023, 546

← 1 2 3 4 5 →