Visual representations with texts domain generalization for semantic segmentation

被引：0

作者：

Wanlin Yue

Zhiheng Zhou

Yinglie Cao

Weikang Wu

机构：

[1] South China University of Technology,School of Electronics and Information

[2] Guangzhou City University of Technology,School of Electronic and Information Engineering

[3] The 54th Research Institute of China Electronics Technology Group Corporation,undefined

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Domain generalization; Semantic segmentation; Cross-modal;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

At present, Domain generalization for semantic segmentation relying on deep neural networks has made little progress. Most of the current methods are mainly divided into domain randomization, standardization, and whitening. We propose a novel approach to achieve domain generalization for semantic segmentation: leveraging cross-modal information to supervise the model training and improve the generalization ability of the network. We align visual features with textual features in a subspace and enhance the contrast between categories. Our method enables the network to learn rich semantic knowledge from text features and clearer category boundaries. Our experiments also prove that our method can effectively improve the generalization ability of the network. We are the first to exploit multi-modal information for domain-generalized semantic segmentation.

引用

页码：30069 / 30079

页数：10

共 50 条

[21] Transportable Representations for Domain Generalization
Jalaldoust, Kasra
Bareinboim, Elias
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12790 - 12800
[22] Identity representations in visual texts
Hayik, Rawia
INTERNATIONAL JOURNAL OF RESEARCH & METHOD IN EDUCATION, 2012, 35 (03) : 293 - 309
[23] Robust internal representations for domain generalization
Rostami, Mohammad
AI MAGAZINE, 2023, 44 (04) : 467 - 481
[24] ClusterFit: Improving Generalization of Visual Representations
Yan, Xueting
Misra, Ishan
Gupta, Abhinav
Ghadiyaram, Deepti
Mahajan, Dhruv
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6508 - 6517
[25] Source-free domain adaptation for semantic image segmentation using internal representations
Stan, Serban
Rostami, Mohammad
FRONTIERS IN BIG DATA, 2024, 7
[26] Segmentation of Argumentative Texts with Contextualised Word Representations
Petasis, Georgios
6TH WORKSHOP ON ARGUMENT MINING (ARGMINING 2019), 2019, : 1 - 10
[27] Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation
Liao, Muxin
Tian, Shishun
Zhang, Yuhang
Hua, Guoguang
Zou, Wenbin
Li, Xia
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2199 - 2210
[28] Kill Two Birds with One Stone: Domain Generalization for Semantic Segmentation via Network Pruning
Luo, Yawei
Liu, Ping
Yang, Yi
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 335 - 352
[29] ImDeeplabV3plus with instance selective whitening loss in domain generalization semantic segmentation
Zhang, You
Chen, Houjin
Li, Yanfeng
Zhou, Junqi
IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (01) : 180 - 192
[30] Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving
Schwonberg, Manuel
Niemeijer, Joshua
Termohlen, Jan-Aike
Schafer, Jorg P.
Schmidt, Nico M.
Gottschalk, Hanno
Fingscheidt, Tim
IEEE ACCESS, 2023, 11 : 54296 - 54336

← 1 2 3 4 5 →