Semantic combined network for zero-shot scene parsing

被引:2
|
作者
Wang, Yinduo [1 ]
Zhang, Haofeng [1 ]
Wang, Shidong [2 ]
Long, Yang [3 ]
Yang, Longzhi [4 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
[3] Univ Durham, Dept Comp Sci, Durham, England
[4] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
基金
中国国家自然科学基金; 英国医学研究理事会;
关键词
object recognition; unsupervised learning; learning (artificial intelligence); natural language processing; object detection; zero-shot scene parsing; image-based scene parsing; training set; discrete labels; meaningless labels; target domains; semantic combined network; SCN; scene parsing model; semantic embeddings; traditional fully supervised scene parsing methods; generalised ZSSP settings; state-of-the-art scenes; traditional fully supervised setting; original network models;
D O I
10.1049/iet-ipr.2019.0870
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, image-based scene parsing has attracted increasing attention due to its wide application. However, conventional models can only be valid on images with the same domain of the training set and are typically trained using discrete and meaningless labels. Inspired by the traditional zero-shot learning methods which employ auxiliary side information to bridge the source and target domains, the authors propose a novel framework called semantic combined network (SCN), which aims at learning a scene parsing model only from the images of the seen classes while targeting on the unseen ones. In addition, with the assistance of semantic embeddings of classes, the proposed SCN can further improve the performances of traditional fully supervised scene parsing methods. Extensive experiments are conducted on the data set Cityscapes, and the results show that the proposed SCN can perform well on both zero-shot scene parsing (ZSSP) and generalised ZSSP settings based on several state-of-the-art scenes parsing architectures. Furthermore, the authors test the proposed model under the traditional fully supervised setting and the results show that the proposed SCN can also significantly improve the performances of the original network models.
引用
收藏
页码:757 / 765
页数:9
相关论文
共 50 条
  • [31] Semantic softmax loss for zero-shot learning
    Ji, Zhong
    Sun, Yuxin
    Yu, Yunlong
    Guo, Jichang
    Pang, Yanwei
    NEUROCOMPUTING, 2018, 316 : 369 - 375
  • [32] Robust deep alignment network with remote sensing knowledge graph for zero-shot and generalized zero-shot remote sensing image scene classification
    Li, Yansheng
    Kong, Deyu
    Zhang, Yongjun
    Tan, Yihua
    Chen, Ling
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 179 : 145 - 158
  • [33] On The Ingredients of an Effective Zero-shot Semantic Parser
    Yin, Pengcheng
    Wieting, John
    Sil, Avirup
    Neubig, Graham
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1455 - 1474
  • [34] Recursive Training for Zero-Shot Semantic Segmentation
    Wang, Ce
    Farazi, Moshiur
    Barnes, Nick
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [35] Semantic Parsing by Large Language Models for Intricate Updating Strategies of Zero-Shot Dialogue State Tracking
    Wu, Yuxiang
    Dong, Guanting
    Xu, Weiran
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11093 - 11099
  • [36] Deep Semantic-Visual Alignment for zero-shot remote sensing image scene classification
    Xu, Wenjia
    Wang, Jiuniu
    Wei, Zhiwei
    Peng, Mugen
    Wu, Yirong
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 198 : 140 - 152
  • [37] Deep semantic-aware network for zero-shot visual urban perception
    Chunyun Zhang
    Tianze Wu
    Yunfeng Zhang
    Baolin Zhao
    Tingwen Wang
    Chaoran Cui
    Yilong Yin
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1197 - 1211
  • [38] Augmented semantic feature based generative network for generalized zero-shot learning
    Li, Zhiqun
    Chen, Qiong
    Liu, Qingfa
    NEURAL NETWORKS, 2021, 143 : 1 - 11
  • [39] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [40] Zero-Shot Multi-object Scene Completion
    Iwase, Shun
    Liu, Katherine
    Guizilini, Vitor
    Gaidon, Adrien
    Kitani, Kris
    Ambrus, Rares
    Zakharov, Sergey
    COMPUTER VISION - ECCV 2024, PT III, 2025, 15061 : 96 - 113