Semantic combined network for zero-shot scene parsing

Cited by: 2
Authors
Wang, Yinduo [1]
Zhang, Haofeng [1]
Wang, Shidong [2]
Long, Yang [3]
Yang, Longzhi [4]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
[3] Univ Durham, Dept Comp Sci, Durham, England
[4] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
Funding
National Natural Science Foundation of China; UK Medical Research Council;
Keywords
object recognition; unsupervised learning; learning (artificial intelligence); natural language processing; object detection; zero-shot scene parsing; image-based scene parsing; training set; discrete labels; meaningless labels; target domains; semantic combined network; SCN; scene parsing model; semantic embeddings; traditional fully supervised scene parsing methods; generalised ZSSP settings; state-of-the-art scenes; traditional fully supervised setting; original network models;
DOI
10.1049/iet-ipr.2019.0870
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, image-based scene parsing has attracted increasing attention due to its wide range of applications. However, conventional models are valid only on images from the same domain as the training set and are typically trained using discrete and meaningless labels. Inspired by traditional zero-shot learning methods, which employ auxiliary side information to bridge the source and target domains, the authors propose a novel framework called semantic combined network (SCN), which aims to learn a scene parsing model only from images of the seen classes while targeting the unseen ones. In addition, with the assistance of semantic embeddings of classes, the proposed SCN can further improve the performance of traditional fully supervised scene parsing methods. Extensive experiments are conducted on the Cityscapes data set, and the results show that the proposed SCN performs well in both zero-shot scene parsing (ZSSP) and generalised ZSSP settings based on several state-of-the-art scene parsing architectures. Furthermore, the authors test the proposed model under the traditional fully supervised setting, and the results show that the proposed SCN can also significantly improve the performance of the original network models.
Pages: 757 - 765
Page count: 9
Related papers
50 in total
  • [41] ADAPTIVE MULTI-SCALE SEMANTIC FUSION NETWORK FOR ZERO-SHOT LEARNING
    Song, Jing
    Peng, Peixi
    Zhai, Yunpeng
    Zhang, Chong
    Tian, Yonghong
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [42] Deep semantic-aware network for zero-shot visual urban perception
    Zhang, Chunyun
    Wu, Tianze
    Zhang, Yunfeng
    Zhao, Baolin
    Wang, Tingwen
    Cui, Chaoran
    Yin, Yilong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (05) : 1197 - 1211
  • [43] Spatiotemporal visual-semantic embedding network for zero-shot action recognition
    An, Rongqiao
    Miao, Zhenjiang
    Li, Qingyu
    Xu, Wanru
    Zhang, Qiang
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
  • [44] Iterative Zero-Shot Localization via Semantic-Assisted Location Network
    Yang, Yukun
    Zhao, Liang
    Liu, Xiangdong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 5974 - 5981
  • [45] Generative Zero-shot Network Quantization
    He, Xiangyu
    Lu, Jiahao
    Xu, Weixiang
    Hu, Qinghao
    Wang, Peisong
    Cheng, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2994 - 3005
  • [46] Intent Focused Semantic Parsing and Zero-Shot Learning for Out-of-Domain Detection in Spoken Language Understanding
    Kumar, Niraj
    Baghel, Bhiman Kumar
    IEEE ACCESS, 2021, 9 (09) : 165786 - 165794
  • [47] Few-Shot and Zero-Shot Semantic Segmentation for Food Images
    Honbu, Yuma
    Yanai, Keiji
    PROCEEDINGS OF THE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA FOR COOKING AND EATING ACTIVITIES (CEA '21), 2021, : 25 - 28
  • [48] Zero-Shot Audio Classification Via Semantic Embeddings
    Xie, Huang
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1233 - 1242
  • [49] A meaningful learning method for zero-shot semantic segmentation
    Liu, Xianglong
    Bai, Shihao
    An, Shan
    Wang, Shuo
    Liu, Wei
    Zhao, Xiaowei
    Ma, Yuqing
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (11)
  • [50] SEMANTIC AUGMENTATION HASHING FOR ZERO-SHOT IMAGE RETRIEVAL
    Zhong, Fangming
    Chen, Zhikui
    Min, Geyong
    Xia, Feng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1943 - 1947