Semantic combined network for zero-shot scene parsing

Cited by: 2
Authors
Wang, Yinduo [1]
Zhang, Haofeng [1]
Wang, Shidong [2]
Long, Yang [3]
Yang, Longzhi [4]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
[3] Univ Durham, Dept Comp Sci, Durham, England
[4] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
Funding
National Natural Science Foundation of China; UK Medical Research Council
Keywords
object recognition; unsupervised learning; learning (artificial intelligence); natural language processing; object detection; zero-shot scene parsing; image-based scene parsing; training set; discrete labels; meaningless labels; target domains; semantic combined network; SCN; scene parsing model; semantic embeddings; traditional fully supervised scene parsing methods; generalised ZSSP settings; state-of-the-art scenes; traditional fully supervised setting; original network models;
DOI
10.1049/iet-ipr.2019.0870
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, image-based scene parsing has attracted increasing attention due to its wide range of applications. However, conventional models are valid only on images from the same domain as the training set and are typically trained using discrete, semantically meaningless labels. Inspired by traditional zero-shot learning methods, which employ auxiliary side information to bridge the source and target domains, the authors propose a novel framework called the semantic combined network (SCN), which aims to learn a scene parsing model from images of the seen classes only while targeting the unseen ones. In addition, with the assistance of semantic embeddings of classes, the proposed SCN can further improve the performance of traditional fully supervised scene parsing methods. Extensive experiments are conducted on the Cityscapes data set, and the results show that the proposed SCN performs well in both the zero-shot scene parsing (ZSSP) and generalised ZSSP settings when built on several state-of-the-art scene parsing architectures. Furthermore, the authors test the proposed model under the traditional fully supervised setting, and the results show that the proposed SCN can also significantly improve the performance of the original network models.
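The abstract does not describe the internals of SCN, so the following is only a generic sketch of the idea it builds on: labelling each pixel by matching it against class semantic embeddings. It assumes a hypothetical network has already mapped every pixel into the same space as the class embeddings. The function names, the 3-dimensional vectors, and the class list are illustrative assumptions, not the authors' implementation. The ZSSP and generalised ZSSP settings are read here in the conventional zero-shot sense: candidates are the unseen classes only, or the seen and unseen classes together.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def label_pixels(pixel_embeddings, class_embeddings, candidates):
    """Assign each pixel the candidate class with the most similar embedding."""
    return [
        [max(candidates, key=lambda c: cosine(px, class_embeddings[c]))
         for px in row]
        for row in pixel_embeddings
    ]

# Hypothetical 3-d semantic embeddings; real ones would come from a word-vector model.
class_embeddings = {
    "road":  [1.0, 0.0, 0.0],   # seen during training
    "car":   [0.0, 1.0, 0.0],   # seen during training
    "truck": [0.0, 0.9, 0.4],   # unseen: no labelled pixels at training time
}
seen, unseen = ["road", "car"], ["truck"]

# A 1x2 "image" of per-pixel embeddings, assumed to be produced by the parsing network.
pixel_embeddings = [[[0.9, 0.1, 0.0], [0.1, 0.8, 0.5]]]

zssp = label_pixels(pixel_embeddings, class_embeddings, unseen)          # unseen only
gzssp = label_pixels(pixel_embeddings, class_embeddings, seen + unseen)  # seen + unseen
print(zssp, gzssp)
```

Because "truck" sits close to "car" in the embedding space, a model trained only on seen classes can still place truck pixels near the right class vector, which is the sense in which side information bridges the source and target domains.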
Pages: 757-765
Page count: 9