Towards Cross-View Consistency in Semantic Segmentation While Varying View Direction

被引:0
|
作者
Tong, Xin [1 ]
Ying, Xianghua [1 ]
Shi, Yongjie [1 ]
Zhao, He [1 ]
Wang, Ruibin [1 ]
机构
[1] Peking Univ, Sch EECS, Key Lab Machine Percept MOE, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several images are taken for the same scene with many view directions. Given a pixel in any one image of them, its correspondences may appear in the other images. However, by using existing semantic segmentation methods, we find that the pixel and its correspondences do not always have the same inferred label as expected. Fortunately, from the knowledge of multiple view geometry, if we keep the position of a camera unchanged, and only vary its orientation, there is a homography transformation to describe the relationship of corresponding pixels in such images. Based on this fact, we propose to generate images which are the same as real images of the scene taken in certain novel view directions for training and evaluation. We also introduce gradient guided deformable convolution to alleviate the inconsistency, by learning dynamic proper receptive field from feature gradients. Furthermore, a novel consistency loss is presented to enforce feature consistency. Compared with previous approaches, the proposed method gets significant improvement in both cross-view consistency and semantic segmentation performance on images with abundant view directions, while keeping comparable or better performance on the existing datasets.
引用
收藏
页码:1054 / 1060
页数:7
相关论文
共 50 条
  • [1] Cross-View Semantic Segmentation for Sensing Surroundings
    Pan, Bowen
    Sun, Jiankai
    Leung, Ho Yin Tiga
    Andonian, Alex
    Zhou, Bolei
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4867 - 4873
  • [2] Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation
    Wang, Zicheng
    Zhao, Zhen
    Xing, Xiaoxia
    Xu, Dong
    Kong, Xiangyu
    Zhou, Luping
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19585 - 19595
  • [3] Semantic Cross-View Matching
    Castaldo, Francesco
    Zamir, Amir
    Angst, Roland
    Palmieri, Francesco
    Savarese, Silvio
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 1044 - 1052
  • [4] Cross-view Transformers for real-time Map-view Semantic Segmentation
    Zhou, Brady
    Krahenbuhl, Philipp
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13750 - 13759
  • [5] Cross-view Semantic Alignment for Livestreaming Product Recognition
    Yang, Wenjie
    Chen, Yiyi
    Li, Yan
    Cheng, Yanhua
    Liu, Xudong
    Chen, Quan
    Li, Han
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13358 - 13367
  • [6] Cross-View Regularization for Domain Adaptive Panoptic Segmentation
    Huang, Jiaxing
    Guan, Dayan
    Xiao, Aoran
    Lu, Shijian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10128 - 10139
  • [7] Blind Image Quality Assessment via Cross-View Consistency
    Zhu, Yucheng
    Li, Yunhao
    Sun, Wei
    Min, Xiongkuo
    Zhai, Guangtao
    Yang, Xiaokang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7607 - 7620
  • [8] CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency
    Zhu, Hanxin
    Chen, Zhibo
    [J]. 2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES, VR 2024, 2024, : 960 - 968
  • [9] A Convex Discriminant Semantic Correlation Analysis for Cross-View Recognition
    Tian, Qing
    Ma, Chuang
    Cao, Meng
    Chen, Songcan
    Yin, Hujun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (02) : 849 - 861
  • [10] CVSformer: Cross-View Synthesis Transformer for Semantic Scene Completion
    Dong, Haotian
    Ma, Enhui
    Wang, Lubo
    Wang, Miaohui
    Xie, Wuyuan
    Guo, Qing
    Li, Ping
    Liang, Lingyu
    Yang, Kairui
    Lin, Di
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8840 - 8849