Omnisupervised Omnidirectional Semantic Segmentation

被引:25
|
作者
Yang, Kailun [1 ]
Hu, Xinxin [2 ]
Fang, Yicheng [2 ]
Wang, Kaiwei [2 ]
Stiefelhagen, Rainer [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Anthropomat & Robot, D-76131 Karlsruhe, Germany
[2] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
关键词
Semantics; Image segmentation; Training; Data models; Sensors; Task analysis; Cameras; Intelligent vehicles; scene understanding; semantic segmentation; scene parsing; omnisupervised learning; omnidirectional images; VIDEO;
D O I
10.1109/TITS.2020.3023331
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Modern efficient Convolutional Neural Networks (CNNs) are able to perform semantic segmentation both swiftly and accurately, which covers typically separate detection tasks desired by Intelligent Vehicles (IV) in a unified way. Most of the current semantic perception frameworks are designed to work with pinhole cameras and benchmarked against public datasets with narrow Field-of-View (FoV) images. However, there is a large accuracy downgrade when a pinhole-yielded CNN is taken to omnidirectional imagery, causing it unreliable for surrounding perception. In this paper, we propose an omnisupervised learning framework for efficient CNNs, which bridges multiple heterogeneous data sources that are already available in the community, bypassing the labor-intensive process to have manually annotated panoramas, while improving their reliability in unseen omnidirectional domains. Being omnisupervised, the efficient CNN exploits both labeled pinhole images and unlabeled panoramas. The framework is based on our specialized ensemble method that considers the wide-angle and wrap-around features of omnidirectional images, to automatically generate panoramic labels for data distillation. A comprehensive variety of experiments demonstrates that the proposed solution helps to attain significant generalizability gains in panoramic imagery domains. Our approach outperforms state-of-the-art efficient segmenters on highly unconstrained IDD20K and PASS datasets.
引用
收藏
页码:1184 / 1199
页数:16
相关论文
共 50 条
  • [1] Simultaneously Learning Semantic Segmentation and Depth Estimation from Omnidirectional Image
    Yokota, Atsushi
    Li, Shigang
    Kamio, Takeshi
    Kosaku, Toshiharu
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2024, 144 (06) : 560 - 567
  • [2] A comparative study of semantic segmentation of omnidirectional images from a motorcycle perspective
    Sekkat, Ahmed Rida
    Dupuis, Yohan
    Honeine, Paul
    Vasseur, Pascal
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [3] A comparative study of semantic segmentation of omnidirectional images from a motorcycle perspective
    Ahmed Rida Sekkat
    Yohan Dupuis
    Paul Honeine
    Pascal Vasseur
    [J]. Scientific Reports, 12
  • [4] Semantic Mapping with Omnidirectional Vision
    Posada, Luis Felipe
    Velasquez-Lopez, Alejandro
    Hoffmann, Frank
    Bertram, Torsten
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1901 - 1907
  • [5] Omnidirectional semantic segmentation fusion network with cross-stage and cross-dimensional remodeling
    Zhang, Miaohui
    Li, Shilong
    Wang, Dakai
    Cui, Zhisheng
    Xin, Ming
    [J]. Computers and Electrical Engineering, 2025, 122
  • [6] 1D Self-Attention Network for Point Cloud Semantic Segmentation Using Omnidirectional LiDAR
    Suzuki, Takahiro
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    [J]. PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 257 - 270
  • [7] Semantic Classification of Scenes and Places with Omnidirectional Vision
    Posada, Luis Felipe
    Narayanan, Krishna Kumar
    Hoffmann, Frank
    Bertram, Torsten
    [J]. 2013 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR 2013), 2013, : 113 - 118
  • [8] NOVEL TILE SEGMENTATION SCHEME FOR OMNIDIRECTIONAL VIDEO
    Li, Jisheng
    Wen, Ziyu
    Li, Sihan
    Zhao, Yikai
    Guo, Bichuan
    Wen, Jiangtao
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 370 - 374
  • [9] Semantic Amodal Segmentation
    Zhu, Yan
    Tian, Yuandong
    Metaxas, Dimitris
    Dollar, Piotr
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3001 - 3009
  • [10] Semantic Soft Segmentation
    Aksoy, Yagiz
    Oh, Tae-Hyun
    Paris, Sylvain
    Pollefeys, Marc
    Matusik, Wojciech
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):