Spatial as Deep: Spatial CNN for Traffic Scene Understanding

被引:0
|
作者
Pan, Xingang [1 ]
Shi, Jianping [2 ]
Luo, Ping [1 ]
Wang, Xiaogang [1 ]
Tang, Xiaoou [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] SenseTime Grp Ltd, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks (CNNs) are usually built by stacking convolutional operations layer-by-layer. Although CNN has shown strong capability to extract semantics from raw pixels, its capacity to capture spatial relationships of pixels across rows and columns of an image is not fully explored. These relationships are important to learn semantic objects with strong shape priors but weak appearance coherences, such as traffic lanes, which are often occluded or not even painted on the road surface as shown in Fig. 1 (a). In this paper, we propose Spatial CNN (SCNN), which generalizes traditional deep layer-by-layer convolutions to slice-by-slice convolutions within feature maps, thus enabling message passings between pixels across rows and columns in a layer. Such SCNN is particular suitable for long continuous shape structure or large objects, with strong spatial relationship but less appearance clues, such as traffic lanes, poles, and wall. We apply SCNN on a newly released very challenging traffic lane detection dataset and Cityscapse dataset(1). The results show that SCNN could learn the spatial relationship for structure output and significantly improves the performance. We show that SCNN outperforms the recurrent neural network (RNN) based ReNet and MRF+CNN (MRFNet) in the lane detection dataset by 8.7% and 4.6% respectively. Moreover, our SCNN won the 1st place on the TuSimple Benchmark Lane Detection Challenge, with an accuracy of 96.53%.
引用
收藏
页码:7276 / 7283
页数:8
相关论文
共 50 条
  • [1] TRAFFIC SCENE RECOGNITION BASED ON DEEP CNN AND VLAD SPATIAL PYRAMIDS
    Wu, Fang-Yu
    Yan, Shi-Yang
    Smith, Jeremy S.
    Zhang, Bai-Ling
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2017, : 156 - 161
  • [2] A Fast Multi-Task CNN for Spatial Understanding of Traffic Scenes
    Oeljeklaus, Malte
    Hoffmann, Frank
    Bertram, Torsten
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2825 - 2830
  • [3] ABSSNet: Attention-Based Spatial Segmentation Network for Traffic Scene Understanding
    Li, Xuelong
    Zhao, Zhiyuan
    Wang, Qi
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9352 - 9362
  • [4] Robotic Understanding of Scene Contents and Spatial Constraints
    Wilson, Dustin
    Yan, Fujian
    Sinha, Kaushik
    He, Hongsheng
    [J]. SOCIAL ROBOTICS, ICSR 2018, 2018, 11357 : 93 - 102
  • [5] Spatial Sampling Network for Fast Scene Understanding
    Mazzini, Davide
    Schettini, Raimondo
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1286 - 1296
  • [6] Egocentric Scene Understanding via Multimodal Spatial Rectifier
    Do, Tien
    Vuong, Khiem
    Park, Hyun Soo
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2822 - 2831
  • [7] Probabilistic spatial context models for scene content understanding
    Singhal, A
    Luo, RB
    Zhu, WY
    [J]. 2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 235 - 241
  • [8] Deep Spatial Pyramid Match Kernel for Scene Classification
    Gupta, Shikha
    Pradhan, Deepak Kumar
    Dinesh, Dileep Aroor
    Thenkanidiyoor, Veena
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 141 - 148
  • [9] A Comparison of Qualitative and Metric Spatial Relation Models for Scene Understanding
    Thippur, Akshaya
    Burbridge, Chris
    Kunze, Lars
    Alberti, Marina
    Folkesson, John
    Jensfelt, Patric
    Hawes, Nick
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 1632 - 1640
  • [10] Spatial Positioning Method of Vehicle in Cross-Camera Traffic Scene
    Wang, Wei
    Tang, Xinyao
    Zhang, Chaoyang
    Song, Huansheng
    Cui, Hua
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (06): : 873 - 882