Capturing the spatio-temporal continuity for video semantic segmentation

Cited by: 3
Authors
Chen, Xin [1]
Wu, Aming [1]
Han, Yahong [1]
Affiliation
[1] Tianjin Univ, Coll Intelligence & Comp, Yaguan Rd, Tianjin, Peoples R China
Keywords
feature extraction; image segmentation; image representation; video signal processing; neural nets; probability; video semantic segmentation; image semantic segmentation; convolutional neural network; image segmentation algorithms; video frame; temporal region continuity inherent; videos; deep neural network architecture; newly devised spatio-temporal continuity; encoding network; STC module; decoding network; high-level feature map; STC feature map; current feature representation; consecutive video frames; segmentation result;
DOI
10.1049/iet-ipr.2018.6479
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, image semantic segmentation based on convolutional neural networks has advanced considerably. However, progress in video semantic segmentation has been relatively slow. Directly applying image segmentation algorithms to each video frame separately may ignore the temporal region continuity inherent in videos. In this study, the authors propose a novel deep neural network architecture with a newly devised spatio-temporal continuity (STC) module for video semantic segmentation. Specifically, the architecture comprises an encoding network, an STC module, and a decoding network. The encoding network extracts a high-level feature map. The STC module then takes the high-level feature map as input and extracts the STC feature map. For decoding, they use four dilated convolutional layers to obtain a more abstract representation and a deconvolutional layer to increase the size of the representation. Finally, they fuse the current feature representation with the previous feature representation and obtain the class probabilities. Thus, the architecture receives a sequence of consecutive video frames and outputs the segmentation result for the current frame. They extensively evaluate the proposed approach on the CamVid and KITTI datasets. Compared with other methods, the authors' approach not only achieves competitive performance but also has lower complexity.
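The final step described in the abstract (fusing the current and previous feature representations, then producing class probabilities) can be illustrated with a minimal sketch. The abstract does not say how the two representations are fused, so the element-wise sum below is an assumption, as are the function names `fuse_and_classify` and `predict_labels` and the toy per-pixel scores; this is not the authors' implementation.

```python
import math

def softmax(scores):
    # Numerically stable softmax over one pixel's class scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fuse_and_classify(current, previous):
    """Fuse two H x W x C score maps and return per-pixel class probabilities.

    ASSUMPTION: fusion is an element-wise sum; the abstract does not
    specify the operator actually used.
    """
    probs = []
    for row_cur, row_prev in zip(current, previous):
        out_row = []
        for px_cur, px_prev in zip(row_cur, row_prev):
            fused = [c + p for c, p in zip(px_cur, px_prev)]
            out_row.append(softmax(fused))
        probs.append(out_row)
    return probs

def predict_labels(probs):
    # Pick the most probable class index at every pixel.
    return [[max(range(len(px)), key=px.__getitem__) for px in row]
            for row in probs]

# Hypothetical 1 x 2 "frame" with 3 classes per pixel.
current = [[[2.0, 0.5, 0.1], [0.2, 0.3, 0.4]]]
previous = [[[1.5, 0.2, 0.0], [0.1, 1.9, 0.3]]]

probs = fuse_and_classify(current, previous)
labels = predict_labels(probs)
print(labels)  # [[0, 1]]
```

In this toy input, the second pixel would be assigned class 2 from the current frame alone; after fusion with the previous frame's scores it is assigned class 1, which shows how temporal context can change a per-frame decision.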
Pages: 2813-2820
Number of pages: 8