Model-based approach to spatial-temporal sampling of video clips for video object detection by classification

被引:10
|
作者
Chuang, Chi-Han [1 ]
Cheng, Shyi-Chyi [1 ]
Chang, Chin-Chun [1 ]
Chen, Yi-Ping Phoebe [2 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung 202, Taiwan
[2] La Trobe Univ, Dept Comp Sci & Comp Engn, Bundoora, Vic 3086, Australia
关键词
Semantic video objects; Spatial-temporal sampling; Human action detection; Video object model; Dynamic programming; Multiple alignment; Model-based tracking; Video object detetcion; RECOGNITION; FRAMEWORK;
D O I
10.1016/j.jvcir.2014.02.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For a variety of applications such as video surveillance and event annotation, the spatial-temporal boundaries between video objects are required for annotating visual content with high-level semantics. In this paper, we define spatial-temporal sampling as a unified process of extracting video objects and computing their spatial-temporal boundaries using a learnt video object model. We first :provide a computational approach for learning an optimal key-object codebook sequence from a set of training video clips to characterize the semantics of the detected video objects. Then, dynamic programming with the learnt codebook sequence is used to locate the video objects with spatial-temporal boundaries in a test video clip. To verify the performance of the proposed method, a human action detection and recognition system is constructed. Experimental results show that the proposed method gives good performance on several publicly available datasets in terms of detection accuracy and recognition rate. (c) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:1018 / 1030
页数:13
相关论文
共 50 条
  • [31] Optimum Video Subset and Spatial-Temporal Video Retrieval
    Wang, Mei-Zhen
    Liu, Xue-Jun
    Sun, Kai-Xin
    Wang, Zi-Ran
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (09): : 2004 - 2023
  • [32] Exploring Spatial-Temporal Instance Relationships in an Intermediate Domain for Image-to-Video Object Detection
    Wen, Zihan
    Chen, Jin
    Wu, Xinxiao
    [J]. COMPUTER VISION - ACCV 2022 WORKSHOPS, 2023, 13848 : 360 - 375
  • [33] ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation
    Xu, Muzhou
    Zhong, Shan
    Liu, Chunping
    Gong, Shengrong
    Wang, Zhaohui
    Xia, Yu
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2089 - 2096
  • [34] Comparative histogram: A spatial-temporal segmentation algorithm for video object segmentation
    Su, DW
    Zhou, LL
    Wang, JF
    [J]. Soft Computing as Transdisciplinary Science and Technology, 2005, : 142 - 152
  • [35] Contrast Based Hierarchical Spatial-Temporal Saliency for Video
    Le, Trung-Nghia
    Sugimoto, Akihiro
    [J]. IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015, 2016, 9431 : 734 - 748
  • [36] A video segmentation algorithm based on spatial-temporal information
    Zhu, H
    Li, ZM
    [J]. 2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 566 - 569
  • [37] Video Captioning Based on the Spatial-Temporal Saliency Tracing
    Zhou, Yuanen
    Hu, Zhenzhen
    Liu, Xueliang
    Wang, Meng
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 59 - 70
  • [38] Video Quality Assessment Based on Spatial-temporal Distortion
    Yang, Chunting
    Liu, Yang
    Yu, Jing
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL I, 2009, : 818 - +
  • [39] A 3D wavelet based spatial-temporal approach for video watermarking
    Li, Y
    Gao, XB
    Ji, HB
    [J]. ICCIMA 2003: FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2003, : 260 - 265
  • [40] Temporal Pattern Recognition in Video Clips Detection
    Zheng, Aihua
    Ma, Jixin
    Luo, Bin
    Petridis, Miltos
    Zhai, Sulan
    Tang, Jin
    [J]. PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 416 - +