Model-based approach to spatial-temporal sampling of video clips for video object detection by classification

Cited by: 10
Authors
Chuang, Chi-Han [1 ]
Cheng, Shyi-Chyi [1 ]
Chang, Chin-Chun [1 ]
Chen, Yi-Ping Phoebe [2 ]
Affiliations
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung 202, Taiwan
[2] La Trobe Univ, Dept Comp Sci & Comp Engn, Bundoora, Vic 3086, Australia
Keywords
Semantic video objects; Spatial-temporal sampling; Human action detection; Video object model; Dynamic programming; Multiple alignment; Model-based tracking; Video object detection; RECOGNITION; FRAMEWORK
DOI
10.1016/j.jvcir.2014.02.014
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
For a variety of applications such as video surveillance and event annotation, the spatial-temporal boundaries between video objects are required for annotating visual content with high-level semantics. In this paper, we define spatial-temporal sampling as a unified process of extracting video objects and computing their spatial-temporal boundaries using a learnt video object model. We first provide a computational approach for learning an optimal key-object codebook sequence from a set of training video clips to characterize the semantics of the detected video objects. Then, dynamic programming with the learnt codebook sequence is used to locate the video objects with spatial-temporal boundaries in a test video clip. To verify the performance of the proposed method, a human action detection and recognition system is constructed. Experimental results show that the proposed method gives good performance on several publicly available datasets in terms of detection accuracy and recognition rate. (c) 2014 Elsevier Inc. All rights reserved.
Pages: 1018 - 1030
Number of pages: 13
Related papers
50 in total
  • [1] Video Object Detection with an Aligned Spatial-Temporal Memory
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 494 - 510
  • [2] Object Detection-Based Video Retargeting With Spatial-Temporal Consistency
    Lee, Seung Joon
    Lee, Siyeong
    Cho, Sung In
    Kang, Suk-Ju
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4434 - 4439
  • [3] SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION
    Chen, Zhu
    Li, Weihai
    Fei, Chi
    Liu, Bin
    Yu, Nenghai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1858 - 1862
  • [4] Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
    Xu, Chao
    Zhang, Jiangning
    Wang, Mengmeng
    Tian, Guanzhong
    Liu, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7809 - 7820
  • [5] A spatial-temporal approach for video caption detection and recognition
    Tang, X
    Gao, XB
    Liu, JZ
    Zhang, HJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04) : 961 - 971
  • [6] RANDOM-SAMPLING-BASED SPATIAL-TEMPORAL FEATURE FOR CONSUMER VIDEO CONCEPT CLASSIFICATION
    Wei, Anjun
    Pei, Yuru
    Zha, Hongbin
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1861 - 1864
  • [7] Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
    Zhao, Baojun
    Zhao, Boya
    Tang, Linbo
    Han, Yuqi
    Wang, Wenzheng
    [J]. SENSORS, 2018, 18 (03)
  • [8] Learning Complementary Spatial-Temporal Transformer for Video Salient Object Detection
    Liu, Nian
    Nan, Kepan
    Zhao, Wangbo
    Yao, Xiwen
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10663 - 10673
  • [9] End-to-End Video Object Detection with Spatial-Temporal Transformers
    He, Lu
    Zhou, Qianyu
    Li, Xiangtai
    Niu, Li
    Cheng, Guangliang
    Li, Xiao
    Liu, Wenxuan
    Tong, Yunhai
    Ma, Lizhuang
    Zhang, Liqing
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1507 - 1516
  • [10] Lightweight unmanned aerial vehicle video object detection based on spatial-temporal correlation
    Zhou, Pei
    Liu, GuanJun
    Wang, Jiacun
    Weng, QianLi
    Zhang, KaiWen
    Zhou, ZiYuan
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022, 35 (17)