SPATIO-TEMPORAL MID-LEVEL FEATURE BANK FOR ACTION RECOGNITION IN LOW QUALITY VIDEO

被引:0
|
作者
Rahman, Saimunur [1 ]
See, John [1 ]
机构
[1] Multimedia Univ, Fac Comp & Informat, Ctr Visual Comp, Cyberjaya 63100, Malaysia
关键词
Action recognition; Low quality video; Mid-level representation; Texture features; BSIF;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is a great challenge to perform high level recognition tasks on videos that are poor in quality. In this paper, we propose a new spatio-temporal mid-level (STEM) feature bank for recognizing human actions in low quality videos. The feature bank comprises of a trio of local spatio-temporal features, i.e. shape, motion and textures, which respectively encode structural, dynamic and statistical information in video. These features are encoded into mid-level representations and aggregated to construct STEM. Based on the recent binarized statistical image feature (BSIF), we also design a new spatio-temporal textural feature that extracts discriminately from 3D salient patches. Extensive experiments on the poor quality versions/subsets of the KTH and HMDB51 datasets demon-strate the effectiveness of the proposed approach.
引用
收藏
页码:1846 / 1850
页数:5
相关论文
共 50 条
  • [21] Histogram of Fuzzy Local Spatio-Temporal Descriptors for Video Action Recognition
    Zuo, Zheming
    Yang, Longzhi
    Liu, Yonghuai
    Chao, Fei
    Song, Ran
    Qu, Yanpeng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (06) : 4059 - 4067
  • [22] Action Recognition with Discriminative Mid-Level Features
    Liu, Cuiwei
    Kong, Yu
    Wu, Xinxiao
    Jia, Yunde
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3366 - 3369
  • [23] Learning spatio-temporal features for action recognition from the side of the video
    Pei, Lishen
    Ye, Mao
    Zhao, Xuezhuan
    Xiang, Tao
    Li, Tao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (01) : 199 - 206
  • [24] VIDEO ACTION RECOGNITION WITH SPATIO-TEMPORAL GRAPH EMBEDDING AND SPLINE MODELING
    Yuan, Yin
    Zheng, Haomian
    Li, Zhu
    Zhang, David
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2422 - 2425
  • [25] Learning spatio-temporal features for action recognition from the side of the video
    Lishen Pei
    Mao Ye
    Xuezhuan Zhao
    Tao Xiang
    Tao Li
    Signal, Image and Video Processing, 2016, 10 : 199 - 206
  • [26] Human Action Recognition in Video by Fusion of Structural and Spatio-temporal Features
    Borzeshi, Ehsan Zare
    Concha, Oscar Perez
    Piccardi, Massimo
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 474 - 482
  • [27] Blind video quality assessment based on Spatio-Temporal Feature Resolver
    Bi, Xiaodong
    He, Xiaohai
    Xiong, Shuhua
    Zhao, Zeming
    Chen, Honggang
    Sheriff, Raymond Edward
    NEUROCOMPUTING, 2024, 574
  • [28] A novel mid-level distinctive feature learning for action recognition via diffusion map
    Xu, Wanru
    Miao, Zhenjiang
    Tian, Yi
    NEUROCOMPUTING, 2016, 218 : 185 - 196
  • [29] Spatio-temporal Multi-level Fusion for Human Action Recognition
    Manh-Hung Lu
    Thi-Oanh Nguyen
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 298 - 305
  • [30] IMAGE QUALITY ASSESSMENT FOR FREE VIEWPOINT VIDEO BASED ON MID-LEVEL CONTOURS FEATURE
    Ling, Suiyi
    Le Callet, Patrick
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 79 - 84