Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features

被引:458
|
作者
Ullah, Amin [1 ]
Ahmad, Jamil [1 ]
Muhammad, Khan [1 ]
Sajjad, Muhammad [2 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Digital Contents Res Inst, Intelligent Media Lab, Seoul 143747, South Korea
[2] Islamia Coll Peshawar, Dept Comp Sci, Digital Image Proc Lab, Peshawar 25000, Pakistan
来源
IEEE ACCESS | 2018年 / 6卷
基金
新加坡国家研究基金会;
关键词
Action recognition; deep learning; recurrent neural network; deep bidirectional long short-term memory; convolution neural network;
D O I
10.1109/ACCESS.2017.2778011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recurrent neural network (RNN) and long short-term memory (LSTM) have achieved great success in processing sequential multimedia data and yielded the state-of-the-art results in speech recognition, digital signal processing, video processing, and text data analysis. In this paper, we propose a novel action recognition method by processing the video data using convolutional neural network (CNN) and deep bidirectional LSTM (DB-LSTM) network. First, deep features are extracted from every sixth frame of the videos, which helps reduce the redundancy and complexity. Next, the sequential information among frame features is learnt using DB-LSTM network, where multiple layers are stacked together in both forward pass and backward pass of DB-LSTM to increase its depth. The proposed method is capable of learning long term sequences and can process lengthy videos by analyzing features for a certain time interval. Experimental results show significant improvements in action recognition using the proposed method on three benchmark data sets including UCF-101, YouTube 11 Actions, and HMDB51 compared with the state-of-the-art action recognition methods.
引用
收藏
页码:1155 / 1166
页数:12
相关论文
共 50 条
  • [41] A Bi-Directional LSTM-CNN Model with Attention for Aspect-Level Text Classification
    Zhu, Yonghua
    Gao, Xun
    Zhang, Weilin
    Liu, Shenkai
    Zhang, Yuanyuan
    [J]. FUTURE INTERNET, 2018, 10 (12):
  • [42] Gated Bi-directional CNN for Object Detection
    Zeng, Xingyu
    Ouyang, Wanli
    Yang, Bin
    Yan, Junjie
    Wang, Xiaogang
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 354 - 369
  • [43] Human Action Recognition in Video Sequences Using Deep Belief Networks
    Abdellaoui, Mehrez
    Douik, Ali
    [J]. TRAITEMENT DU SIGNAL, 2020, 37 (01) : 37 - 44
  • [44] Part-Of-Speech Tagger in Malayalam Using Bi-directional LSTM
    Rajan, Rajeev
    Joseph, Anna J.
    Robin, Elizabeth K.
    Nishma, Fathima T. K.
    [J]. PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 22 - 27
  • [45] Classification of pulmonary arterial pressure using photoplethysmography and bi-directional LSTM
    Zhang, Qian
    Ma, Pei
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [46] Air pollutant severity prediction using Bi-directional LSTM Network
    Verma, Ishan
    Ahuja, Rahul
    Meisheri, Hardik
    Dey, Lipika
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 651 - 654
  • [47] Lost circulation monitoring using bi-directional LSTM and data augmentation
    Sun, Weifeng
    Li, Weihua
    Zhang, Dezhi
    Liu, Kai
    Wang, Chen
    Dai, Yongshou
    Huang, Weimin
    [J]. GEOENERGY SCIENCE AND ENGINEERING, 2023, 225
  • [48] Authorship Attribution on Kannada Text using Bi-Directional LSTM Technique
    Chandrika, C. P.
    Kallimani, Jagadish S.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 963 - 971
  • [49] An Optimization-Based Diabetes Prediction Model Using CNN and Bi-Directional LSTM in Real-Time Environment
    Madan, Parul
    Singh, Vijay
    Chaudhari, Vaibhav
    Albagory, Yasser
    Dumka, Ankur
    Singh, Rajesh
    Gehlot, Anita
    Rashid, Mamoon
    Alshamrani, Sultan S.
    AlGhamdi, Ahmed Saeed
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [50] A Robust CSI-Based Passive Perception Method Using CNN and Attention-Based Bi-Directional LSTM
    He, Zhengran
    Xu, Guozhen
    Xu, Siyuan
    Wang, Yu
    Gui, Guan
    Gacanin, Haris
    Adachi, Fumiyuki
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1862 - 1867