Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features

被引:458
|
作者
Ullah, Amin [1 ]
Ahmad, Jamil [1 ]
Muhammad, Khan [1 ]
Sajjad, Muhammad [2 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Digital Contents Res Inst, Intelligent Media Lab, Seoul 143747, South Korea
[2] Islamia Coll Peshawar, Dept Comp Sci, Digital Image Proc Lab, Peshawar 25000, Pakistan
来源
IEEE ACCESS | 2018年 / 6卷
基金
新加坡国家研究基金会;
关键词
Action recognition; deep learning; recurrent neural network; deep bidirectional long short-term memory; convolution neural network;
D O I
10.1109/ACCESS.2017.2778011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recurrent neural network (RNN) and long short-term memory (LSTM) have achieved great success in processing sequential multimedia data and yielded the state-of-the-art results in speech recognition, digital signal processing, video processing, and text data analysis. In this paper, we propose a novel action recognition method by processing the video data using convolutional neural network (CNN) and deep bidirectional LSTM (DB-LSTM) network. First, deep features are extracted from every sixth frame of the videos, which helps reduce the redundancy and complexity. Next, the sequential information among frame features is learnt using DB-LSTM network, where multiple layers are stacked together in both forward pass and backward pass of DB-LSTM to increase its depth. The proposed method is capable of learning long term sequences and can process lengthy videos by analyzing features for a certain time interval. Experimental results show significant improvements in action recognition using the proposed method on three benchmark data sets including UCF-101, YouTube 11 Actions, and HMDB51 compared with the state-of-the-art action recognition methods.
引用
收藏
页码:1155 / 1166
页数:12
相关论文
共 50 条
  • [1] Video Saliency Detection Using Bi-directional LSTM
    Chi, Yang
    Li, Jinjiang
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (06): : 2444 - 2463
  • [2] Yawn Detection for Driver's Drowsiness Prediction Using Bi-Directional LSTM with CNN Features
    Saurav, Sumeet
    Mathur, Shubhad
    Sang, Ishan
    Prasad, Shyam Sunder
    Singh, Sanjay
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 189 - 200
  • [3] Incremental Parsing with Minimal Features Using Bi-Directional LSTM
    Cross, James
    Huang, Liang
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 32 - 37
  • [4] Prediction of rebound in shotcrete using deep bi-directional LSTM
    Suzen, Ahmet A.
    Cakiroglu, Melda A.
    [J]. COMPUTERS AND CONCRETE, 2019, 24 (06): : 555 - 560
  • [5] Improving Phonetic Recognition with Sequence-length Standardized MFCC Features and Deep Bi-Directional LSTM
    Toan Pham Van
    Hau Nguyen Thanh
    Ta Minh Thanh
    [J]. PROCEEDINGS OF 2018 5TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS 2018), 2018, : 322 - 325
  • [6] DB-LSTM: Densely-connected Bi-directional LSTM for human action recognition
    He, Jun-Yan
    Wu, Xiao
    Cheng, Zhi-Qi
    Yuan, Zhaoquan
    Jiang, Yu-Gang
    [J]. NEUROCOMPUTING, 2021, 444 : 319 - 331
  • [7] Bi-directional lstm network speech-to-gesture generation using bi-directional lstm network
    Kaneko, Naoshi
    Takeuchi, Kenta
    Hasegawa, Dai
    Shirakawa, Shinichi
    Sakuta, Hiroshi
    Sumi, Kazuhiko
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2019, 34 (06):
  • [8] CNN features with bi-directional LSTM for real-time anomaly detection in surveillance networks
    Waseem Ullah
    Amin Ullah
    Ijaz Ul Haq
    Khan Muhammad
    Muhammad Sajjad
    Sung Wook Baik
    [J]. Multimedia Tools and Applications, 2021, 80 : 16979 - 16995
  • [9] CNN features with bi-directional LSTM for real-time anomaly detection in surveillance networks
    Ullah, Waseem
    Ullah, Amin
    Ul Haq, Ijaz
    Muhammad, Khan
    Sajjad, Muhammad
    Baik, Sung Wook
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 16979 - 16995
  • [10] A hybrid deep learning framework with CNN and Bi-directional LSTM for store item demand forecasting
    Joseph, Reuben Varghese
    Mohanty, Anshuman
    Tyagi, Soumyae
    Mishra, Shruti
    Satapathy, Sandeep Kumar
    Mohanty, Sachi Nandan
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103