Simultaneous Utilization of Inertial and Video Sensing for Action Detection and Recognition in Continuous Action Streams

Cited by: 19
Authors
Wei, Haoran [1]
Kehtarnavaz, Nasser [1]
Affiliations
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Richardson, TX 75080 USA
Keywords
Sports; Acceleration; Cameras; Image segmentation; Streaming media; Action detection and recognition in continuous action streams; simultaneous utilization of video and inertial sensing; deep learning-based continuous action detection and recognition; CLASSIFICATION; DEPTH; FUSION
DOI
10.1109/JSEN.2020.2973361
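Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract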
This paper describes the simultaneous use of inertial and video sensing for human action detection and recognition in continuous action streams. In a continuous action stream, actions of interest are performed randomly among actions of non-interest in a continuous manner. The inertial and video data are captured simultaneously by a wearable inertial sensor and a video camera and are then converted into 2D and 3D images. These images are fed into a 2D and a 3D convolutional neural network, whose decisions are fused to detect and recognize a specified set of actions of interest in continuous action streams. The fusion approach is applied to two sets of actions of interest: smart TV gestures and sports actions. The results indicate that the fusion approach is more effective than either sensing modality used individually. The average accuracy of the fusion approach is 5.8% above inertial and 14.3% above video for the TV gesture actions of interest, and 23.2% above inertial and 1.9% above video for the sports actions of interest.
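The abstract states that the decisions of the two networks are fused but does not give the fusion rule. The sketch below illustrates one common form of decision-level fusion, a weighted average of per-class probabilities, purely as an assumption. The function names, the weight `w_inertial`, the class labels, and the probability values are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def fuse_decisions(p_inertial: Sequence[float],
                   p_video: Sequence[float],
                   w_inertial: float = 0.5) -> List[float]:
    """Weighted average of two per-class probability vectors.

    Assumption: each network outputs one probability per class, in the
    same class order. The paper's actual fusion rule may differ.
    """
    if len(p_inertial) != len(p_video):
        raise ValueError("both networks must score the same classes")
    if not 0.0 <= w_inertial <= 1.0:
        raise ValueError("w_inertial must lie in [0, 1]")
    w_video = 1.0 - w_inertial
    return [w_inertial * a + w_video * b
            for a, b in zip(p_inertial, p_video)]


def predict(fused: Sequence[float], labels: Sequence[str]) -> str:
    """Return the label with the highest fused score."""
    best = max(range(len(fused)), key=lambda i: fused[i])
    return labels[best]


# Hypothetical classes and scores for one window of the stream.
labels = ["non-interest", "swipe left", "swipe right"]
p_inertial = [0.2, 0.5, 0.3]   # made-up output of the 2D CNN
p_video = [0.1, 0.3, 0.6]      # made-up output of the 3D CNN

fused = fuse_decisions(p_inertial, p_video)
print(predict(fused, labels))
```

In this toy window the two modalities disagree, and the fused scores decide between them. Treating "non-interest" as an extra class is one possible way to handle detection in a continuous stream and is likewise an assumption here.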
Pages: 6055-6063
Page count: 9