Simultaneous Utilization of Inertial and Video Sensing for Action Detection and Recognition in Continuous Action Streams

被引:19
|
作者
Wei, Haoran [1 ]
Kehtarnavaz, Nasser [1 ]
机构
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Richardson, TX 75080 USA
关键词
Sports; Acceleration; Cameras; Image segmentation; Streaming media; Action detection and recognition in continuous action streams; simultaneous utilization of video and inertialsensing; deep learning-based continuous action detection and recognition; CLASSIFICATION; DEPTH; FUSION;
D O I
10.1109/JSEN.2020.2973361
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes the simultaneous utilization of inertial and video sensing for the purpose of achieving human action detection and recognition in continuous action streams. Continuous action streams mean that actions of interest are performed randomly among actions of non-interest in a continuous manner. The inertial and video data are captured simultaneously via a wearable inertial sensor and a video camera, which are turned into 2D and 3D images. These images are then fed into a 2D and a 3D convolutional neural network with their decisions fused in order to detect and recognize a specified set of actions of interest from continuous action streams. The developed fusion approach is applied to two sets of actions of interest consisting of smart TV gestures and sports actions. The results obtained indicate the fusion approach is more effective than when each sensing modality is used individually. The average accuracy of the fusion approach is found to be 5.8% above inertial and 14.3% above video for the TV gesture actions of interest, and 23.2% above inertial and 1.9% above video for the sports actions of interest.
引用
收藏
页码:6055 / 6063
页数:9
相关论文
共 50 条
  • [41] SAND in Action: Subsequence Anomaly Detection for Streams
    Boniol, Paul
    Paparrizos, John
    Palpanas, Themis
    Franklin, Michael J.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (12): : 2867 - 2870
  • [42] Explainable Action-Recognition Based Approach for Unsupervised Video Anomaly Detection
    Yang, Hui
    Khouadjia, Mostepha
    Seghouani, Nacera
    Ma, Yue
    Delmas, Serge
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 182 - 195
  • [43] Simultaneous Human Detection and Action Recognition Employing 2DPCA-HOG
    Naiel, Mohamed A.
    Abdelwahab, Moataz M.
    Elsaban, M.
    Mikhael, Wasfy B.
    2011 IEEE 54TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2011,
  • [44] Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network
    Dey, Arnab
    Biswas, Samit
    Le, Dac-Nhuong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 3067 - 3087
  • [45] Learning hierarchical video representation for action recognition
    Li Q.
    Qiu Z.
    Yao T.
    Mei T.
    Rui Y.
    Luo J.
    International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 85 - 98
  • [46] Leveraging Temporal Contextualization for Video Action Recognition
    Kim, Minji
    Han, Dongyoon
    Kim, Taekyung
    Han, Bohyung
    COMPUTER VISION - ECCV 2024, PT XXI, 2025, 15079 : 74 - 91
  • [47] Averaging Video Sequences to Improve Action Recognition
    Gao, Zhen
    Lu, Guoliang
    Yan, Peng
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 89 - 93
  • [48] Spatiotemporal Fusion Networks for Video Action Recognition
    Liu, Zheng
    Hu, Haifeng
    Zhang, Junxuan
    NEURAL PROCESSING LETTERS, 2019, 50 (02) : 1877 - 1890
  • [49] A Robust and Efficient Video Representation for Action Recognition
    Wang, Heng
    Oneata, Dan
    Verbeek, Jakob
    Schmid, Cordelia
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (03) : 219 - 238
  • [50] Deep Local Video Feature for Action Recognition
    Lan, Zhenzhong
    Zhu, Yi
    Hauptmann, Alexander G.
    Newsam, Shawn
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1219 - 1225