Simultaneous Utilization of Inertial and Video Sensing for Action Detection and Recognition in Continuous Action Streams

被引:19
|
作者
Wei, Haoran [1 ]
Kehtarnavaz, Nasser [1 ]
机构
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Richardson, TX 75080 USA
关键词
Sports; Acceleration; Cameras; Image segmentation; Streaming media; Action detection and recognition in continuous action streams; simultaneous utilization of video and inertialsensing; deep learning-based continuous action detection and recognition; CLASSIFICATION; DEPTH; FUSION;
D O I
10.1109/JSEN.2020.2973361
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes the simultaneous utilization of inertial and video sensing for the purpose of achieving human action detection and recognition in continuous action streams. Continuous action streams mean that actions of interest are performed randomly among actions of non-interest in a continuous manner. The inertial and video data are captured simultaneously via a wearable inertial sensor and a video camera, which are turned into 2D and 3D images. These images are then fed into a 2D and a 3D convolutional neural network with their decisions fused in order to detect and recognize a specified set of actions of interest from continuous action streams. The developed fusion approach is applied to two sets of actions of interest consisting of smart TV gestures and sports actions. The results obtained indicate the fusion approach is more effective than when each sensing modality is used individually. The average accuracy of the fusion approach is found to be 5.8% above inertial and 14.3% above video for the TV gesture actions of interest, and 23.2% above inertial and 1.9% above video for the sports actions of interest.
引用
收藏
页码:6055 / 6063
页数:9
相关论文
共 50 条
  • [21] Short-Term Action Learning for Video Action Recognition
    Ting-Long, Liu
    IEEE ACCESS, 2024, 12 : 30867 - 30875
  • [22] Meta-action descriptor for action recognition in RGBD video
    Huang, Min
    Su, Song-Zhi
    Cai, Guo-Rong
    Zhang, Hong-Bo
    Cao, Donglin
    Li, Shao-Zi
    IET COMPUTER VISION, 2017, 11 (04) : 301 - 308
  • [23] Video-based In-vehicle Action Recognition for Continuous Health Monitoring
    Ramachandran, Ashwin
    Gokhale, Kartike
    Kripps, Maike
    Deserno, Thomas
    MEDICAL IMAGING 2023, 2023, 12469
  • [24] Real-time action detection and temporal segmentation in continuous video
    Liu, Xueping
    Li, Yibo
    Shen, Qing
    IMAGING SCIENCE JOURNAL, 2017, 65 (07): : 418 - 427
  • [25] Coupling Video Segmentation and Action Recognition
    Ghodrati, Amir
    Pedersoli, Marco
    Tuytelaars, Tinne
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 618 - 625
  • [26] Breaking video into pieces for action recognition
    Ying Zheng
    Hongxun Yao
    Xiaoshuai Sun
    Xuesong Jiang
    Fatih Porikli
    Multimedia Tools and Applications, 2017, 76 : 22195 - 22212
  • [27] Action recognition in broadcast tennis video
    Zhu, Guangyu
    Xu, Changsheng
    Huang, Qingming
    Gao, Wen
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 251 - +
  • [28] Modeling Video Evolution For Action Recognition
    Fernando, Basura
    Gavves, Efstratios
    Oramas, Jose M.
    Ghodrati, Amir
    Tuytelaars, Tinne
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5378 - 5387
  • [29] Recurring the Transformer for Video Action Recognition
    Yang, Jiewen
    Dong, Xingbo
    Liu, Liujun
    Zhang, Chao
    Shen, Jiajun
    Yu, Dahai
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14043 - 14053
  • [30] Breaking video into pieces for action recognition
    Zheng, Ying
    Yao, Hongxun
    Sun, Xiaoshuai
    Jiang, Xuesong
    Porikli, Fatih
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (21) : 22195 - 22212