A sequence models-based real-time multi-person action recognition method with monocular vision

被引:2
|
作者
Yang, Aolei [1 ]
Lu, Wei [1 ]
Naeem, Wasif [2 ]
Chen, Ling [3 ]
Fei, Minrui [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[2] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
[3] Hunan Normal Univ, Sch Engn & Design, Changsha 410081, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Action recognition; Human body skeleton; Feature construction; Sequence models; Computer vision;
D O I
10.1007/s12652-021-03399-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In intelligent video surveillance under complex scenes, it is vital to identify the current actions of multi-target human bodies accurately and in real time. In this paper, a real-time multi-person action recognition method with monocular vision is proposed based on sequence models. Firstly, the key points of multi-target human body skeleton in the video are extracted by using the OpenPose algorithm. Then, the human action features are constructed, including limb direction vector and the skeleton height-width ratio. The multi-target human bodies tracking is then achieved by using the tracking algorithm. Next, the tracking results are matched with the action features, and the action recognition model is constructed, which includes the spatial branch based on Deep neural networks and the temporal branch based on Bi-directional RNN and Bi-directional long short-term memory networks. After pre-training, the model can be used to recognize the human body action from action features, and a recognition stabilizer is designed to minimize false alarms. Finally, extensive evaluations on the JHMDB dataset validate the effectiveness and the superiority of the proposed approach.
引用
收藏
页码:1877 / 1887
页数:11
相关论文
共 50 条
  • [31] Multi-task neural network with physical constraint for real-time multi-person 3D pose estimation from monocular camera
    Luo, Dingli
    Du, Songlin
    Ikenaga, Takeshi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27223 - 27244
  • [32] MDPose: Real-Time Multi-Person Pose Estimation via Mixture Density Model
    Seo, Seunghyeon
    Yoo, Jaeyoung
    Hwang, Jihye
    Kwak, Nojun
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1868 - 1878
  • [33] Real-Time Multi-Person Video Synthesis with Controllable Prior-Guided Matting
    Chen, Aoran
    Huang, Hai
    Zhu, Yueyan
    Xue, Junsheng
    [J]. SENSORS, 2024, 24 (09)
  • [34] Real-time postural training effects on single and multi-person ergonomic risk scores
    Berti, Nicola
    Finco, Serena
    Guidolin, Mattia
    Reggiani, Monica
    Battini, Daria
    [J]. IFAC PAPERSONLINE, 2022, 55 (10): : 163 - 168
  • [35] Multi-Person Action Recognition Based on Millimeter-Wave Radar Point Cloud
    Dang, Xiaochao
    Fan, Kai
    Li, Fenfang
    Tang, Yangyang
    Gao, Yifei
    Wang, Yue
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [36] Real-Time Lane Detection for Intelligent Vehicles Based on Monocular Vision
    Xu Fangfang
    Wang Bo
    Zhou Zhiqiang
    Zheng Zhihui
    [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 7332 - 7337
  • [37] Real-Time Estimation of Drivable Image Area based on Monocular Vision
    Miranda Neto, A.
    Victorino, A. Correa
    Fantoni, I.
    Ferreira, J. V.
    [J]. 2013 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2013, : 63 - 68
  • [38] Real-time region-based obstacle detection with monocular vision
    Wang, Hui
    Yuan, Kui
    Zou, Wei
    Peng, Yizhun
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, 2006, : 615 - +
  • [39] A REAL-TIME POLICE DOG ACTION RECOGNITION SYSTEM BASED ON VISION AND IMU SENSORS
    Zhan, Xiaoman
    Huang, Qian
    Zhu, Chaozheng
    Li, Xing
    Liu, Guangyun
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [40] The research on binocular vision based real-time object indication recognition method
    Li, Chuncan
    Zhang, Zhijiang
    Dong, Zhihua
    [J]. 27TH INTERNATIONAL CONGRESS ON HIGH SPEED PHOTOGRAPHY AND PHOTONICS, PRTS 1-3, 2007, 6279