A sequence models-based real-time multi-person action recognition method with monocular vision

Cited by: 2
Authors
Yang, Aolei [1]
Lu, Wei [1]
Naeem, Wasif [2]
Chen, Ling [3]
Fei, Minrui [1]
Affiliations
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[2] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
[3] Hunan Normal Univ, Sch Engn & Design, Changsha 410081, Peoples R China
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
Action recognition; Human body skeleton; Feature construction; Sequence models; Computer vision;
DOI
10.1007/s12652-021-03399-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In intelligent video surveillance of complex scenes, it is vital to identify the current actions of multiple people accurately and in real time. This paper proposes a real-time multi-person action recognition method for monocular vision based on sequence models. First, the skeleton key points of each person in the video are extracted with the OpenPose algorithm. Human action features are then constructed, including limb direction vectors and the skeleton height-width ratio. Next, the people in the scene are tracked with a tracking algorithm, and the tracking results are matched with the action features. The action recognition model comprises a spatial branch based on deep neural networks and a temporal branch based on bi-directional RNN and bi-directional long short-term memory (LSTM) networks. After pre-training, the model recognizes human actions from the action features, and a recognition stabilizer is designed to minimize false alarms. Finally, extensive evaluations on the JHMDB dataset validate the effectiveness and superiority of the proposed approach.
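The feature construction step named in the abstract (limb direction vectors plus a skeleton height-width ratio) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the joint names, the `LIMBS` list, the unit-length normalization, the zero-vector fallback for missing joints, and the function names are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float]

# Hypothetical limb list as (start joint, end joint) name pairs.
# The abstract does not give the paper's joint set or limb definitions.
LIMBS: List[Tuple[str, str]] = [
    ("neck", "r_shoulder"), ("r_shoulder", "r_elbow"), ("r_elbow", "r_wrist"),
    ("neck", "l_shoulder"), ("l_shoulder", "l_elbow"), ("l_elbow", "l_wrist"),
    ("neck", "r_hip"), ("r_hip", "r_knee"), ("r_knee", "r_ankle"),
    ("neck", "l_hip"), ("l_hip", "l_knee"), ("l_knee", "l_ankle"),
]


def limb_direction(a: Point, b: Point) -> Tuple[float, float]:
    """Unit vector from joint a to joint b; (0, 0) if the joints coincide."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    norm = math.hypot(dx, dy)
    if norm == 0.0:
        return (0.0, 0.0)
    return (dx / norm, dy / norm)


def height_width_ratio(points: List[Point]) -> float:
    """Height / width of the box enclosing the key points; 0.0 if degenerate."""
    if not points:
        return 0.0
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)
    return height / width if width > 0 else 0.0


def build_features(skeleton: Dict[str, Point]) -> List[float]:
    """Concatenate one direction vector per limb, then the height-width ratio."""
    features: List[float] = []
    for start, end in LIMBS:
        if start in skeleton and end in skeleton:
            features.extend(limb_direction(skeleton[start], skeleton[end]))
        else:
            # Assumed fallback: an undetected joint yields a zero vector.
            features.extend((0.0, 0.0))
    features.append(height_width_ratio(list(skeleton.values())))
    return features


# Example: a partially detected skeleton in pixel coordinates.
skeleton = {
    "neck": (100.0, 50.0),
    "r_shoulder": (80.0, 50.0),
    "r_hip": (90.0, 150.0),
    "r_knee": (90.0, 200.0),
}
features = build_features(skeleton)
```

Under these assumptions each person yields a fixed-length vector (two values per limb plus one ratio) per frame, so a sequence of such vectors could be fed to a temporal model. Direction vectors do not depend on a person's scale or position in the image, which is one plausible reason to prefer them over raw coordinates.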
Pages: 1877-1887
Number of pages: 11
Related papers (50 in total)
  • [1] A sequence models-based real-time multi-person action recognition method with monocular vision
    Aolei Yang
    Wei Lu
    Wasif Naeem
    Ling Chen
    Minrui Fei
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 1877 - 1887
  • [2] Real-Time Multi-Person Action Recognition with a Neural Compute Stick
    Yoon, Young-Chul
    Jung, Hyeonseok
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 1135 - 1140
  • [3] Real-Time Multi-Camera Multi-Person Action Recognition using Pose Estimation
    Phang, Jonathan Then Sien
    Lim, King Hann
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2019), 2019, : 175 - 180
  • [4] Real-Time Multi-Person Video-Based Pose Estimation
    Yan Fenting
    Wang Peng
    Lu Zhigang
    Ding Zhe
    Qiao Mengyu
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [5] GazeOnce: Real-Time Multi-Person Gaze Estimation
    Zhang, Mingfang
    Liu, Yunfei
    Lu, Feng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4187 - 4196
  • [6] Real-Time Multi-Person Smoking Event Detection
    Cabanto, Waynebert Jan D.
    Jocson, Aira Danielle B.
    Lateo, Renzel Laurence T.
    De Goma, Joel C.
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND BIG DATA (ICCBD 2019), 2019, : 126 - 130
  • [7] Real-time multi-person tracking in video surveillance
    Niu, W
    Jiao, L
    Han, D
    Wang, YF
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1144 - 1148
  • [8] The Method of Real-time Distance Measurement Based on Monocular Vision
    Lu, Wei
    Wang, Tingting
    Chu, Jinghui
    [J]. MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1451 - 1454
  • [9] Real Time Vision Based Multi-person Tracking for Mobile Robotics and Intelligent Vehicles
    Mitzel, Dennis
    Floros, Georgios
    Sudowe, Patrick
    van der Zander, Benito
    Leibe, Bastian
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, PT II, 2011, 7102 : 105 - 115
  • [10] Real-Time Multi-Person Tracking with Time-Constrained Detection
    Mitzel, Dennis
    Sudowe, Patrick
    Leibe, Bastian
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,