A sequence models-based real-time multi-person action recognition method with monocular vision

被引:2
|
作者
Yang, Aolei [1 ]
Lu, Wei [1 ]
Naeem, Wasif [2 ]
Chen, Ling [3 ]
Fei, Minrui [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[2] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
[3] Hunan Normal Univ, Sch Engn & Design, Changsha 410081, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Action recognition; Human body skeleton; Feature construction; Sequence models; Computer vision;
D O I
10.1007/s12652-021-03399-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In intelligent video surveillance under complex scenes, it is vital to identify the current actions of multi-target human bodies accurately and in real time. In this paper, a real-time multi-person action recognition method with monocular vision is proposed based on sequence models. Firstly, the key points of multi-target human body skeleton in the video are extracted by using the OpenPose algorithm. Then, the human action features are constructed, including limb direction vector and the skeleton height-width ratio. The multi-target human bodies tracking is then achieved by using the tracking algorithm. Next, the tracking results are matched with the action features, and the action recognition model is constructed, which includes the spatial branch based on Deep neural networks and the temporal branch based on Bi-directional RNN and Bi-directional long short-term memory networks. After pre-training, the model can be used to recognize the human body action from action features, and a recognition stabilizer is designed to minimize false alarms. Finally, extensive evaluations on the JHMDB dataset validate the effectiveness and the superiority of the proposed approach.
引用
收藏
页码:1877 / 1887
页数:11
相关论文
共 50 条
  • [41] Real-Time Hand Gesture Recognition Based on Vision
    Ren, Yu
    Gu, Chengcheng
    [J]. ENTERTAINMENT FOR EDUCATION: DIGITAL TECHNIQUES AND SYSTEMS, 2010, 6249 : 468 - 475
  • [42] End-to-End Feature Pyramid Network for Real-Time Multi-Person Pose Estimation
    Luo, Dingli
    Du, Songlin
    Ikenaga, Takeshi
    [J]. PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
  • [43] Deep Learning-Based Real-Time Multiple-Person Action Recognition System
    Tsai, Jen-Kai
    Hsu, Chen-Chien
    Wang, Wei-Yen
    Huang, Shao-Kang
    [J]. SENSORS, 2020, 20 (17) : 1 - 17
  • [44] Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose
    Osokin, Daniil
    [J]. ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 744 - 748
  • [45] Real-Time 3-D Human Action Recognition Based on Hyperpoint Sequence
    Li, Xing
    Huang, Qian
    Wang, Zhijian
    Yang, Tianjin
    Hou, Zhenjie
    Miao, Zhuang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (08) : 8933 - 8942
  • [46] Robot Vision System for Real-Time Human Detection and Action Recognition
    Hoshino, Satoshi
    Niimura, Kyohei
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15, 2019, 867 : 507 - 519
  • [47] AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
    Fang, Hao-Shu
    Li, Jiefeng
    Tang, Hongyang
    Xu, Chao
    Zhu, Haoyi
    Xiu, Yuliang
    Li, Yong-Lu
    Lu, Cewu
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7157 - 7173
  • [48] A review of real-time human action recognition involving vision sensing
    Majumder, S.
    Kehtarnavaz, N.
    [J]. REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2021, 2021, 11736
  • [49] Multi-camera, multi-person, and real-time fall detection using long short term memory
    Taufeeque, Mohammad
    Koita, Samad
    Spicher, Nicolai
    Deserno, Thomas M.
    [J]. MEDICAL IMAGING 2021: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2021, 11601
  • [50] Monocular Vision-Based Real-Time Vehicle Detection at Container Terminals
    Liu, Zijian
    Zhang, Tianlei
    He, Bei
    Liu, Yu
    Sun, Li
    Tang, Wenyang
    [J]. PROCEEDINGS OF CHINA SAE CONGRESS 2018: SELECTED PAPERS, 2020, 574 : 821 - 830