Multi-Stream Interaction Networks for Human Action Recognition

被引:30
|
作者
Wang, Haoran [1 ]
Yu, Baosheng [2 ]
Li, Jiaqi [1 ]
Zhang, Linlin [1 ]
Chen, Dongyue [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Univ Sydney, Fac Engn, Sch Comp Sci, Darlington, NSW 2008, Australia
基金
澳大利亚研究理事会;
关键词
Skeleton; Proposals; Footwear; Deep learning; Image recognition; Fuses; Adaptation models; Temporal HOI analysis; dynamic object appearance; deep learning; human action recognition; ATTENTION;
D O I
10.1109/TCSVT.2021.3098839
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Skeleton-based human action recognition has received extensive attention due to its efficiency and robustness to complex backgrounds. Though the human skeleton can accurately capture the dynamics of human poses, it fails to recognize human actions induced by the interaction between human and objects, making it is of great importance to further explore the interaction between the human and objects for human action recognition. In this paper, we devise the multi-stream interaction networks (MSIN), to simultaneously explore the dynamics of human skeleton, objects, and the interaction between human and objects. Specifically, apart from the traditional human skeleton stream, 1) the second stream explores the dynamics of object appearance from the objects surrounding the human body joints; and 2) the third stream captures the dynamics of object position in regard to the distance between the object and different human body joints. Experimental results on three popular skeleton-based human action recognition datasets, NTU RGB + D, NTU RGB + D 120, and SYSU, demonstrate the effectiveness of the proposed method, especially for recognizing the human actions with human-object interactions.
引用
收藏
页码:3050 / 3060
页数:11
相关论文
共 50 条
  • [1] Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
    Liu, Xiao
    Yang, Xudong
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 251 - 262
  • [2] Multi-stream pose convolutional neural networks for human interaction recognition in images
    Tanisik, Gokhan
    Zalluhoglu, Cemil
    Ikizler-Cinbis, Nazli
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 95
  • [3] A Multi-Stream Sequence Learning Framework for Human Interaction Recognition
    Haroon, Umair
    Ullah, Amin
    Hussain, Tanveer
    Ullah, Waseem
    Sajjad, Muhammad
    Muhammad, Khan
    Lee, Mi Young
    Baik, Sung Wook
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (03) : 435 - 444
  • [4] Multi-stream network with key frame sampling for human action recognition
    Xia, Limin
    Wen, Xin
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (09): : 11958 - 11988
  • [5] An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
    Slade, Sam
    Zhang, Li
    Yu, Yonghong
    Lim, Chee Peng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 9205 - 9231
  • [6] An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
    Sam Slade
    Li Zhang
    Yonghong Yu
    Chee Peng Lim
    [J]. Neural Computing and Applications, 2022, 34 : 9205 - 9231
  • [7] A novel multi-stream hand-object interaction network for assembly action recognition
    Li, Shaochen
    Liu, Zhenyu
    Huang, Yu
    Liu, Daxin
    Duan, Guifang
    Tan, Jianrong
    [J]. ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, : 854 - 870
  • [8] Multi-stream Convolutional Networks for Indoor Scene Recognition
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Laaksonen, Jorma
    Zaki, Nazar
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 196 - 208
  • [9] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
    Sun, Ning
    Leng, Ling
    Liu, Jixin
    Han, Guang
    [J]. IMAGE AND VISION COMPUTING, 2021, 109
  • [10] Multi-stream mixed graph convolutional networks for skeleton-based action recognition
    Zhuang, Boyuan
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)