Human action recognition based on scene semantics

被引:5
|
作者
Hu, Tao [1 ,2 ,3 ,4 ]
Zhu, Xinyan [1 ,2 ]
Guo, Wei [1 ,2 ]
Wang, Shaohua [5 ]
Zhu, Jianfeng [6 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430070, Peoples R China
[2] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430070, Peoples R China
[3] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 430070, Peoples R China
[4] Kent State Univ, Sch Informat, Kent, OH 44240 USA
[5] Wuhan Univ, Int Sch Software, Wuhan 430070, Peoples R China
[6] Kent State Univ, Sch Digital Sci, Kent, OH 44240 USA
基金
中国国家自然科学基金;
关键词
Action recognition; Depth sensor; Semantic context; Trajectory clustering;
D O I
10.1007/s11042-017-5496-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Like outdoors, indoor security is also a critical problem and human action recognition in indoor area is still a hot topic. Most studies on human action recognition ignored the semantic information of a scene, whereas indoors contains varieties of semantics. Meanwhile, the depth sensor with color and depth data is more suitable for extracting the semantics context in human actions. Hence, this paper proposed an indoor action recognition method using Kinect based on the semantics of a scene. First, we proposed a trajectory clustering algorithm for a three-dimensional (3D) scene by combining the different characteristics of people such as the spatial location, movement direction, and speed. Based on the clustering results and scene context, it concludes a region of interest (ROI) extraction method for indoors, and dynamic time warping (DTW) is used to study the abnormal action sequences. Finally, the color and depth-data-based 3D motion history image (3D-MHI) features and the semantics context of the scene were combined to recognize human action. In the experiment, two datasets were tested and the results demonstrate that our semantics-based method performs better than other methods.
引用
收藏
页码:28515 / 28536
页数:22
相关论文
共 50 条
  • [1] Human action recognition based on scene semantics
    Tao Hu
    Xinyan Zhu
    Wei Guo
    Shaohua Wang
    Jianfeng Zhu
    [J]. Multimedia Tools and Applications, 2019, 78 : 28515 - 28536
  • [2] SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition
    Zhang, Xing
    Wu, Zuxuan
    Jiang, Yu-Gang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 313 - 322
  • [3] Action-Scene Model for Human Action Recognition from Videos*
    Zhang, Yifei
    Qu, Wen
    Wang, Daling
    [J]. 2ND AASRI CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, 2014, 6 : 111 - 117
  • [4] Scene image and human skeleton-based dual-stream human action recognition☆
    Xu, Qingyang
    Zheng, Wanqiang
    Song, Yong
    Zhang, Chengjin
    Yuan, Xianfeng
    Li, Yibin
    [J]. PATTERN RECOGNITION LETTERS, 2021, 148 : 136 - 145
  • [5] Probability-based method for boosting human action recognition using scene context
    Zhang, Hong-Bo
    Lei, Qing
    Chen, Duan-Sheng
    Zhong, Bi-Neng
    Peng, Jialin
    Du, Ji-Xiang
    Su, Song-Zhi
    [J]. IET COMPUTER VISION, 2016, 10 (06) : 528 - 536
  • [6] Research on Image Scene Semantics Recognition System
    Yin, Lijie
    Zhen, Ya
    Fan, Qili
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING II, PTS 1-3, 2013, 433-435 : 330 - +
  • [7] Automatic human action recognition in a scene from visual inputs
    Bouma, Henri
    Hanckmann, Patrick
    Marck, Jan-Willem
    Penning, Leo
    den Hollander, Richard
    ten Hove, Johan-Martijn
    van den Broek, Sebastiaan
    Schutte, Klamer
    Burghouts, Gertjan
    [J]. UNATTENDED GROUND, SEA, AND AIR SENSOR TECHNOLOGIES AND APPLICATIONS XIV, 2012, 8388
  • [8] Recognition and action for scene understanding
    Marfil, Rebeca
    Dias, Jorge
    Escolano, Francisco
    [J]. NEUROCOMPUTING, 2015, 161 : 1 - 2
  • [9] Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
    Zhang, Pengfei
    Lan, Cuiling
    Zeng, Wenjun
    Xing, Junliang
    Xue, Jianru
    Zheng, Nanning
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1109 - 1118
  • [10] RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
    Guo, Yuhui
    Liang, Xun
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 127 - 134