A 4D strong spatio-temporal feature learning network for behavior recognition of point cloud sequences

被引:0
|
作者
You, Kaijun [1 ]
Hou, Zhenjie [1 ]
Liang, Jiuzhen [1 ]
Lin, En [2 ]
Shi, Haiyong [1 ]
Zhong, Zhuokun [1 ]
机构
[1] Changzhou Univ, Engn Res Ctr Key Equipment Petrochem Proc, Changzhou 213100, Jiangsu, Peoples R China
[2] Goldcard Smart Grp Co Ltd, Hangzhou 310000, Zhejiang, Peoples R China
关键词
Point cloud sequence; Coordinate transformation; Spatio-temporal information; Feature enhancement;
D O I
10.1007/s11042-023-18045-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although the depth map sequence widely used in behavior recognition can provide depth information. However, depth pixels are not strongly correlated with each other, and the spatio-temporal structure information of behavior data is largely lost. Point cloud data can provide rich spatial information and geometric features, which make up for the lack of depth images. In order to further utilize the geometric information of behavior action and improve the utilization rate of spatio-temporal structure information, this paper proposed a 4D strong spatio-temporal feature learning network for behavior recognition of point cloud sequences. Coordinate transformation was performed on a depth dataset to generate a point cloud dataset, then our network processed each frame of point cloud data and learned 4D strong spatio-temporal features (three spatial and one temporal dimension). The network consists of two modules, a spatial-level feature learning module and a temporal-level position encoding module. In the spatial-level feature learning module, the spatial dimension of the point cloud is processed and learned. Each frame of point cloud data outputs a feature sequence through two progressive structure enhanced set abstract layers, which represents the strong spatial structure. Then, it becomes a complete spatial-level feature sequence through a maxing pooling operation. In the temporal-level position coding module, the processing and learning of the time dimension of the point cloud are performed. The time-series information is injected into the feature sequence through position coding and so on. Finally, the multi-level features of human actions are aggregated and classified. It was carried out on three public datasets. Extensive experiments showed that the network structure proposed in this paper outperformed the current state-of-the-art methods.
引用
收藏
页码:63193 / 63211
页数:19
相关论文
共 50 条
  • [1] Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos
    Fan, Hehe
    Yang, Yi
    Kankanhalli, Mohan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14199 - 14208
  • [2] Self-supervised 4D Spatio-temporal Feature Learning via Order Prediction of Sequential Point Cloud Clips
    Wang, Haiyan
    Yang, Liang
    Rong, Xuejian
    Feng, Jinglun
    Tian, Yingli
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3761 - 3770
  • [3] Robust Spatio-Temporal Registration of 4D Cardiac Ultrasound Sequences
    Bersveldsen, Jorn
    Toews, Matthew
    Danudibroto, Adriyana
    Wells, William M., III
    Urheim, Stig
    Estepar, Raul San Jose
    Samset, Eigil
    MEDICAL IMAGING 2016: ULTRASONIC IMAGING AND TOMOGRAPHY, 2016, 9790
  • [4] Action recognition method of spatio-temporal feature fusion deep learning network
    Pei, Xiaomin
    Fan, Huijie
    Tang, Yandong
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (02):
  • [5] Spatial and spatio-temporal feature extraction from 4D echocardiography images
    Awan, Ruqayya
    Rajpoot, Kashif
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 64 : 138 - 147
  • [6] Learning Temporal Variations for 4D Point Cloud Segmentation
    Shi, Hanyu
    Wei, Jiacheng
    Wang, Hao
    Liu, Fayao
    Lin, Guosheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 5603 - 5617
  • [7] Point Spatio-Temporal Pyramid Network for Point Cloud Video Understanding
    Shen, Zhiqiang
    Wang, Longguang
    Guo, Yulan
    Liu, Qiong
    Zhou, Xi
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 (1209-1213) : 1209 - 1213
  • [8] A Spatio-temporal Feature Learning Approach for Dynamic Scene Recognition
    Ullah, Ihsan
    Petrosino, Alfredo
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 591 - 598
  • [9] Collaborative Perception Based on Point Cloud Spatio-Temporal Feature Compensation Network for Intelligent Connected Vehicles
    Zhang, Mingfang
    Liu, Ying
    Ma, Jian
    He, Ye
    Wang, Li
    Qiche Gongcheng/Automotive Engineering, 2024, 46 (12): : 2279 - 2289
  • [10] Keypoints of Interest Based on Spatio-temporal Feature and MRF for Cloud Recognition System
    Suzuki, Takahiro
    Ikenaga, Takeshi
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,