Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction

被引:2
|
作者
Zhong, Jianqi [1 ]
Cao, Wenming [1 ]
机构
[1] Shenzhen Univ, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
3D human motion prediction; Geometric algebra; Graph convolution networks; NEURAL-NETWORK;
D O I
10.1007/s10489-023-04908-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human motion prediction is one of the essential and challenging problems in computer vision, which has attracted extensive research attention in the past decades. Many previous methods sought to predict the motion state of the next moment using the traditional recurrent neural network in Euclidean space. However, most methods did not explicitly exploit the relationships or constraints between different body components, which carry crucial information for motion prediction. In addition, human motion representation in Euclidean space has high distortion and shows a weak semantic expression when using deep learning models. Based on these observations, we propose a novel Geometric Algebra-based Multiscale Encoder-Decoder network (GAMEDnet) to predict the future 3D poses. In the encoder, the core module is a novel multiscale Geometric Algebra-based multiscale feature extractor(GA-MFE) , which extracts motion features given the multiscale human motion graph. In the decoder, we propose a novel GA-Graph-based Gated Recurrent Unit (GAG-GRU) to sequentially produce predictions. Extensive experiments are conducted to show that the proposed GAMEDnet outperforms state-of-the-art methods in both short and long-term motion prediction on the datasets of Human 3.6M, CMU Mocap.
引用
收藏
页码:26967 / 26987
页数:21
相关论文
共 50 条
  • [1] Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction
    Jianqi Zhong
    Wenming Cao
    Applied Intelligence, 2023, 53 : 26967 - 26987
  • [2] Geometric algebra-based multiview interaction networks for 3D human motion prediction
    Zhong, Jianqi
    Cao, Wenming
    PATTERN RECOGNITION, 2023, 138
  • [3] Convolutional Encoder-Decoder Networks for Robust Image-to-Motion Prediction
    Ridge, Barry
    Pahic, Rok
    Ude, Ales
    Morimoto, Jun
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 514 - 523
  • [4] Encoder-Decoder Architecture for 3D Seismic Inversion
    Gelboim, Maayan
    Adler, Amir
    Sun, Yen
    Araya-Polo, Mauricio
    SENSORS, 2023, 23 (01)
  • [5] A Hierarchical Static-Dynamic Encoder-Decoder Structure for 3D Human Motion Prediction with Residual CNNs
    Tang, Jin
    Liu, Jin
    Yin, JianQin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [6] QMEDNet: A quaternion-based multi-order differential encoder-decoder model for 3D human motion prediction
    Cao, Wenming
    Li, Shuangshuang
    Zhong, Jianqi
    NEURAL NETWORKS, 2022, 154 : 141 - 151
  • [7] Data Prediction Based Encoder-Decoder Learning in Wireless Sensor Networks
    Njoya, Arouna Ndam
    Tchangmena, Allassan A. Nken
    Ari, Ado Adamou Abba
    Gueroui, Abdelhak
    Thron, Christopher
    Mpinda, Berthine Nyunga
    Thiare, Ousmane
    Tonye, Emmanuel
    IEEE ACCESS, 2022, 10 : 109340 - 109356
  • [8] Multimodal 3D medical image registration guided by shape encoder-decoder networks
    Blendowski, Max
    Bouteldja, Nassim
    Heinrich, Mattias P.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (02) : 269 - 276
  • [9] 3D Skeleton-Based Non-Autoregressive Human Motion Prediction Using Encoder-Decoder Attention-Based Model
    Lovanshi, Mayank
    Tiwari, Vivek
    Ingle, Rajesh
    Jain, Swati
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 271 - 280
  • [10] Stacked residual blocks based encoder-decoder framework for human motion prediction
    Liu, Xiaoli
    Yin, Jianqin
    COGNITIVE COMPUTATION AND SYSTEMS, 2020, 2 (04) : 242 - 246