Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction

被引:2
|
作者
Zhong, Jianqi [1 ]
Cao, Wenming [1 ]
机构
[1] Shenzhen Univ, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
3D human motion prediction; Geometric algebra; Graph convolution networks; NEURAL-NETWORK;
D O I
10.1007/s10489-023-04908-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human motion prediction is one of the essential and challenging problems in computer vision, which has attracted extensive research attention in the past decades. Many previous methods sought to predict the motion state of the next moment using the traditional recurrent neural network in Euclidean space. However, most methods did not explicitly exploit the relationships or constraints between different body components, which carry crucial information for motion prediction. In addition, human motion representation in Euclidean space has high distortion and shows a weak semantic expression when using deep learning models. Based on these observations, we propose a novel Geometric Algebra-based Multiscale Encoder-Decoder network (GAMEDnet) to predict the future 3D poses. In the encoder, the core module is a novel multiscale Geometric Algebra-based multiscale feature extractor(GA-MFE) , which extracts motion features given the multiscale human motion graph. In the decoder, we propose a novel GA-Graph-based Gated Recurrent Unit (GAG-GRU) to sequentially produce predictions. Extensive experiments are conducted to show that the proposed GAMEDnet outperforms state-of-the-art methods in both short and long-term motion prediction on the datasets of Human 3.6M, CMU Mocap.
引用
收藏
页码:26967 / 26987
页数:21
相关论文
共 50 条
  • [21] Attention-based encoder-decoder networks for workflow recognition
    Zhang, Min
    Hu, Haiyang
    Li, Zhongjin
    Chen, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 34973 - 34995
  • [22] Video Summarization With Attention-Based Encoder-Decoder Networks
    Ji, Zhong
    Xiong, Kailin
    Pang, Yanwei
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1709 - 1717
  • [23] Attention-based encoder-decoder networks for workflow recognition
    Min Zhang
    Haiyang Hu
    Zhongjin Li
    Jie Chen
    Multimedia Tools and Applications, 2021, 80 : 34973 - 34995
  • [24] PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks
    Tai, Xue-Cheng
    Liu, Hao
    Chan, Raymond
    SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (01): : 540 - 594
  • [25] DeepSEED: 3D Squeeze-and-Excitation Encoder-Decoder Convolutional Neural Networks for Pulmonary Nodule Detection
    Li, Yuemeng
    Fan, Yong
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1866 - 1869
  • [26] PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction
    Wang, Hongsong
    Dong, Jian
    Cheng, Bin
    Feng, Jiashi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6096 - 6106
  • [27] Multi-objective evolutionary 3D face reconstruction based on improved encoder-decoder network
    Cai, Xingjuan
    Cao, Yihao
    Ren, Yeqing
    Cui, Zhihua
    Zhang, Wensheng
    INFORMATION SCIENCES, 2021, 581 : 233 - 248
  • [28] Automatic 3D Landmark Extraction System Based on an Encoder-Decoder Using Fusion of Vision and LiDAR
    Kwak, Jeonghoon
    Sung, Yunsick
    REMOTE SENSING, 2020, 12 (07)
  • [29] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
    Chung, Yi-Hao
    Chen, Yen-Lin
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,
  • [30] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
    Chung, Yi-Hao
    Chen, Yen-Lin
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,