Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction

被引:2
|
作者
Zhong, Jianqi [1 ]
Cao, Wenming [1 ]
机构
[1] Shenzhen Univ, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
3D human motion prediction; Geometric algebra; Graph convolution networks; NEURAL-NETWORK;
D O I
10.1007/s10489-023-04908-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human motion prediction is one of the essential and challenging problems in computer vision, which has attracted extensive research attention in the past decades. Many previous methods sought to predict the motion state of the next moment using the traditional recurrent neural network in Euclidean space. However, most methods did not explicitly exploit the relationships or constraints between different body components, which carry crucial information for motion prediction. In addition, human motion representation in Euclidean space has high distortion and shows a weak semantic expression when using deep learning models. Based on these observations, we propose a novel Geometric Algebra-based Multiscale Encoder-Decoder network (GAMEDnet) to predict the future 3D poses. In the encoder, the core module is a novel multiscale Geometric Algebra-based multiscale feature extractor(GA-MFE) , which extracts motion features given the multiscale human motion graph. In the decoder, we propose a novel GA-Graph-based Gated Recurrent Unit (GAG-GRU) to sequentially produce predictions. Extensive experiments are conducted to show that the proposed GAMEDnet outperforms state-of-the-art methods in both short and long-term motion prediction on the datasets of Human 3.6M, CMU Mocap.
引用
收藏
页码:26967 / 26987
页数:21
相关论文
共 50 条
  • [31] CED-Net: contextual encoder-decoder network for 3D face reconstruction
    Zhu, Lei
    Wang, Shanmin
    Zhao, Zengqun
    Xu, Xiang
    Liu, Qingshan
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1713 - 1722
  • [32] HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds
    Zhang, Gang
    Chen, Junnan
    Gao, Guohuan
    Li, Jianmin
    Hu, Xiaolin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK FOR 3D FACE RECONSTRUCTION
    Li, Xianfeng
    Weng, Zichun
    Liang, Juntao
    Cai, Lei
    Xiang, Youjun
    Fu, Yuli
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3682 - 3686
  • [34] Uncertainty-Aware Recurrent Encoder-Decoder Networks for Vessel Trajectory Prediction
    Capobianco, Samuele
    Forti, Nicola
    Millefiori, Leonardo M.
    Braca, Paolo
    Willett, Peter
    2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021, : 117 - 121
  • [35] ProLanGO2: Protein Function Prediction with Ensemble of Encoder-Decoder Networks
    Hippe, Kyle
    Gbenro, Sola
    Cao, Renzhi
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [36] Deep Encoder-Decoder Neural Networks for Retinal Blood Vessels Dense Prediction
    Zhang, Wenlu
    Li, Lusi
    Cheong, Vincent
    Fu, Bo
    Aliasgari, Mehrdad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 1078 - 1086
  • [37] EDChannel: channel prediction of backscatter communication network based on encoder-decoder
    Dengao Li
    Yongxin Wen
    Shuang Xu
    Qiang Wang
    Ruiqin Bai
    Jumin Zhao
    Telecommunication Systems, 2022, 81 : 99 - 114
  • [38] Multi-task prediction model based on ConvLSTM and encoder-decoder
    Luo, Tao
    Cao, Xudong
    Li, Jin
    Dong, Kun
    Zhang, Rui
    Wei, Xueliang
    INTELLIGENT DATA ANALYSIS, 2021, 25 (02) : 359 - 382
  • [39] EDChannel: channel prediction of backscatter communication network based on encoder-decoder
    Li, Dengao
    Wen, Yongxin
    Xu, Shuang
    Wang, Qiang
    Bai, Ruiqin
    Zhao, Jumin
    TELECOMMUNICATION SYSTEMS, 2022, 81 (01) : 99 - 114
  • [40] Encoder-decoder based convolutional neural networks for image forgery detection
    El Biach, Fatima Zahra
    Iala, Imad
    Laanaya, Hicham
    Minaoui, Khalid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (16) : 22611 - 22628