Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction

Cited by: 2
Authors
Zhong, Jianqi [1]
Cao, Wenming [1]
Affiliations
[1] Shenzhen Univ, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
3D human motion prediction; Geometric algebra; Graph convolution networks; NEURAL-NETWORK;
DOI
10.1007/s10489-023-04908-7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
3D human motion prediction is an essential and challenging problem in computer vision that has attracted extensive research attention in the past decades. Many previous methods sought to predict the motion state of the next moment using traditional recurrent neural networks in Euclidean space. However, most of these methods did not explicitly exploit the relationships or constraints between different body components, which carry crucial information for motion prediction. In addition, human motion representation in Euclidean space has high distortion and shows weak semantic expression when used with deep learning models. Based on these observations, we propose a novel Geometric Algebra-based Multiscale Encoder-Decoder network (GAMEDnet) to predict future 3D poses. In the encoder, the core module is a novel Geometric Algebra-based multiscale feature extractor (GA-MFE), which extracts motion features from the multiscale human motion graph. In the decoder, we propose a novel GA-Graph-based Gated Recurrent Unit (GAG-GRU) to sequentially produce predictions. Extensive experiments show that the proposed GAMEDnet outperforms state-of-the-art methods in both short-term and long-term motion prediction on the Human 3.6M and CMU Mocap datasets.
Pages: 26967 - 26987
Page count: 21
Related papers
50 in total
  • [41] Encoder-decoder based convolutional neural networks for image forgery detection
    Fatima Zahra El Biach
    Imad Iala
    Hicham Laanaya
    Khalid Minaoui
    Multimedia Tools and Applications, 2022, 81 : 22611 - 22628
  • [42] CT IMAGE DENOISING WITH ENCODER-DECODER BASED GRAPH CONVOLUTIONAL NETWORKS
    Chen, Yu-Jen
    Tsai, Cheng-Yen
    Xu, Xiaowei
    Shi, Yiyu
    Ho, Tsung-Yi
    Huang, Meiping
    Yuan, Haiyun
    Zhuang, Jian
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 400 - 404
  • [43] Classification of Arrhythmia Based on Convolutional Neural Networks and Encoder-Decoder Model
    Liu, Jian
    Xia, Xiaodong
    Han, Chunyang
    Hui, Jiao
    Feng, Jim
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 265 - 278
  • [44] AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
    Kass, Dmitrijs
    Vats, Ekta
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 507 - 522
  • [45] Learning to Write Anywhere with Spatial Transformer Image-to-Motion Encoder-Decoder Networks
    Ridge, Barry
    Pahic, Rok
    Ude, Ales
    Morimoto, Jun
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2111 - 2117
  • [46] QMEDNet: A quaternion-based multi-order differential encoder–decoder model for 3D human motion prediction
    Cao, Wenming
    Li, Shuangshuang
    Zhong, Jianqi
    Neural Networks, 2022, 154 : 141 - 151
  • [47] PointAtrousGraph: Deep Hierarchical Encoder-Decoder with Point Atrous Convolution for Unorganized 3D Points
    Pan, Liang
    Chew, Chee-Meng
    Lee, Gim Hee
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1113 - 1120
  • [48] SISR of Hyperspectral Remote Sensing Imagery Using 3D Encoder-Decoder RUNet Architecture
    Aburaed, Nour
    Alkhatib, Mohammed Q.
    Marshall, Stephen
    Zabalza, Jaime
    Al Ahmad, Hussain
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1516 - 1519
  • [49] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation
    Wan, Ziniu
    Li, Zhengjia
    Tian, Maoqing
    Liu, Jianbo
    Yi, Shuai
    Li, Hongsheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13013 - 13022
  • [50] PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
    Zhu, Yiran
    Xu, Xing
    Shen, Fumin
    Ji, Yanli
    Gao, Lianli
    Shen, Heng Tao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1359 - 1365