Geometric algebra-based multiscale encoder-decoder networks for 3D motion prediction

被引：2

作者：

Zhong, Jianqi ^{[1
]}

Cao, Wenming ^{[1
]}

机构：

[1] Shenzhen Univ, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 22期

基金：

中国国家自然科学基金;

关键词：

3D human motion prediction; Geometric algebra; Graph convolution networks; NEURAL-NETWORK;

D O I：

10.1007/s10489-023-04908-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D human motion prediction is one of the essential and challenging problems in computer vision, which has attracted extensive research attention in the past decades. Many previous methods sought to predict the motion state of the next moment using the traditional recurrent neural network in Euclidean space. However, most methods did not explicitly exploit the relationships or constraints between different body components, which carry crucial information for motion prediction. In addition, human motion representation in Euclidean space has high distortion and shows a weak semantic expression when using deep learning models. Based on these observations, we propose a novel Geometric Algebra-based Multiscale Encoder-Decoder network (GAMEDnet) to predict the future 3D poses. In the encoder, the core module is a novel multiscale Geometric Algebra-based multiscale feature extractor(GA-MFE) , which extracts motion features given the multiscale human motion graph. In the decoder, we propose a novel GA-Graph-based Gated Recurrent Unit (GAG-GRU) to sequentially produce predictions. Extensive experiments are conducted to show that the proposed GAMEDnet outperforms state-of-the-art methods in both short and long-term motion prediction on the datasets of Human 3.6M, CMU Mocap.

引用

页码：26967 / 26987

页数：21

共 50 条

[21] Attention-based encoder-decoder networks for workflow recognition
Zhang, Min
Hu, Haiyang
Li, Zhongjin
Chen, Jie
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 34973 - 34995
[22] Video Summarization With Attention-Based Encoder-Decoder Networks
Ji, Zhong
Xiong, Kailin
Pang, Yanwei
Li, Xuelong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1709 - 1717
[23] Attention-based encoder-decoder networks for workflow recognition
Min Zhang
Haiyang Hu
Zhongjin Li
Jie Chen
Multimedia Tools and Applications, 2021, 80 : 34973 - 34995
[24] PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks
Tai, Xue-Cheng
Liu, Hao
Chan, Raymond
SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (01): : 540 - 594
[25] DeepSEED: 3D Squeeze-and-Excitation Encoder-Decoder Convolutional Neural Networks for Pulmonary Nodule Detection
Li, Yuemeng
Fan, Yong
2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1866 - 1869
[26] PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction
Wang, Hongsong
Dong, Jian
Cheng, Bin
Feng, Jiashi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6096 - 6106
[27] Multi-objective evolutionary 3D face reconstruction based on improved encoder-decoder network
Cai, Xingjuan
Cao, Yihao
Ren, Yeqing
Cui, Zhihua
Zhang, Wensheng
INFORMATION SCIENCES, 2021, 581 : 233 - 248
[28] Automatic 3D Landmark Extraction System Based on an Encoder-Decoder Using Fusion of Vision and LiDAR
Kwak, Jeonghoon
Sung, Yunsick
REMOTE SENSING, 2020, 12 (07)
[29] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
Chung, Yi-Hao
Chen, Yen-Lin
IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,
[30] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
Chung, Yi-Hao
Chen, Yen-Lin
IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,

← 1 2 3 4 5 →