Multi-Task Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising

被引:0
|
作者
Zhou, Kanglei [1 ]
Shum, Hubert P. H. [2 ]
Li, Frederick W. B. [2 ]
Liang, Xiaohui [1 ,3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
[3] Zhongguancun Lab, Beijing 100081, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Graph convolutional network; hand motion denoising; hand motion prediction; multi-task learning; GENERATIVE ADVERSARIAL NETWORK;
D O I
10.1109/TVCG.2023.3337868
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In many human-computer interaction applications, fast and accurate hand tracking is necessary for an immersive experience. However, raw hand motion data can be flawed due to issues such as joint occlusions and high-frequency noise, hindering the interaction. Using only current motion for interaction can lead to lag, so predicting future movement is crucial for a faster response. Our solution is the Multi-task Spatial-Temporal Graph Auto-Encoder (Multi-STGAE), a model that accurately denoises and predicts hand motion by exploiting the inter-dependency of both tasks. The model ensures a stable and accurate prediction through denoising while maintaining motion dynamics to avoid over-smoothed motion and alleviate time delays through prediction. A gate mechanism is integrated to prevent negative transfer between tasks and further boost multi-task performance. Multi-STGAE also includes a spatial-temporal graph autoencoder block, which models hand structures and motion coherence through graph convolutional networks, reducing noise while preserving hand physiology. Additionally, we design a novel hand partition strategy and hand bone loss to improve natural hand motion generation. We validate the effectiveness of our proposed method by contributing two large-scale datasets with a data corruption algorithm based on two benchmark datasets. To evaluate the natural characteristics of the denoised and predicted hand motion, we propose two structural metrics. Experimental results show that our method outperforms the state-of-the-art, showcasing how the multi-task framework enables mutual benefits between denoising and prediction.
引用
收藏
页码:6754 / 6769
页数:16
相关论文
共 50 条
  • [31] Denoising Protein-Protein interaction network via variational graph auto-encoder for protein complex detection
    Yao, Heng
    Guan, Jihong
    Liu, Tianying
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2020, 18 (03)
  • [32] Quantitative Stock Selection Model Using Graph Learning and a Spatial-Temporal Encoder
    Cao, Tianyi
    Wan, Xinrui
    Wang, Huanhuan
    Yu, Xin
    Xu, Libo
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2024, 19 (03): : 1756 - 1775
  • [33] Walking Imagery Evaluation Based on Multi-view Features and Stacked Denoising Auto-encoder Network
    Liang, Enmin
    Elazab, Ahmed
    Liang, Shuang
    Wang, Qiong
    Wang, Tianfu
    Lei, Baiying
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1896 - 1899
  • [34] Multi-Stage Hybrid Planning Method for Charging Stations Based on Graph Auto-Encoder
    Wu, Andrew Y.
    Wu, Juai
    Lau, Yui-yip
    ELECTRONICS, 2025, 14 (01):
  • [35] Spatial-Temporal Graph Neural Network based Hand Gesture Recognition
    Yuan G.
    Bing R.
    Liu X.
    Dai W.
    Zhang Y.-M.
    Cai Z.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (04): : 921 - 931
  • [36] BiGATAE: a bipartite graph attention auto-encoder enhancing spatial domain identification from single-slice to multi-slices
    Tao, Yuhao
    Sun, Xiaoang
    Wang, Fei
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [37] Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search
    Wang, Chunnan
    Zhang, Kaixin
    Wang, Hongzhi
    Chen, Bozhou
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (05)
  • [38] Mining Spatial-Temporal Patterns and Structural Sparsity for Human Motion Data Denoising
    Feng, Yinfu
    Ji, Mingming
    Xiao, Jun
    Yang, Xiaosong
    Zhang, Jian J.
    Zhuang, Yueting
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (12) : 2693 - 2706
  • [39] Association prediction of CircRNAs and diseases using multi-homogeneous graphs and variational graph auto-encoder
    Fu, Yao
    Yang, Runtao
    Zhang, Lina
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [40] Prediction of Synthetic Lethal Interactions in Human Cancers Using Multi-View Graph Auto-Encoder
    Hao, Zhifeng
    Wu, Di
    Fang, Yuan
    Wu, Min
    Cai, Ruichu
    Li, Xiaoli
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (10) : 4041 - 4051