Multi-Task Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising

被引:0
|
作者
Zhou, Kanglei [1 ]
Shum, Hubert P. H. [2 ]
Li, Frederick W. B. [2 ]
Liang, Xiaohui [1 ,3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
[3] Zhongguancun Lab, Beijing 100081, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Graph convolutional network; hand motion denoising; hand motion prediction; multi-task learning; GENERATIVE ADVERSARIAL NETWORK;
D O I
10.1109/TVCG.2023.3337868
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In many human-computer interaction applications, fast and accurate hand tracking is necessary for an immersive experience. However, raw hand motion data can be flawed due to issues such as joint occlusions and high-frequency noise, hindering the interaction. Using only current motion for interaction can lead to lag, so predicting future movement is crucial for a faster response. Our solution is the Multi-task Spatial-Temporal Graph Auto-Encoder (Multi-STGAE), a model that accurately denoises and predicts hand motion by exploiting the inter-dependency of both tasks. The model ensures a stable and accurate prediction through denoising while maintaining motion dynamics to avoid over-smoothed motion and alleviate time delays through prediction. A gate mechanism is integrated to prevent negative transfer between tasks and further boost multi-task performance. Multi-STGAE also includes a spatial-temporal graph autoencoder block, which models hand structures and motion coherence through graph convolutional networks, reducing noise while preserving hand physiology. Additionally, we design a novel hand partition strategy and hand bone loss to improve natural hand motion generation. We validate the effectiveness of our proposed method by contributing two large-scale datasets with a data corruption algorithm based on two benchmark datasets. To evaluate the natural characteristics of the denoised and predicted hand motion, we propose two structural metrics. Experimental results show that our method outperforms the state-of-the-art, showcasing how the multi-task framework enables mutual benefits between denoising and prediction.
引用
收藏
页码:6754 / 6769
页数:16
相关论文
共 50 条
  • [21] Multi-Object Tracking with Spatial-Temporal Embedding Perception and Multi-Task Synergistic Optimization
    Liang, Xiaoguo
    Li, Hui
    Cheng, Yuanzhi
    Chen, Shuangmin
    Liu, Hengyuan
    Computer Engineering and Applications, 2024, 60 (06) : 282 - 292
  • [22] Human action recognition via multi-task learning base on spatial-temporal feature
    Guo, Wenzhong
    Chen, Guolong
    INFORMATION SCIENCES, 2015, 320 : 418 - 428
  • [23] Integrated Prediction of Regional Traffic Situation Based on Multi-Task Spatial-Temporal Network
    Yu, Jiaao
    Zhang, Kangshuai
    Peng, Lei
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [24] Spatial-Temporal Multi-Task Learning for Within-Field Cotton Yield Prediction
    Nguyen, Long H.
    Zhu, Jiazhen
    Lin, Zhe
    Du, Hanxiang
    Yang, Zhou
    Guo, Wenxuan
    Jin, Fang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 343 - 354
  • [25] Cluster-guided denoising graph auto-encoder for enhanced traffic data imputation and fault detection
    Huang, Yongcan
    Zhen, Hao
    Yang, Jidong J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 261
  • [26] SGCAST: symmetric graph convolutional auto-encoder for scalable and accurate study of spatial transcriptomics
    Li, Jinzhao
    Wang, Jiong
    Lin, Zhixiang
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (01)
  • [27] Analysis of spatial-temporal dynamics of cool flame oscillation phenomenon occurred around a fuel droplet array by using variational auto-encoder
    Iemura, Kazuki
    Saito, Masanori
    Suganuma, Yusuke
    Kikuchi, Masao
    Inatomi, Yuko
    Nomura, Hiroshi
    Tanabe, Mitsuaki
    PROCEEDINGS OF THE COMBUSTION INSTITUTE, 2023, 39 (02) : 2523 - 2532
  • [28] Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder
    Kangning Dong
    Shihua Zhang
    Nature Communications, 13
  • [29] Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder
    Dong, Kangning
    Zhang, Shihua
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [30] STGNNks: Identifying cell types in spatial transcriptomics data based on graph neural network, denoising auto-encoder, and k-sums clustering
    Peng, Lihong
    He, Xianzhi
    Peng, Xinhuai
    Li, Zejun
    Zhang, Li
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166