Realistic Depth Image Synthesis for 3D Hand Pose Estimation

被引:0
|
作者
Zhou, Jun [1 ,2 ,3 ]
Xu, Chi [1 ,2 ,3 ]
Ge, Yuting [1 ,2 ,3 ]
Cheng, Li [4 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat Co, Wuhan 430074, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China
[4] Univ Alberta, Dept Elect & Comp Engn, Vis & Learning Lab, Edmonton, AB T6G 2R3, Canada
基金
中国国家自然科学基金;
关键词
Depth noise modeling; 3D hand pose estimation; realistic depth synthesis;
D O I
10.1109/TMM.2023.3330522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The training of depth image-based hand pose estimation model typically relies on real-life datasets which are expected to be 1) largescale and cover a diverse range of hand poses and hand shapes, and 2) always come with high-precision annotations. However, existing datasets in reality are rather limited in the above regards due to multitude practical constraints, with time and cost being the major concerns. This observation motivates us to propose an alternative approach, where hand pose model is primarily trained with synthesized hand depth images that closely mimicking the characteristic noise patterns of a specific depth camera make under consideration. It is achieved by firstly mapping a Gaussian distributed variable to certain specific non-i.i.d. (independent and identically distributed) depth noise pattern, and then transforming a "vanilla" noise-free synthetic depth image to a realistic-looking image. Extensive empirical experiments demonstrate that our approach is capable of generating camera-specific realistic-looking hand depth images with precise annotations; comparing to entirely relying on annotated real images, a hand pose model with better performance is obtained by using only a small fraction (10%) of annotated real images as well as our synthesized images.
引用
收藏
页码:5246 / 5256
页数:11
相关论文
共 50 条
  • [31] Estimating 3D hand pose from a cluttered image
    Athitsos, V
    Sclaroff, S
    [J]. 2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2003, : 432 - 439
  • [32] A Survey on Depth Ambiguity of 3D Human Pose Estimation
    Zhang, Siqi
    Wang, Chaofang
    Dong, Wenlong
    Fan, Bin
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [33] Aligning Latent Spaces for 3D Hand Pose Estimation
    Yang, Linlin
    Li, Shile
    Lee, Dongheui
    Yao, Angela
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2335 - 2343
  • [34] Ordinal Depth Supervision for 3D Human Pose Estimation
    Pavlakos, Georgios
    Zhou, Xiaowei
    Daniilidis, Kostas
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7307 - 7316
  • [35] PEAN: 3D Hand Pose Estimation Adversarial Network
    Sun, Linhui
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1251 - 1258
  • [36] Residual Attention Regression for 3D Hand Pose Estimation
    Li, Jing
    Zhang, Long
    Ju, Zhaojie
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 605 - 614
  • [37] CASCADED POINT NETWORK FOR 3D HAND POSE ESTIMATION
    Dou, Yikun
    Wang, Xuguang
    Zhu, Yuying
    Deng, Xiaoming
    Ma, Cuixia
    Chang, Liang
    Wang, Hongan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1982 - 1986
  • [38] Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation
    Lee, Seongyeong
    Park, Hansoo
    Kim, Dong Uk
    Kim, Jihyeon
    Boboev, Muhammadjon
    Baek, Seungryul
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2933 - 2943
  • [39] 3D Hand Pose Estimation on Conventional Capacitive Touchscreens
    Choi, Frederick
    Mayer, Sven
    Harrison, Chris
    [J]. PROCEEDINGS OF 23RD ACM INTERNATIONAL CONFERENCE ON MOBILE HUMAN-COMPUTER INTERACTION (MOBILEHCI 2021): MOBILE APART, MOBILE TOGETHER, 2021,
  • [40] Database indexing methods for 3D hand pose estimation
    Athitsos, V
    Sclaroff, S
    [J]. GESTURE-BASED COMMUNICATION IN HUMAN-COMPUTER INTERACTION, 2003, 2915 : 288 - 299