Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos

被引:77
|
作者
Pfister, Tomas [1 ]
Simonyan, Karen [1 ]
Charles, James [2 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford OX1 3PJ, England
[2] Univ Leeds, Sch Comp, Comp Vis Grp, Leeds, W Yorkshire, England
来源
关键词
D O I
10.1007/978-3-319-16865-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our objective is to efficiently and accurately estimate the upper body pose of humans in gesture videos. To this end, we build on the recent successful applications of deep convolutional neural networks (ConvNets). Our novelties are: (i) our method is the first to our knowledge to use ConvNets for estimating human pose in videos; (ii) a new network that exploits temporal information from multiple frames, leading to better performance; (iii) showing that pre-segmenting the foreground of the video improves performance; and (iv) demonstrating that even without foreground segmentations, the network learns to abstract away from the background and can estimate the pose even in the presence of a complex, varying background. We evaluate our method on the BBC TV Signing dataset and show that our pose predictions are significantly better, and an order of magnitude faster to compute, than the state of the art [3].
引用
收藏
页码:538 / 552
页数:15
相关论文
共 50 条
  • [1] Age Estimation of Faces in Videos Using Head Pose Estimation and Convolutional Neural Networks
    Zhang, Beichen
    Bao, Yue
    [J]. SENSORS, 2022, 22 (11)
  • [2] Ensemble convolutional neural networks for pose estimation
    Kawana, Yuki
    Ukita, Norimichi
    Huang, Jia-Bin
    Yang, Ming-Hsuan
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 169 : 62 - 74
  • [3] Vehicle Pose Estimation in WAMI Imagery via Deep Convolutional Neural Networks
    Yi, Meng
    Wang, Dong
    Yang, Fan
    Xu, Jonathan
    Cai, Yiran
    Blasch, Erik
    Sheaff, Carolyn
    Chen, Genshe
    Ling, Haibin
    [J]. PROCEEDINGS OF THE 2016 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON) AND OHIO INNOVATION SUMMIT (OIS), 2016, : 233 - 240
  • [4] Deep Refinement Convolutional Networks for Human Pose Estimation
    Marras, Ioannis
    Palasek, Petar
    Patras, Ioannis
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 446 - 453
  • [5] Human Pose Estimation Using Convolutional Neural Networks
    Singh, Anubhav
    Agarwal, Shruti
    Nagrath, Preeti
    Saxena, Anmol
    Thakur, Narina
    [J]. PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 946 - 952
  • [6] Driver pose estimation using convolutional neural networks
    Chen, Ren-Wen
    Yuan, Ting-Ting
    Huang, Wen-Bin
    Zhang, Yu-Xiang
    [J]. Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (04): : 813 - 821
  • [7] Hand Gesture Recognition Using Deep Convolutional Neural Networks
    Strezoski, Gjorgji
    Stojanovski, Dario
    Dimitrovski, Ivica
    Madjarov, Gjorgji
    [J]. ICT INNOVATIONS 2016: COGNITIVE FUNCTIONS AND NEXT GENERATION ICT SYSTEMS, 2018, 665 : 49 - 58
  • [8] Road pollution estimation from vehicle tracking in surveillance videos by deep convolutional neural networks
    Garcia-Gonzalez, Jorge
    Molina-Cabello, Miguel A.
    Luque-Baena, Rafael M.
    Ortiz-de-Lazcano-Lobato, Juan M.
    Lopez-Rubio, Ezequiel
    [J]. APPLIED SOFT COMPUTING, 2021, 113
  • [9] Pose Aided Deep Convolutional Neural Networks for Face Alignment
    Liu, Shuying
    Hu, Jiani
    Deng, Weihong
    [J]. BIOMETRIC RECOGNITION, 2016, 9967 : 59 - 67
  • [10] Benchmarking Convolutional Neural Networks for Object Segmentation and Pose Estimation
    Le, Tiffany
    Hamilton, Lei
    Torralba, Antonio
    [J]. 2017 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2017,