3D Pose Regression using Convolutional Neural Networks

被引:70
|
作者
Mahendran, Siddharth [1 ]
Ali, Haider [1 ]
Vidal, Rene [1 ]
机构
[1] Johns Hopkins Univ, Ctr Imaging Sci, Baltimore, MD 21218 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCVW.2017.254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D pose estimation is a key component of many important computer vision tasks such as autonomous navigation and 3D scene understanding. Most state-of-the-art approaches to 3D pose estimation solve this problem as a pose-classification problem in which the pose space is discretized into bins and a CNN classifier is used to predict a pose bin. We argue that the 3D pose space is continuous and propose to solve the pose estimation problem in a CNN regression framework with a suitable representation, data augmentation and loss function that captures the geometry of the pose space. Experiments on PASCAL3D+ show that the proposed 3D pose regression approach achieves competitive performance compared to the state-of-the-art.
引用
下载
收藏
页码:2174 / 2182
页数:9
相关论文
共 50 条
  • [31] Efficient Violence Detection Using 3D Convolutional Neural Networks
    Li, Ji
    Jiang, Xinghao
    Sun, Tanfeng
    Xu, Ke
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [32] Lung Cancer Detection using 3D Convolutional Neural Networks
    Pradhan, Adarsh
    Sarma, Bhaskarjyothi
    Dey, Bhiman Kr
    2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 765 - 770
  • [33] Violence Detection in Video by Using 3D Convolutional Neural Networks
    Ding, Chunhui
    Fan, Shouke
    Zhu, Ming
    Feng, Weiguo
    Jia, Baozhi
    ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT II, 2014, 8888 : 551 - 558
  • [34] SIGN LANGUAGE RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Jie
    Zhou, Wengang
    Li, Houqiang
    Li, Weiping
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [35] Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network
    Elbasiony, Reda
    Gomaa, Walid
    Ogata, Tetsuya
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 310 - 320
  • [36] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [37] Disparity Filtering with 3D Convolutional Neural Networks
    Mao, Wendong
    Gong, Minglun
    2018 15TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2018, : 246 - 253
  • [38] 3D CONVOLUTIONAL NEURAL NETWORKS BY MODAL FUSION
    Yoshiyasu, Yusuke
    Yoshida, Eiichi
    Pirk, Soeren
    Guibas, Leonidas
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1777 - 1781
  • [39] TOWARDS 3D CONVOLUTIONAL NEURAL NETWORKS WITH MESHES
    Dominguez, Miguel
    Such, Felipe Petroski
    Sah, Shagan
    Ptucha, Raymond
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3929 - 3933
  • [40] 3D GESTURE CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Duffner, Stefan
    Berlemont, Samuel
    Lefebvre, Gregoire
    Garcia, Christophe
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,