Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network

被引:43
|
作者
Ahn, Byungtae [1 ]
Park, Jaesik [1 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
来源
关键词
D O I
10.1007/978-3-319-16811-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an efficient and accurate head orientation estimation algorithm using a monocular camera. Our approach is leveraged by deep neural network and we exploit the architecture in a data regression manner to learn the mapping function between visual appearance and three dimensional head orientation angles. Therefore, in contrast to classification based approaches, our system outputs continuous head orientation. The algorithm uses convolutional filters trained with a large number of augmented head appearances, thus it is user independent and covers large pose variations. Our key observation is that an input image having 32 x 32 resolution is enough to achieve about 3 degrees of mean square error, which can be used for efficient head orientation applications. Therefore, our architecture takes only 1ms on roughly localized head positions with the aid of GPU. We also propose particle filter based post-processing to enhance stability of the estimation further in video sequences. We compare the performance with the state-of-the-art algorithm which utilizes depth sensor and we validate our head orientation estimator on Internet photos and video.
引用
收藏
页码:82 / 96
页数:15
相关论文
共 50 条
  • [1] Real-time head orientation estimation using neural networks
    Zhao, L
    Pingali, G
    Carlbom, I
    [J]. 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 297 - 300
  • [2] Real-Time Detection for Wheat Head Applying Deep Neural Network
    Gong, Bo
    Ergu, Daji
    Cai, Ying
    Ma, Bo
    [J]. SENSORS, 2021, 21 (01) : 1 - 13
  • [3] Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network
    Yang, Xin
    Chen, Jingyu
    Wang, Zhiwei
    Zhang, Qiaozhe
    Liu, Wenyu
    Liao, Chunyuan
    Cheng, Kwang-Ting
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 896 - 904
  • [4] Real-time head pose estimation using multi-task deep neural network
    Ahn, Byungtae
    Choi, Dong-Geol
    Park, Jaesik
    Kweon, In So
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 103 : 1 - 12
  • [5] Real-Time Depth Estimation from a Monocular Moving Camera
    Handa, Aniket
    Sharma, Prateek
    [J]. CONTEMPORARY COMPUTING, 2012, 306 : 494 - 495
  • [6] Real-time fish animation generation by monocular camera
    Meng, Xiangfei
    Pan, Junjun
    Qin, Hong
    Ge, Pu
    [J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 55 - 65
  • [7] Real-Time Anomaly Detection and Classification from Surveillance Cameras using Deep Neural Network
    Rahman, Md Mijanur
    Afrin, Mst Sadia
    Atikuzzaman, Md
    Rahaman, Muhammad Aminur
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
  • [8] Real-time obstacle detection in a darkroom using a monocular camera and a line laser
    Sota Akamine
    Shingo Totoki
    Taku Itami
    Jun Yoneyama
    [J]. Artificial Life and Robotics, 2022, 27 : 828 - 833
  • [9] Real-time obstacle detection in a darkroom using a monocular camera and a line laser
    Akamine, Sota
    Totoki, Shingo
    Itami, Taku
    Yoneyama, Jun
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2022, 27 (04) : 828 - 833
  • [10] Real-Time Facial Expression Recognition Using Deep Convolutional Neural Network
    Zeng, Yuwen
    Xiao, Nan
    Wang, Kaidi
    Yuan, Hang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 1536 - 1541