LPHD: A LARGE-SCALE HEAD POSE DATASET FOR RGB IMAGES

被引:1
|
作者
Sun, Wei [1 ]
Fan, Yezhao [1 ]
Min, Xiongkuo [1 ]
Peng, Shihao [1 ]
Ma, Siwei [2 ]
Zhai, Guangtao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commu & Infor Proce, Shanghai, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
head pose dataset; head pose estimation; facial landmark detection; convolution nerual network; MOTION;
D O I
10.1109/ICME.2019.00190
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Head pose estimation has attracted many research interest in recent years. With the advent of deep learning, it is possible to predict the head pose accurately from the RGB images without the help of facial landmarks or depth information. However, existing head pose datasets often lack large pose head images, which extremely limits the development of head pose estimation algorithms. In this paper, we build the large-scale head pose dataset (LHPD) including more than 140,000 images with the diverse and accurate head poses. The LHPD dataset includes the head images recorded from different shooting angles between the camera and the human body for the first time, which greatly expands the range of head pose compared to previous datasets. Therefore, the range of head pose can cover +/- 90. for each Euler angle. The accurate and reliable head pose annotation is labeled by the motion capture system and careful calibration procedures. We then propose a head pose estimation method through fine-tuning the ResNet on the LHPD dataset when using the Euclidean distance of quaternions as the loss function. The results show that our method achieves better performance than current state-of-the-art algorithms.
引用
收藏
页码:1084 / 1089
页数:6
相关论文
共 50 条
  • [1] DriveAHead - A Large-Scale Driver Head Pose Dataset
    Schwarz, Anke
    Haurilet, Monica
    Martinez, Manuel
    Stiefelhagen, Rainer
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1165 - 1174
  • [2] A Large-Scale Mouse Pose Dataset for Mouse Pose Estimation
    Sun, Jun
    Wu, Jing
    Liao, Xianghui
    Wang, Sijia
    Wang, Mantao
    SYMMETRY-BASEL, 2022, 14 (05):
  • [3] Large-scale annotation dataset for fetal head biometry in ultrasound images
    Alzubaidi, Mahmood
    Agus, Marco
    Makhlouf, Michel
    Anver, Fatima
    Alyafei, Khalid
    Househ, Mowafa
    DATA IN BRIEF, 2023, 51
  • [4] Large-scale annotation dataset for fetal head biometry in ultrasound images
    Alzubaidi, Mahmood
    Agus, Marco
    Makhlouf, Michel
    Anver, Fatima
    Alyafei, Khalid
    Househ, Mowafa
    Data in Brief, 2023, 51
  • [5] AutoPOSE: Large-scale Automotive Driver Head Pose and Gaze Dataset with Deep Head Orientation Baseline
    Selim, Mohamed
    Firintepe, Ahmet
    Pagani, Alain
    Stricker, Didier
    VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, : 599 - 606
  • [6] DD-Pose - A large-scale Driver Head Pose Benchmark
    Roth, Markus
    Gavrila, Dariu M.
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 927 - 934
  • [7] PACE: A Large-Scale Dataset with Pose Annotations in Cluttered Environments
    You, Yang
    Xiong, Kai
    Yang, Zhening
    Huang, Zhengxiang
    Zhou, Junwei
    Shi, Ruoxi
    Fang, Zhou
    Harley, Adam W.
    Guibas, Leonidas
    Lu, Cewu
    COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 : 473 - 489
  • [8] Large-scale multiview 3D hand pose dataset
    Gomez-Donoso, Francisco
    Orts-Escolano, Sergio
    Cazorla, Miguel
    IMAGE AND VISION COMPUTING, 2019, 81 : 25 - 33
  • [9] 2DHeadPose: A simple and effective annotation method for the head pose in RGB images and its dataset
    Wang, Yang
    Zhou, Wanlin
    Zhou, Jiakai
    NEURAL NETWORKS, 2023, 160 : 50 - 62
  • [10] JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and Tracking
    Vendrow, Edward
    Le, Duy Tho
    Cai, Jianfei
    Rezatofighi, Hamid
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4811 - 4820