EasyGaze3D: Towards Effective and Flexible 3D Gaze Estimation from a Single RGB Camera

被引:0
|
作者
Li, Jinkai [1 ]
Yang, Jianxin [1 ]
Liu, Yuxuan [1 ]
Li, Zhen [2 ]
Yang, Guang-Zhong [1 ]
Guo, Yao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Med Robot, Sch Biomed Engn, Shanghai, Peoples R China
[2] Chinese Univ Hong Kong Shenzhen, Shenzhen, Peoples R China
关键词
D O I
10.1109/IROS55552.2023.10342361
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Eye gaze can convey rich information of human intentions, which enables the social robots to comprehend the cognition and behavior of human targets. However, the existing 3D gaze estimation methods generally have high requirements either on the dedicated hardware or the quantity and quality of training databases, which largely limits their practical application values. This paper proposes EasyGaze3D, an effective 3D gaze estimation framework using a single RGB camera. First, the framework detects the 2D facial landmarks and recovers the 3D facial shape from the input image, and derives the required camera parameters with these features. Then, without loss of generality, the gaze direction can be regarded as the vector pointing from the eyeball center to the pupil center, which are derived respectively from the detected facial landmarks and the spherical fitting performed on the recovered 3D facial shape. Besides, we propose a flexible yet efficient calibration module, namely Easy-Cali, for deriving the subject-specific 3D facial shape and eyeball centers. The features calibrated by Easy-Cali can further boost the performance of EasyGaze3D. Experimental results show that our proposed method, being plug-and-play and without the need of training on large-scale dataset, can achieve superior performance against the existing methods based on deep models.
引用
收藏
页码:6537 / 6543
页数:7
相关论文
共 50 条
  • [21] 3D gaze estimation and interaction
    Ki, Jeongseok
    Kwon, Yong-Moo
    2008 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2008, : 353 - 356
  • [22] Evaluation of RGB-D Multi-Camera Pose Estimation for 3D Reconstruction
    de Medeiros Esper, Ian
    Smolkin, Oleh
    Manko, Maksym
    Popov, Anton
    From, Pal Johan
    Mason, Alex
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [23] 3D Hand Posture Tracking with Depth Gradient Estimation on a RGB-D Camera
    Lin, Jan-Cheng
    Huang, Cheng-Ming
    2013 IEEE 17TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2013, : 109 - 110
  • [24] Single camera 3D
    Lunazzi, Jose J.
    REVISTA BRASILEIRA DE ENSINO DE FISICA, 2011, 33 (02):
  • [25] A practical framework of multi-person 3D human pose estimation with a single RGB camera
    Ma, Le
    Lian, Sen
    Wang, Shandong
    Meng, Weiliang
    Xiao, Jun
    Zhang, Xiaopeng
    2021 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2021), 2021, : 420 - 421
  • [26] Model-Based 3D Gaze Estimation Using a TOF Camera
    Shen, Kuanxin
    Li, Yingshun
    Guo, Zhannan
    Gao, Jintao
    Wu, Yingjian
    SENSORS, 2024, 24 (04)
  • [27] 3D Visual SLAM using RGB-D Camera
    Krerngkamjornkit, Rapee
    Simic, Milan
    SMART DIGITAL FUTURES 2014, 2014, 262 : 533 - 544
  • [28] Camera distance helps 3D hand pose estimated from a single RGB image
    Cui, Yuan
    Li, Moran
    Gao, Yuan
    Gao, Changxin
    Wu, Fan
    Wen, Hao
    Li, Jiwei
    Sang, Nong
    GRAPHICAL MODELS, 2023, 127
  • [29] Highly Accurate and Fully Automatic 3D Head Pose Estimation and Eye Gaze Estimation Using RGB-D Sensors and 3D Morphable Models
    Ghiass, Reza Shoja
    Arandjelovic, Ognjen
    Laurendeau, Denis
    SENSORS, 2018, 18 (12)
  • [30] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    NEUROCOMPUTING, 2022, 474 : 25 - 36