Robust Heart Rate Estimation With Spatial-Temporal Attention Network From Facial Videos

Cited by: 20
Authors
Hu, Min [1]
Qian, Fei [1]
Wang, Xiaohua [1]
He, Lei [2]
Guo, Dong [1]
Ren, Fuji [3]
Affiliations
[1] Hefei Univ Technol, Sch Comp & Informat, Anhui Prov Key Lab Affect Comp & Adv Intelligent, Hefei 230602, Peoples R China
[2] Hefei Univ Technol, Sch Math, Hefei 230602, Peoples R China
[3] Univ Tokushima, Grad Sch Adv Technol & Sci, Tokushima 7708502, Japan
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Videos; Heart rate; Facial features; Estimation; Data mining; Signal processing; Aggregation function; remote heart rate (HR) estimation; remote photoplethysmography (rPPG); spatial-temporal attention; spatial-temporal strip pooling; REMOTE PHOTOPLETHYSMOGRAPHY; NONCONTACT
DOI
10.1109/TCDS.2021.3062370
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
To address the highly redundant spatial information and motion noise that affect heart rate (HR) estimation from facial videos based on remote photoplethysmography (rPPG), this article proposes a novel HR estimation method based on a spatial-temporal attention model. First, to reduce redundant information and strengthen the associations across long-range video, the spatial-temporal facial features are extracted by a 2-D convolutional neural network (2DCNN) and a 3-D convolutional neural network (3DCNN), respectively. An aggregation function is adopted to incorporate the feature maps into short-segment spatial-temporal feature maps. Second, spatial-temporal strip pooling is designed within the spatial-temporal attention module to reduce head-movement noise. Then, through a two-part loss function, the model can focus more on the rPPG signal than on the interference. Extensive experiments on two public data sets verify the effectiveness of the model. The experimental results show that the proposed method performs significantly better than the state-of-the-art baselines: the mean absolute error is reduced by 11% on the PURE data set and by 25% on the COHFACE data set.
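As a reading aid for the reported metric, the following minimal Python sketch shows how a mean absolute error (MAE) and its relative reduction against a baseline are typically computed. The function names and all heart rate values are hypothetical, chosen only for illustration; they are not taken from the paper or from the PURE or COHFACE data sets.

```python
from typing import Sequence


def mean_absolute_error(predicted: Sequence[float], reference: Sequence[float]) -> float:
    """Average absolute difference between estimated and reference HR (bpm)."""
    if len(predicted) != len(reference) or len(predicted) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(predicted)


def relative_reduction(baseline_mae: float, new_mae: float) -> float:
    """Fractional MAE reduction of a new method relative to a baseline."""
    if baseline_mae <= 0:
        raise ValueError("baseline MAE must be positive")
    return (baseline_mae - new_mae) / baseline_mae


# Hypothetical values for illustration only; not results from the paper.
reference_hr = [60.0, 72.0, 80.0, 95.0]   # ground-truth HR in bpm
baseline_hr = [64.0, 68.0, 84.0, 91.0]    # hypothetical baseline estimates
proposed_hr = [63.0, 69.0, 83.0, 92.0]    # hypothetical improved estimates

baseline_mae = mean_absolute_error(baseline_hr, reference_hr)
proposed_mae = mean_absolute_error(proposed_hr, reference_hr)
reduction = relative_reduction(baseline_mae, proposed_mae)
print(f"baseline MAE={baseline_mae}, proposed MAE={proposed_mae}, reduction={reduction:.0%}")
```

With these made-up numbers the baseline errs by 4 bpm per sample and the improved estimates by 3 bpm, so the relative reduction is one quarter; the percentages in the abstract are assumed to be relative figures of this kind.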
Pages: 639 - 647
Number of pages: 9
Related papers (50 in total)
  • [1] Spatial-Temporal Attention Network for Depression Recognition from facial videos
    Pan, Yuchen
    Shang, Yuanyuan
    Liu, Tie
    Shao, Zhuhong
    Guo, Guodong
    Ding, Hui
    Hu, Qiang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [2] Robust Remote Heart Rate Estimation from Face Utilizing Spatial-temporal Attention
    Niu, Xuesong
    Zhao, Xingyuan
    Han, Hu
    Das, Abhijit
    Dantcheva, Antitza
    Shan, Shiguang
    Chen, Xilin
    [J]. 2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019 : 582 - 589
  • [3] rPPG-Based Heart Rate Estimation Using Spatial-Temporal Attention Network
    Hu, Min
    Guo, Dong
    Jiang, Mingxing
    Qian, Fei
    Wang, Xiaohua
    Ren, Fuji
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (04) : 1630 - 1641
  • [4] Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1347 - 1360
  • [5] Heart Rate Estimation from Facial Videos Based on Convolutional Neural Network
    Yang, Wen
    Li, Xiaoqi
    Zhang, Bin
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018 : 45 - 49
  • [6] Anti-jamming heart rate estimation using a spatial-temporal fusion network
    Wu, Chunlei
    Yuan, Ziyu
    Wan, Shaohua
    Wang, Leiquan
    Zhang, Weishan
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 216
  • [7] Robust Heart Rate Variability Measurement from Facial Videos
    Odinaev, Ismoil
    Wong, Kwan Long
    Chin, Jing Wei
    Goyal, Raghav
    Chan, Tsz Tai
    So, Richard H. Y.
    [J]. BIOENGINEERING-BASEL, 2023, 10 (07)
  • [8] Heart rate estimation network from facial videos using spatiotemporal feature image
    Jaiswal, Kokila Bharti
    Meenpal, T.
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [9] Information-Enhanced Network for Noncontact Heart Rate Estimation From Facial Videos
    Liu, Lili
    Xia, Zhaoqiang
    Zhang, Xiaobiao
    Peng, Jinye
    Feng, Xiaoyi
    Zhao, Guoying
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2136 - 2150
  • [10] MSDN: A Multistage Deep Network for Heart-Rate Estimation From Facial Videos
    Zhang, Xiaobiao
    Xia, Zhaoqiang
    Dai, Jing
    Liu, Lili
    Peng, Jinye
    Feng, Xiaoyi
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 15