MSDN: A Multistage Deep Network for Heart-Rate Estimation From Facial Videos

Cited by: 5
Authors
Zhang, Xiaobiao [1]
Xia, Zhaoqiang [1, 2]
Dai, Jing [3]
Liu, Lili [1]
Peng, Jinye [4]
Feng, Xiaoyi [1, 5]
Affiliations
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Innovat Ctr NPU Chongqing, Chongqing 401120, Peoples R China
[3] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
[4] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
[5] Res & Dev Inst NPU Shenzhen, Shenzhen 518057, Peoples R China
Keywords
Feature extraction; Estimation; Heart rate; Videos; Training; Band-pass filters; Skin; Feature extractor; heart rate (HR) estimation; interbeat interval (IBI); multistage deep network (MSDN); remote photoplethysmography (rPPG) generator; NONCONTACT; PPG; CNN;
DOI
10.1109/TIM.2023.3329095
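Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;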
Abstract
Noncontact heart-rate (HR) measurement is an important trend in clinical medicine. Recently, a variety of deep networks have been applied to estimate HR from facial videos. However, owing to limited data resources and poor parameter optimization, few existing models perform well in complicated scenarios, such as those with illumination changes, different skin tones, and facial motion. To address these challenges, this article proposes a novel multistage deep network (MSDN) that distributes the learnable parameters across different stages, reducing the difficulty of learning through multiple training steps. Specifically, the proposed network consists of three stages connected end to end. In the first stage, an HR-aware feature extractor uses a ConvNeXt backbone embedded with a newly designed bandpass filter to extract spatial-temporal features for determining HR changes. Moreover, pseudolabels are generated to make the features robust to variations in illumination, motion, and color. In the second stage, several modules, including singular value decomposition (SVD) pooling and enhanced difference convolution (EDC) modules, are designed and combined with a transformer encoder to construct a feature-compressed remote photoplethysmography (rPPG) generator. In the third stage, a newly designed HR estimator with an interbeat interval (IBI) analyzer and a 1-D filter performs HR estimation. Extensive experiments on three publicly available databases (VIPL-HR, COHFACE, and PURE), including ablation studies and comparisons with state-of-the-art (SOTA) methods, demonstrate the effectiveness of the proposed method.
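The abstract mentions bandpass filtering and interbeat-interval (IBI) analysis but gives no implementation details. The following is a minimal sketch of the general idea only, not the authors' network or their filter design: it assumes an rPPG-like 1-D signal is already available, uses a simple FFT-mask bandpass with an assumed 0.7 to 3.0 Hz heart-rate band, and derives HR from the mean spacing of detected peaks. The function names, the band limits, and the synthetic test signal are all hypothetical.

```python
import numpy as np

def bandpass_fft(signal, fs, low_hz=0.7, high_hz=3.0):
    """Zero out frequency components outside [low_hz, high_hz] (assumed HR band)."""
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    spectrum[(freqs < low_hz) | (freqs > high_hz)] = 0.0
    return np.fft.irfft(spectrum, n=len(signal))

def hr_from_ibi(signal, fs):
    """Estimate HR in beats per minute from the mean interbeat interval."""
    filtered = bandpass_fft(signal, fs)
    # A sample counts as a peak if it is positive and strictly greater
    # than both of its neighbours.
    mid = filtered[1:-1]
    is_peak = (mid > filtered[:-2]) & (mid > filtered[2:]) & (mid > 0)
    peak_idx = np.flatnonzero(is_peak) + 1
    if len(peak_idx) < 2:
        raise ValueError("need at least two peaks to compute an interbeat interval")
    ibi_seconds = np.diff(peak_idx) / fs  # time between consecutive peaks
    return 60.0 / ibi_seconds.mean()

# Hypothetical input: a synthetic 1.2 Hz (72 bpm) pulse with slow drift
# and noise, sampled at an assumed 30 frames per second.
fs = 30.0
t = np.arange(0, 20, 1.0 / fs)
rng = np.random.default_rng(0)
rppg = (np.sin(2 * np.pi * 1.2 * t)
        + 0.5 * np.sin(2 * np.pi * 0.1 * t)
        + 0.05 * rng.standard_normal(t.size))
estimated_hr = hr_from_ibi(rppg, fs)
print(f"estimated HR: {estimated_hr:.1f} bpm")
```

On real facial-video signals, motion and illumination changes would likely require a more careful filter and peak detector than this sketch provides.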
Pages: 1 / 15
Page count: 15
Related papers
50 in total
  • [21] Heart rate estimation from facial videos using nonlinear mode decomposition and improved consistency check
    Halil Demirezen
    Cigdem Eroglu Erdem
    Signal, Image and Video Processing, 2021, 15 : 1415 - 1423
  • [22] Heart rate estimation from facial videos using nonlinear mode decomposition and improved consistency check
    Demirezen, Halil
    Eroglu Erdem, Cigdem
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1415 - 1423
  • [23] ON LOW HEART-RATE RESPONSE TO MULTISTAGE EXERCISE TEST
    TAKADA, K
    FUJINAMI, T
    OKUDA, N
    MOROZUMI, K
    HOKIMOTO, S
    OUHASHI, N
    NAKAYAMA, K
    OKAMOTO, M
    OKUTANI, H
    JAPANESE CIRCULATION JOURNAL-ENGLISH EDITION, 1981, 45 (08) : 937 - 938
  • [24] RealSense = Real Heart Rate: Illumination Invariant Heart Rate Estimation from Videos
    Chen, Jie
    Chang, Zhuoqing
    Qiu, Qiang
    Li, Xiaobai
    Sapiro, Guillermo
    Bronstein, Alex
    Pietikainen, Matti
    2016 SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2016
  • [25] EXERCISE HEART-RATE RESPONSE TO FACIAL COOLING
    RIGGS, CE
    JOHNSON, DJ
    KONOPKA, BJ
    KILGOUR, RD
    EUROPEAN JOURNAL OF APPLIED PHYSIOLOGY AND OCCUPATIONAL PHYSIOLOGY, 1981, 47 (04) : 323 - 330
  • [26] Serial Fusion of Eulerian and Lagrangian Approaches for Accurate Heart-rate Estimation using Face Videos
    Gupta, Puneet
    Bhowmick, Brojeshwar
    Pal, Arpan
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017 : 2834 - 2837
  • [27] PPGnet: Deep Network for Device Independent Heart Rate Estimation from Photoplethysmogram
    Shyam, A.
    Ravichandran, Vignesh
    Preejith, S. P.
    Joseph, Jayaraj
    Sivaprakasam, Mohanasankar
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019 : 1899 - 1902
  • [28] Multimodal Heartbeat Rate Estimation from the Fusion of Facial RGB and Thermal Videos
    Johansen, Anders S.
    Henriksen, Jesper W.
    Haque, Mohammad A.
    Jahromi, Mohammad Naser Sabet
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [29] A novel approach for contactless heart rate monitoring from pet facial videos
    Hu, Renjie
    Gao, Yu
    Peng, Guoying
    Yang, Hongyu
    Zhang, Jiajin
    FRONTIERS IN VETERINARY SCIENCE, 2024, 11
  • [30] ROBUST ADAPTIVE HEART-RATE MONITORING USING FACE VIDEOS
    Gupta, Puneet
    Bhowmik, Brojeshwar
    Pal, Arpan
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018 : 530 - 538