Heart Rate and Oxygen Level Estimation from Facial Videos Using a Hybrid Deep Learning Model

被引:1
|
作者
Zheng, Yufeng [1 ]
机构
[1] Univ Mississippi, Med Ctr, Jackson, MS 38677 USA
关键词
Vital sign; Facial video; convolutional neural network (CNN); Convolutional long short-term memory (convLSTM); Video vision transformer (ViViT); Deep learning; Telehealth; NONCONTACT; FUSION;
D O I
10.1117/12.3013956
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vital signs can be inferred from facial videos for health monitoring remotely, while facial videos can be easily obtained through phone cameras, webcams, or surveillance systems. In this study, we propose a hybrid deep learning model to estimate heart rate (HR) and blood oxygen saturation level (SpO2) from facial videos. The hybrid model has a mixed network architecture consisting of convolutional neural network (CNN), convolutional long short-term memory (convLSTM), and video vision transformer (ViViT). Temporal resolution is emphasized in feature extraction since both HR and SpO2 are varying over time. A clip of video consists of a set of frame images within a time segment. CNN is performed with regard to each frame (e.g., time distributed), convLSTM and ViViT can be configured to process a sequence of frames. These high-resolution temporal features are combined to predict HR and SpO2, which are expected to capture these signal variations. Our vital video dataset is fairly large by including 891 subjects from difference races and ages. Facial detection and data normalization are performed in preprocessing. Our experiments show that the proposed hybrid model can predict HR and SpO2 accurately. In addition, those models can be extended to infer HR fluctuations, respiratory rates, and blood pressure variations from facial videos.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Contactless blood oxygen estimation from face videos: A multi-model fusion method based on deep learning
    Hu, Min
    Wu, Xia
    Wang, Xiaohua
    Xing, Yan
    An, Ning
    Shi, Piao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [32] A Novel Respiratory Rate Estimation Algorithm from Photoplethysmogram Using Deep Learning Model
    Chin, Wee Jian
    Kwan, Ban-Hoe
    Lim, Wei Yin
    Tee, Yee Kai
    Darmaraju, Shalini
    Liu, Haipeng
    Goh, Choon-Hian
    DIAGNOSTICS, 2024, 14 (03)
  • [33] RealSense = Real Heart Rate: Illumination Invariant Heart Rate Estimation from Videos
    Chen, Jie
    Chang, Zhuoqing
    Qiu, Qiang
    Li, Xiaobai
    Sapiro, Guillermo
    Bronstein, Alex
    Pietikainen, Matti
    2016 SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2016,
  • [34] Facial expression recognition of online learners from real-time videos using a novel deep learning model
    M. Jagadeesh
    B. Baranidharan
    Multimedia Systems, 2022, 28 : 2285 - 2305
  • [35] Facial expression recognition of online learners from real-time videos using a novel deep learning model
    Jagadeesh, M.
    Baranidharan, B.
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 2285 - 2305
  • [36] Hybrid Deep Learning Model for Endoscopic Lesion Detection and Classification Using Endoscopy Videos
    Ayyaz, M. Shahbaz
    Lali, Muhammad Ikram Ullah
    Hussain, Mubbashar
    Rauf, Hafiz Tayyab
    Alouffi, Bader
    Alyami, Hashem
    Wasti, Shahbaz
    DIAGNOSTICS, 2022, 12 (01)
  • [37] Multimodal Information Fusion Approach for Noncontact Heart Rate Estimation Using Facial Videos and Graph Convolutional Network
    Yue, Zijie
    Ding, Shuai
    Yang, Shanlin
    Wang, Linjie
    Li, Yinghui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [38] Hybrid-Deep Learning Model for Emotion Recognition Using Facial Expressions
    Verma, Garima
    Verma, Hemraj
    REVIEW OF SOCIONETWORK STRATEGIES, 2020, 14 (02): : 171 - 180
  • [39] Heart rate estimation using facial video: A review
    Hassan, M. A.
    Malik, A. S.
    Fofi, D.
    Saad, N.
    Karasfi, B.
    Ali, Y. S.
    Meriaudeau, F.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2017, 38 : 346 - 360
  • [40] Hybrid-Deep Learning Model for Emotion Recognition Using Facial Expressions
    Garima Verma
    Hemraj Verma
    The Review of Socionetwork Strategies, 2020, 14 : 171 - 180