Multimodal Information Fusion Approach for Noncontact Heart Rate Estimation Using Facial Videos and Graph Convolutional Network

被引:9
|
作者
Yue, Zijie [1 ]
Ding, Shuai [1 ]
Yang, Shanlin [1 ]
Wang, Linjie [2 ]
Li, Yinghui [2 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Anhui, Peoples R China
[2] Chinese Astronaut Res & Training Ctr, State Key Lab Space Med Fundamentals & Applicat, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Heart rate; Estimation; Videos; Spatiotemporal phenomena; Feature extraction; Blood; Skin; Attention mechanism; deep learning; Graph convolution network (GCN); multimodal information fusion; noncontact heart rate (HR) estimation; DIAGNOSIS;
D O I
10.1109/TIM.2021.3129498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Heart rate (HR) is a critical signal for reflecting human physical and mental conditions, and it is beneficial for diagnosing neurological and cardiovascular diseases due to its excellent accessibility. However, traditional HR measurement devices have limited usability and convenience. Recent studies have shown that the optical absorption variation of human skin due to blood volume variation in cardiac cycles can be acquired from facial videos and used to estimate HR in a noncontact manner. However, the advanced noncontact HR estimation approaches are based on a single HR information source, resulting in unsatisfactory estimation results due to noise corruption and insufficient information. To address these problems, this article proposes a multimodal information fusion framework for noncontact HR estimation. First, feature representation maps are used to effectively extract periodic signals from facial visible-light and thermal infrared videos. Then, a temporal-information-aware HR feature extraction network (THR-Net) for encoding discriminative spatiotemporal information from the representation maps is presented. Finally, based on a graph convolution network (GCN), an information fusion model is proposed for feature integration and HR estimation. Experimental and evaluation results of five different metrics on two datasets show that the proposed approach outperforms the state-of-the-art approaches. This article demonstrates the advantage of multimodal information fusion for noncontact HR estimation.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Information-Enhanced Network for Noncontact Heart Rate Estimation From Facial Videos
    Liu, Lili
    Xia, Zhaoqiang
    Zhang, Xiaobiao
    Peng, Jinye
    Feng, Xiaoyi
    Zhao, Guoying
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2136 - 2150
  • [2] Heart Rate Estimation from Facial Videos Based on Convolutional Neural Network
    Yang, Wen
    Li, Xiaoqi
    Zhang, Bin
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 45 - 49
  • [3] Multimodal Heartbeat Rate Estimation from the Fusion of Facial RGB and Thermal Videos
    Johansen, Anders S.
    Henriksen, Jesper W.
    Haque, Mohammad A.
    Jahromi, Mohammad Naser Sabet
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [4] Heart Rate Estimation From Facial Videos Using a Spatiotemporal Representation With Convolutional Neural Networks
    Song, Rencheng
    Zhang, Senle
    Li, Chang
    Zhang, Yunfei
    Cheng, Juan
    Chen, Xun
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (10) : 7411 - 7421
  • [5] Heart rate estimation network from facial videos using spatiotemporal feature image
    Jaiswal, Kokila Bharti
    Meenpal, T.
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [6] Supervised Learning Approach to Remote Heart Rate Estimation from Facial Videos
    Osman, Ahmed
    Turcot, Jay
    El Kaliouby, Rana
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [7] MSDN: A Multistage Deep Network for Heart-Rate Estimation From Facial Videos
    Zhang, Xiaobiao
    Xia, Zhaoqiang
    Dai, Jing
    Liu, Lili
    Peng, Jinye
    Feng, Xiaoyi
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 15
  • [8] Person re-identification in indoor videos by information fusion using Graph Convolutional Networks
    Soni, Komal
    Dogra, Debi Prosad
    Sekh, Arif Ahmed
    Kar, Samarjit
    Choi, Heeseung
    Kim, Ig-Jae
    [J]. Expert Systems with Applications, 2022, 210
  • [9] Heart Rate Estimation From Facial Videos for Depression Analysis
    Mustafa, Aamir
    Bhatia, Shalini
    Hayat, Munawar
    Goecke, Roland
    [J]. 2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 498 - 503
  • [10] Web Page Information Extraction Service Based on Graph Convolutional Neural Network and Multimodal Data Fusion
    Zhang, Mingzhu
    Yang, Zhongguo
    Ali, Sikandar
    Ding, Weilong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021, 2021, : 681 - 687