A Deep Neural Network based Multimodal Video Recognition System for Caring

被引:0
|
作者
Yan, Chao [1 ]
Xu, Jiahua [1 ]
Klopfer, Bastian [1 ]
Nuernberger, Andreas [1 ]
机构
[1] Otto von Guericke Univ, Data & Knowledge Engn Grp, Fac Comp Sci, Magdeburg, Germany
关键词
Computer Vision; Deep Learning; Smart Surveillance; styling;
D O I
10.1109/ichms49158.2020.9209395
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Home caring usually refers to taking care of the elder, the young kids and the patients at home, and this depends on both caregivers and caring systems. The system which can provide sufficient and accurate information to doctors or caregivers without a delay will bring benefits for all the persons who need to be cared and allow doctor or care givers to take right action based on the information immediately. Available corresponding products in the market mainly are some smart home devices or some medical facilities based on electroencephalo-graph (EEG), electrocardiograph(ECG) or blood pressure check, however, these parameters cannot give doctors, caregivers or family members a direct feedback. To address the problem, this paper introduces a deep learning based design - a visual recognition system developed for clinical monitoring which can supervise both the emotions and gestures of patients at the same time and give responsible persons instant and direct feedback so that the right treatment will be taken by them. This product uses a Raspberry Pi computer with its camera as hardware and implementing several deep learning models to fulfill three main functions which are: Facial Recognition, Emotion Detection and Pose Estimation onto a portable device, which enhances the utilization of theapplication and this brings more possibilities. Compared to theavailable products, this application emphasizes monitoring per-sons via visual analysis and gives more direct feedback all thetime instead of traditional ways which are not always timely orpractical.
引用
收藏
页码:472 / 476
页数:5
相关论文
共 50 条
  • [1] Multimodal Emotion Recognition and State Analysis of Classroom Video and Audio Based on Deep Neural Network
    Li, Mingyong
    Liu, Mingyue
    Jiang, Zheng
    Zhao, Zongwei
    Zhang, Jiayan
    Ge, Mingyuan
    Duan, Huiming
    Wang, Yanxia
    JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
  • [2] Artificial Neural Network Based Multimodal Biometrics Recognition System
    Lathika, B. A.
    Devaraj, D.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 973 - 978
  • [3] Texture recognition system based on the Deep Neural Network
    Kapela, R.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2020, 68 (06) : 1503 - 1511
  • [4] Video-based face recognition based on deep convolutional neural network
    Zhai, Yilong
    He, Dongzhi
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 23 - 27
  • [5] Audio-Visual (Multimodal) Speech Recognition System Using Deep Neural Network
    Paulin, Hebsibah
    Milton, R. S.
    JanakiRaman, S.
    Chandraprabha, K.
    JOURNAL OF TESTING AND EVALUATION, 2019, 47 (06) : 3963 - 3974
  • [6] Video-based Disguise Face Recognition Based on Deep Spiking Neural Network
    Liu, Daqi
    Yue, Shigang
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 837 - 844
  • [7] Human Activity Recognition Based On Video Summarization And Deep Convolutional Neural Network
    Kushwaha, Arati
    Khare, Manish
    Bommisetty, Reddy Mounika
    Khare, Ashish
    Computer Journal, 1600, 67 (08): : 2601 - 2609
  • [8] Human Activity Recognition Based On Video Summarization And Deep Convolutional Neural Network
    Kushwaha, Arati
    Khare, Manish
    Bommisetty, Reddy Mounika
    Khare, Ashish
    COMPUTER JOURNAL, 2024,
  • [9] Video-based facial expression recognition using multimodal deep convolutional neural networks
    Pan X.-Z.
    Zhang S.-Q.
    Guo W.-P.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2019, 27 (04): : 963 - 970
  • [10] Multimodal Deep Neural Network with Image Sequence Features for Video Captioning
    Oura, Soichiro
    Matsukawa, Tetsu
    Suzuki, Einoshin
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,