ENCODING TEMPORAL INFORMATION FOR AUTOMATIC DEPRESSION RECOGNITION FROM FACIAL ANALYSIS

被引:0
|
作者
de Melo, Wheidima Carneiro [1 ]
Granger, Eric [2 ]
Lopez, Miguel Bordallo [1 ,3 ]
机构
[1] Univ Oulu, Ctr Machine Vis & Signal Anal CMVS, Oulu, Finland
[2] Ecole Technol Super, Dept Syst Engn, LIVIA, Montreal, PQ, Canada
[3] VTT Tech Res Ctr Finland, Espoo, Finland
基金
芬兰科学院;
关键词
Affective Computing; Depression Detection; Expression Recognition; Temporal Pooling; Two-stream Model;
D O I
10.1109/icassp40776.2020.9054375
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Depression is a mental illness that may be harmful to an individual's health. Using deep learning models to recognize the facial expressions of individuals captured in videos has shown promising results for automatic depression detection. Typically, depression levels are recognized using 2D-Convolutional Neural Networks (CNNs) that are trained to extract static features from video frames, which impairs the capture of dynamic spatio-temporal relations. As an alternative, 3D-CNNs may be employed to extract spatio-temporal features from short video clips, although the risk of overfitting increases due to the limited availability of labeled depression video data. To address these issues, we propose a novel temporal pooling method to capture and encode the spatio-temporal dynamic of video clips into an image map. This approach allows fine-tuning a pre-trained 2D CNN to model facial variations, and thereby improving the training process and model accuracy. Our proposed method is based on two-stream model that performs late fusion of appearance and dynamic information. Extensive experiments on two benchmark AVEC datasets indicate that the proposed method is efficient and outperforms the state-of-the-art schemes.
引用
收藏
页码:1080 / 1084
页数:5
相关论文
共 50 条
  • [41] AUTOMATIC FACIAL EXPRESSION RECOGNITION SYSTEM
    Balasubramani, A.
    Kalaivanan, K.
    Karpagalakshmi, R. C.
    Monikandan, R.
    [J]. ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 509 - 513
  • [42] Policing based on automatic facial recognition
    Guo, Zhilong
    Kennedy, Lewis
    [J]. ARTIFICIAL INTELLIGENCE AND LAW, 2023, 31 (02) : 397 - 443
  • [43] Automatic Facial Expression Recognition System
    Mliki, Hazar
    Fourati, Nesrine
    Smaoui, Souhail
    Hammami, Mohamed
    [J]. 2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [44] Review on automatic facial expression recognition
    Mase, Kenji
    [J]. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 1997, 51 (08): : 1136 - 1139
  • [45] AUTOMATIC ANALYSIS OF FACIAL ATTRACTIVENESS FROM VIDEO
    Kalayci, Sacide
    Ekenel, Hazim Kemal
    Gunes, Hatice
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4191 - 4195
  • [46] Automatic face analysis system based on face recognition and facial physiognomy
    Lee, Eung-Joo
    Kwon, Ki-Ryong
    [J]. ADVANCES IN HYBRID INFORMATION TECHNOLOGY, 2007, 4413 : 128 - 138
  • [47] Automatic Annotation of Corpora For Emotion Recognition Through Facial Expressions Analysis
    Diamantini, Claudia
    Mircoli, Alex
    Potena, Domenico
    Storti, Emanuele
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5650 - 5657
  • [48] Analysis of HMM temporal evolution for automatic speech recognition and verification
    Casar, Marta
    Fonollosa, Jose A. R.
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 359 - 366
  • [49] AUTOMATIC RECOGNITION OF VENTRICULAR ARRHYTHMIAS USING TEMPORAL ELECTROGRAM ANALYSIS
    PAUL, VE
    FARRELL, T
    GILL, J
    DAVIES, DW
    WARD, DE
    CAMM, AJ
    [J]. PACE-PACING AND CLINICAL ELECTROPHYSIOLOGY, 1991, 14 (08): : 1265 - 1273
  • [50] Facial Expression Recognition as markers of Depression
    Gue, Jia Xuan
    Chong, Chun Yong
    Lim, Mei Kuan
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 674 - 680