Audio-Visual Continuous Recognition of Emotional State in a Multi-User System Based on Personalized Representation of Facial Expressions and Voice

Cited by: 0
Authors
A. V. Savchenko
L. V. Savchenko
Affiliations
[1] HSE University
[2] Laboratory of Algorithms and Technologies for Network Analysis
Source
Pattern Recognition and Image Analysis, 2022, 32 (3)
Keywords
audio-visual emotion recognition; emotional state tracking; personalized facial expression recognition; speaker-dependent speech emotion recognition; fusion of audio and video classifiers
DOI
Not available
Pages: 665 - 671
Number of pages: 7
Related papers
39 in total
  • [1] Audio-Visual Continuous Recognition of Emotional State in a Multi-User System Based on Personalized Representation of Facial Expressions and Voice
    Savchenko, A. V.
    Savchenko, L. V.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (03) : 665 - 671
  • [2] Audio-Visual Isolated Words Recognition for Voice Dialogue System
    Chaloupka, Josef
    [J]. ANALYSIS OF VERBAL AND NONVERBAL COMMUNICATION AND ENACTMENT: THE PROCESSING ISSUES, 2011, 6800 : 88 - 94
  • [3] SpeechXRays: A User Recognition Platform based on voice Acoustics Analysis and Audio-visual Identity verification
    Spanakis, Emmanouil G.
    [J]. ERCIM NEWS, 2018, (115) : 49 - 50
  • [4] Continuous Phoneme Recognition based on Audio-Visual Modality Fusion
    Richter, Julius
    Liebold, Jeanine
    Gerkmann, Timo
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
  • [5] Audio-Visual Emotion Recognition Based on Facial Expression and Affective Speech
    Zhang, Shiqing
    Li, Lemin
    Zhao, Zhijin
    [J]. MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 46 - +
  • [6] Product HMMs for audio-visual continuous speech recognition using facial animation parameters
    Aleksic, PS
    Katsaggelos, AK
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL II, PROCEEDINGS, 2003, : 481 - 484
  • [7] Audio-Visual Voice Activity Detection Based on an Utterance State Transition Model
    Yoshida, Takami
    Nakadai, Kazuhiro
    [J]. ADVANCED ROBOTICS, 2012, 26 (10) : 1183 - 1201
  • [8] Audio-Visual Emotion Recognition System Using Multi-Modal Features
    Handa, Anand
    Agarwal, Rashi
    Kohli, Narendra
    [J]. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [9] Improved Decision Trees for Multi-stream HMM-based Audio-Visual Continuous Speech Recognition
    Huang, Jing
    Visweswariah, Karthik
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 228 - +
  • [10] DBN based multi-stream models for audio-visual speech recognition
    Gowdy, JN
    Subramanya, A
    Bartels, C
    Bilmes, J
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 993 - 996