Audio-visual based emotion recognition using tripled hidden Markov model

被引：0

作者：

Song, ML ^{[1
]}

Chen, C ^{[1
]}

You, MY ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Emotion recognition is one of the latest challenges in intelligent human/machine communication. Most of previous work on emotion recognition focused on extracting emotions from visual or audio information separately. A novel approach is presented in this paper to recognize the human emotion which uses both visual and audio from video clips. A tripled Hidden Markov Model is introduced to perform the recognition which allows the state asynchrony of he audio and visual observation sequences while preserving their natural correlation over time. The experimental results show that this approach outperforms only using visual or audio separately.

引用

页码：877 / 880

页数：4

共 50 条

[21] Method of speech recognition and speaker identification using audio-visual of polish speech and hidden Markov models
Kubanek, Mariusz
[J]. BIOMETRICS, COMPUTER SECURITY SYSTEMS AND ARTIFICIAL INTELLIGENCE APPLICATIONS, 2006, : 45 - 55
[22] Characteristics of the use of coupled hidden Markov models for audio-visual Polish speech recognition
Kubanek, M.
Bobulski, J.
Adrjanowicz, L.
[J]. BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2012, 60 (02) : 307 - 316
[23] Audio-visual sports highlights extraction using Coupled Hidden Markov Models
Ziyou Xiong
[J]. Pattern Analysis and Applications, 2005, 8 : 62 - 71
[24] Audio-visual sports highlights extraction using Coupled Hidden Markov Models
Xiong, ZY
[J]. PATTERN ANALYSIS AND APPLICATIONS, 2005, 8 (1-2) : 62 - 71
[25] Audio-Visual Emotion Recognition Based on Facial Expression and Affective Speech
Zhang, Shiqing
Li, Lemin
Zhao, Zhijin
[J]. MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 46 - +
[26] Metric Learning-Based Multimodal Audio-Visual Emotion Recognition
Ghaleb, Esam
Popa, Mirela
Asteriadis, Stylianos
[J]. IEEE MULTIMEDIA, 2020, 27 (01) : 37 - 48
[27] Audio-Visual Fusion Network Based on Conformer for Multimodal Emotion Recognition
Guo, Peini
Chen, Zhengyan
Li, Yidi
Liu, Hong
[J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 315 - 326
[28] Deep learning based multimodal emotion recognition using model-level fusion of audio-visual modalities
Middya, Asif Iqbal
Nag, Baibhav
Roy, Sarbani
[J]. KNOWLEDGE-BASED SYSTEMS, 2022, 244
[29] Deep learning based multimodal emotion recognition using model-level fusion of audio-visual modalities
Middya, Asif Iqbal
Nag, Baibhav
Roy, Sarbani
[J]. KNOWLEDGE-BASED SYSTEMS, 2022, 244
[30] Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
Islam, Md. Rabiul
Sobhan, Md. Abdus
[J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2014, 2014

← 1 2 3 4 5 →