共 50 条
- [42] Integration of Deep Bottleneck Features for Audio-Visual Speech Recognition 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 563 - 567
- [43] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
- [44] Audio-Visual Emotion Recognition using Gaussian Mixture Models for Face and Voice ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 250 - 257
- [45] Deep Reinforcement Learning for Audio-Visual Gaze Control 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1555 - 1562
- [46] Audio-Visual Attention Networks for Emotion Recognition AVSU'18: PROCEEDINGS OF THE 2018 WORKSHOP ON AUDIO-VISUAL SCENE UNDERSTANDING FOR IMMERSIVE MULTIMEDIA, 2018, : 27 - 32
- [48] Deep learning based multimodal emotion recognition using model-level fusion of audio–visual modalities Knowledge-Based Systems, 2022, 244
- [49] Audio-Visual Sentiment Analysis for Learning Emotional Arcs in Movies 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 829 - 834
- [50] To Join or Not to Join: A Study on the Impact of Joint or Unimodal Representation Learning on Audio-Visual Emotion Recognition 2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,