共 50 条
- [2] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
- [3] Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19144 - 19152
- [4] DEEP MULTIMODAL LEARNING FOR AUDIO-VISUAL SPEECH RECOGNITION [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2130 - 2134
- [5] An audio-visual corpus for multimodal automatic speech recognition [J]. Journal of Intelligent Information Systems, 2017, 49 : 167 - 192
- [7] Data Augmentation for Audio-Visual Emotion Recognition with an Efficient Multimodal Conditional GAN [J]. APPLIED SCIENCES-BASEL, 2022, 12 (01):
- [8] Multimodal and Temporal Perception of Audio-visual Cues for Emotion Recognition [J]. 2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2019,
- [10] Multimodal Emotion Recognition using Physiological and Audio-Visual Features [J]. PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 946 - 951