共 50 条
- [1] EMID: An Emotional Aligned Dataset in Audio-Visual Modality [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT GENERATION AND EVALUATION, MCGE 2023: New Methods and Practice, 2023, : 41 - 48
- [2] Emotional Audio-Visual Speech Synthesis Based on PAD [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 570 - 582
- [3] A Cantonese Audio-Visual Emotional Speech (CAVES) dataset [J]. Behavior Research Methods, 2024, 56 (5) : 5264 - 5278
- [4] An audio-visual distance for audio-visual speech vector quantization [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528
- [7] MODALITY ATTENTION FOR END-TO-END AUDIO-VISUAL SPEECH RECOGNITION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6565 - 6569
- [8] Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04): : 3001 - 3010
- [9] An audio-visual speech recognition with a new mandarin audio-visual database [J]. INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
- [10] Expressive audio-visual speech [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2004, 15 (3-4) : 297 - 304