50 items in total
- [3] CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition [J]. LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022: 6786 - 6793
- [4] EMID: An Emotional Aligned Dataset in Audio-Visual Modality [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT GENERATION AND EVALUATION, MCGE 2023: New Methods and Practice, 2023: 41 - 48
- [6] Emotional Audio-Visual Speech Synthesis Based on PAD [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): 570 - 582
- [7] An audio-visual distance for audio-visual speech vector quantization [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998: 523 - 528
- [9] Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech [J]. Alm, M. (magnus.alm@svt.ntnu.no), 1600, Acoustical Society of America (134):
- [10] AUDIO-VISUAL RECOGNITION OF OVERLAPPED SPEECH FOR THE LRS2 DATASET [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 6984 - 6988