50 entries in total
- [43] DEEP AUDIO-VISUAL SPEECH SEPARATION WITH ATTENTION MECHANISM [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 7314 - 7318
- [44] Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition [J]. INTERSPEECH 2019, 2019: 4090 - 4094
- [46] Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement [J]. 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022: 255 - 259
- [47] Audio-Visual Database for Spanish-Based Speech Recognition Systems [J]. ADVANCES IN SOFT COMPUTING, MICAI 2019, 2019, 11835 : 452 - 460
- [48] The Conversation: Deep Audio-Visual Speech Enhancement [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 3244 - 3248
- [49] Multimodal Learning Using 3D Audio-Visual Data for Audio-Visual Speech Recognition [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017: 40 - 43
- [50] Speech enhancement and recognition in meetings with an audio-visual sensor array [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): 2257 - 2269