共 50 条
- [31] Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 279 - 286
- [32] Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech Alm, M. (magnus.alm@svt.ntnu.no), 1600, Acoustical Society of America (134):
- [33] Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04): : 3001 - 3010
- [35] Robust Audio-Visual Speech Recognition Based on Hybrid Fusion 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7580 - 7586
- [37] FaceFilter: Audio-visual speech separation using still images INTERSPEECH 2020, 2020, : 3481 - 3485
- [38] Attention-Based Audio-Visual Fusion for Video Summarization NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
- [39] Multi-Stream Gated and Pyramidal Temporal Convolutional Neural Networks for Audio-Visual Speech Separation in Multi-Talker Environments INTERSPEECH 2021, 2021, : 1104 - 1108
- [40] Deep audio-visual speech separation based on facial motion INTERSPEECH 2021, 2021, : 3540 - 3544